DJL Serving 0.26.0 - wlm
This document is the API specification for the DJL Serving WorkLoadManager.
This module provides the worker and thread management for a high-performance inference server. See here for more details.
Package
Description
Contains the model server backend which manages worker threads and executes jobs on models.
Contains utilities to support the
WorkLoadManager
.