DJL Serving 0.26.0 - wlm

This document is the API specification for the DJL Serving WorkLoadManager.

This module provides the worker and thread management for a high-performance inference server.

Packages
Package
Description
Contains the model server backend which manages worker threads and executes jobs on models.
Contains utilities to support the WorkLoadManager.