Overview (DJL Serving 0.26.0

This document is the API specification for the DJL Serving WorkLoadManager.

This module provides the worker and thread management for a high-performance inference server. See here for more details.

Packages

Package

Description

Contains the model server backend which manages worker threads and executes jobs on models.

Contains utilities to support the WorkLoadManager.

DJL Serving 0.26.0 - wlm