- generate() - Method in class ai.djl.serving.wlm.WorkerIdGenerator
-
Generates a new worker ID.
- getBatchSize() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the configured batch size.
- getBegin() - Method in class ai.djl.serving.wlm.Job
-
Returns the job begin time.
- getDefaultWorkers(NDManager, String, int) - Method in class ai.djl.serving.wlm.util.WlmConfigManager
-
Returns the default number of workers for a newly registered model.
- getFuture() - Method in class ai.djl.serving.wlm.util.WorkerJob
-
Returns the future for the job.
- getGpuId() - Method in class ai.djl.serving.wlm.WorkerThread
-
Returns the GPU ID used by the thread.
- getInput() - Method in class ai.djl.serving.wlm.Job
-
Returns the input data.
- getInstance() - Static method in class ai.djl.serving.wlm.util.WlmConfigManager
-
Returns the singleton ConfigManager instance.
- getJob() - Method in class ai.djl.serving.wlm.util.WorkerJob
-
- getJobQueue() - Method in class ai.djl.serving.wlm.WorkLoadManager.WorkerPool
-
Returns the JobQueue for this model.
- getMaxBatchDelay() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the maximum delay in milliseconds to aggregate a batch.
- getMaxIdleTime() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the configured maxIdleTime of workers.
- getMaxWorkers() - Method in class ai.djl.serving.wlm.WorkLoadManager.WorkerPool
-
Returns the maximum number of workers for a model.
- getMinWorkers() - Method in class ai.djl.serving.wlm.WorkLoadManager.WorkerPool
-
Returns the minimum number of workers for a model.
- getModel() - Method in class ai.djl.serving.wlm.Job
-
Returns the model associated with this job.
- getModel() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the loaded ZooModel.
- getModelDir() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the model cache directory.
- getModelName() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the model name.
- getNumRunningWorkers(ModelInfo) - Method in class ai.djl.serving.wlm.WorkLoadManager
-
Returns the number of running workers of a model.
- getQueueLength(ModelInfo) - Method in class ai.djl.serving.wlm.WorkLoadManager
-
Returns the current number of requests in the queue.
- getQueueSize() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the configured size of the worker queue.
- getScheduled() - Method in class ai.djl.serving.wlm.Job
-
Returns the job scheduled time.
- getStartTime() - Method in class ai.djl.serving.wlm.WorkerThread
-
Returns the thread start time.
- getState() - Method in class ai.djl.serving.wlm.WorkerThread
-
Returns the worker state.
- getVersion() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the model version.
- getWorkerId() - Method in class ai.djl.serving.wlm.WorkerThread
-
Returns the worker thread ID.
- getWorkerPoolForModel(ModelInfo) - Method in class ai.djl.serving.wlm.WorkLoadManager
-
- getWorkers(ModelInfo) - Method in class ai.djl.serving.wlm.WorkLoadManager
-
Returns the workers for the specific model.
- getWorkers() - Method in class ai.djl.serving.wlm.WorkLoadManager.WorkerPool
-
Returns a list of worker threads.