Interface AsyncInferenceClientConfig.Builder

    • Method Detail

      • maxConcurrentInvocationsPerInstance

        AsyncInferenceClientConfig.Builder maxConcurrentInvocationsPerInstance(Integer maxConcurrentInvocationsPerInstance)

        The maximum number of concurrent requests sent by the SageMaker client to the model container. If no value is provided, SageMaker chooses an optimal value.

        Parameters:
        maxConcurrentInvocationsPerInstance - The maximum number of concurrent requests sent by the SageMaker client to the model container. If no value is provided, SageMaker chooses an optimal value.
        Returns:
        A reference to this builder, so that method calls can be chained together.
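
        The setter's contract described above (an optional Integer, with the builder returned for chaining) can be sketched with a small stand-in class. This is an illustrative miniature, not the SDK implementation: the class name ClientConfigSketch, its builder() and build() methods, and the getter are assumptions made for the example. Only the setter name and its Integer parameter come from this page. Leaving the field null stands in for "no value is provided".

```java
// Hypothetical stand-in for illustration only; NOT the AWS SDK class.
public class ClientConfigSketch {
    // null models "no value provided", in which case SageMaker would choose a value.
    private final Integer maxConcurrentInvocationsPerInstance;

    private ClientConfigSketch(Builder builder) {
        this.maxConcurrentInvocationsPerInstance = builder.maxConcurrentInvocationsPerInstance;
    }

    // Assumed getter, named after the setter for symmetry.
    public Integer maxConcurrentInvocationsPerInstance() {
        return maxConcurrentInvocationsPerInstance;
    }

    // Assumed factory method for obtaining a builder.
    public static Builder builder() {
        return new Builder();
    }

    public static final class Builder {
        private Integer maxConcurrentInvocationsPerInstance;

        // Mirrors the documented signature: stores the value and returns
        // this builder so that calls can be chained.
        public Builder maxConcurrentInvocationsPerInstance(Integer maxConcurrentInvocationsPerInstance) {
            this.maxConcurrentInvocationsPerInstance = maxConcurrentInvocationsPerInstance;
            return this;
        }

        public ClientConfigSketch build() {
            return new ClientConfigSketch(this);
        }
    }

    public static void main(String[] args) {
        // Explicit cap of 4 concurrent requests per instance.
        ClientConfigSketch capped = ClientConfigSketch.builder()
                .maxConcurrentInvocationsPerInstance(4)
                .build();

        // Setter never called: the field stays null (unset).
        ClientConfigSketch unset = ClientConfigSketch.builder().build();

        System.out.println(capped.maxConcurrentInvocationsPerInstance());
        System.out.println(unset.maxConcurrentInvocationsPerInstance());
    }
}
```

        Using a boxed Integer rather than a primitive int is what lets the builder distinguish "unset" from any concrete limit, which matches the documented behavior when no value is provided.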