Interface ScalingPolicyMetric.Builder
-
- All Superinterfaces:
Buildable
,CopyableBuilder<ScalingPolicyMetric.Builder,ScalingPolicyMetric>
,SdkBuilder<ScalingPolicyMetric.Builder,ScalingPolicyMetric>
,SdkPojo
- Enclosing class:
- ScalingPolicyMetric
public static interface ScalingPolicyMetric.Builder extends SdkPojo, CopyableBuilder<ScalingPolicyMetric.Builder,ScalingPolicyMetric>
-
-
Method Summary
All Methods Instance Methods Abstract Methods Modifier and Type Method Description ScalingPolicyMetric.Builder
invocationsPerInstance(Integer invocationsPerInstance)
The number of invocations sent to a model, normalized byInstanceCount
in each ProductionVariant.ScalingPolicyMetric.Builder
modelLatency(Integer modelLatency)
The interval of time taken by a model to respond as viewed from SageMaker.-
Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
copy
-
Methods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
applyMutation, build
-
Methods inherited from interface software.amazon.awssdk.core.SdkPojo
equalsBySdkFields, sdkFields
-
-
-
-
Method Detail
-
invocationsPerInstance
ScalingPolicyMetric.Builder invocationsPerInstance(Integer invocationsPerInstance)
The number of invocations sent to a model, normalized by
InstanceCount
in each ProductionVariant.1/numberOfInstances
is sent as the value on each request, wherenumberOfInstances
is the number of active instances for the ProductionVariant behind the endpoint at the time of the request.- Parameters:
invocationsPerInstance
- The number of invocations sent to a model, normalized byInstanceCount
in each ProductionVariant.1/numberOfInstances
is sent as the value on each request, wherenumberOfInstances
is the number of active instances for the ProductionVariant behind the endpoint at the time of the request.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
modelLatency
ScalingPolicyMetric.Builder modelLatency(Integer modelLatency)
The interval of time taken by a model to respond as viewed from SageMaker. This interval includes the local communication times taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
- Parameters:
modelLatency
- The interval of time taken by a model to respond as viewed from SageMaker. This interval includes the local communication times taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
-