Class InferenceComponentRollingUpdatePolicy
- java.lang.Object
- software.amazon.awssdk.services.sagemaker.model.InferenceComponentRollingUpdatePolicy
- All Implemented Interfaces: Serializable, SdkPojo, ToCopyableBuilder<InferenceComponentRollingUpdatePolicy.Builder,InferenceComponentRollingUpdatePolicy>
@Generated("software.amazon.awssdk:codegen")
public final class InferenceComponentRollingUpdatePolicy extends Object implements SdkPojo, Serializable, ToCopyableBuilder<InferenceComponentRollingUpdatePolicy.Builder,InferenceComponentRollingUpdatePolicy>
Specifies a rolling deployment strategy for updating a SageMaker AI inference component.
- See Also:
- Serialized Form
-
Nested Class Summary
Nested Classes
Modifier and Type: static interface
Class: InferenceComponentRollingUpdatePolicy.Builder
-
Method Summary
-
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
-
Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Detail
-
maximumBatchSize
public final InferenceComponentCapacitySize maximumBatchSize()
The batch size for each rolling step in the deployment process. For each step, SageMaker AI provisions capacity on the new endpoint fleet, routes traffic to that fleet, and terminates capacity on the old endpoint fleet. The value must be between 5% and 50% of the copy count of the inference component.
- Returns:
- The batch size for each rolling step in the deployment process. For each step, SageMaker AI provisions capacity on the new endpoint fleet, routes traffic to that fleet, and terminates capacity on the old endpoint fleet. The value must be between 5% and 50% of the copy count of the inference component.
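The 5% to 50% constraint can be checked on the client before a request is sent. The following is a minimal sketch, not part of the AWS SDK: the class `RollingBatchSketch` and its methods are hypothetical. It assumes the batch size is expressed as a number of copies, that both bounds are inclusive, and that each rolling step replaces one full batch. How the service itself rounds or enforces the bounds is not stated in this reference.

```java
// Hypothetical helper, not part of the AWS SDK. It illustrates the documented
// 5%-50% window for a batch size expressed as a number of copies.
final class RollingBatchSketch {

    private RollingBatchSketch() {
    }

    /**
     * Returns true when batchCopies lies between 5% and 50% of copyCount.
     * Treating both bounds as inclusive is an assumption.
     */
    static boolean isWithinAllowedRange(int batchCopies, int copyCount) {
        if (batchCopies <= 0 || copyCount <= 0) {
            return false;
        }
        // Compare scaled integers to avoid floating-point rounding.
        long scaled = 100L * batchCopies;
        return scaled >= 5L * copyCount && scaled <= 50L * copyCount;
    }

    /**
     * Number of rolling steps needed to replace copyCount copies when each
     * step handles at most batchCopies copies (ceiling division).
     */
    static int rollingSteps(int batchCopies, int copyCount) {
        if (batchCopies <= 0 || copyCount <= 0) {
            throw new IllegalArgumentException("batchCopies and copyCount must be positive");
        }
        return (copyCount + batchCopies - 1) / batchCopies;
    }
}
```

Under these assumptions, a component with 20 copies and a batch size of 5 copies passes the range check and is replaced in 4 steps.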
-
waitIntervalInSeconds
public final Integer waitIntervalInSeconds()
The length of the baking period, during which SageMaker AI monitors alarms for each batch on the new fleet.
- Returns:
- The length of the baking period, during which SageMaker AI monitors alarms for each batch on the new fleet.
-
maximumExecutionTimeoutInSeconds
public final Integer maximumExecutionTimeoutInSeconds()
The time limit for the total deployment. Exceeding this limit causes a timeout.
- Returns:
- The time limit for the total deployment. Exceeding this limit causes a timeout.
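Because the timeout covers the whole deployment while the wait interval applies to each batch, the two values need to be chosen together. The sketch below is illustrative only: `RollingTimeoutSketch` and its methods are hypothetical, and it assumes batches bake one after another for the full wait interval. It ignores provisioning and traffic-shifting time, so the result is a lower bound rather than an estimate of the real duration.

```java
// Hypothetical helper, not part of the AWS SDK. It gives a lower bound on
// deployment time, assuming batches bake sequentially for the full interval.
final class RollingTimeoutSketch {

    private RollingTimeoutSketch() {
    }

    /** Total baking time across all steps, in seconds. */
    static long minimumBakingSeconds(int steps, int waitIntervalInSeconds) {
        if (steps <= 0 || waitIntervalInSeconds < 0) {
            throw new IllegalArgumentException("steps must be positive and the interval non-negative");
        }
        // Widen to long so large inputs cannot overflow int arithmetic.
        return (long) steps * waitIntervalInSeconds;
    }

    /**
     * True when baking alone finishes strictly before the timeout. The strict
     * comparison leaves room for the provisioning time this sketch ignores.
     */
    static boolean bakingFitsWithinTimeout(int steps,
                                           int waitIntervalInSeconds,
                                           int maximumExecutionTimeoutInSeconds) {
        return minimumBakingSeconds(steps, waitIntervalInSeconds) < maximumExecutionTimeoutInSeconds;
    }
}
```

For example, 4 steps with a 120-second wait interval need at least 480 seconds of baking, which fits within a 3600-second timeout; 4 steps with a 900-second interval do not.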
-
rollbackMaximumBatchSize
public final InferenceComponentCapacitySize rollbackMaximumBatchSize()
The batch size for a rollback to the old endpoint fleet. If this field is absent, the value is set to the default, which is 100% of the total capacity. When the default is used, SageMaker AI provisions the entire capacity of the old fleet at once during rollback.
- Returns:
- The batch size for a rollback to the old endpoint fleet. If this field is absent, the value is set to the default, which is 100% of the total capacity. When the default is used, SageMaker AI provisions the entire capacity of the old fleet at once during rollback.
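The default behavior described above can be sketched as follows. This is a simplified model, not SDK code: `RollbackBatchSketch` is hypothetical, and representing the rollback batch size as a plain percentage (with `null` standing in for an absent field) is an assumption made for illustration.

```java
// Hypothetical helper, not part of the AWS SDK. It models the rollback batch
// size as a percentage of total capacity, with null meaning "field absent".
final class RollbackBatchSketch {

    /** Default from the description above: 100% of the total capacity. */
    static final int DEFAULT_ROLLBACK_PERCENT = 100;

    private RollbackBatchSketch() {
    }

    /** Resolves an absent (null) setting to the documented default. */
    static int effectiveRollbackPercent(Integer configuredPercent) {
        return configuredPercent == null ? DEFAULT_ROLLBACK_PERCENT : configuredPercent;
    }

    /** Number of rollback steps needed to restore 100% of capacity. */
    static int rollbackSteps(Integer configuredPercent) {
        int percent = effectiveRollbackPercent(configuredPercent);
        if (percent <= 0 || percent > 100) {
            throw new IllegalArgumentException("percent must be in the range 1..100");
        }
        return (100 + percent - 1) / percent; // ceiling division
    }
}
```

With the field absent, the rollback completes in a single step because the entire old fleet is provisioned at once; a 25% batch size would take 4 steps in this model.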
-
toBuilder
public InferenceComponentRollingUpdatePolicy.Builder toBuilder()
- Specified by: toBuilder in interface ToCopyableBuilder<InferenceComponentRollingUpdatePolicy.Builder,InferenceComponentRollingUpdatePolicy>
-
builder
public static InferenceComponentRollingUpdatePolicy.Builder builder()
-
serializableBuilderClass
public static Class<? extends InferenceComponentRollingUpdatePolicy.Builder> serializableBuilderClass()
-
equalsBySdkFields
public final boolean equalsBySdkFields(Object obj)
- Specified by: equalsBySdkFields in interface SdkPojo
-
toString
public final String toString()
Returns a string representation of this object. This is useful for testing and debugging. Sensitive data will be redacted from this string using a placeholder value.
-
sdkFieldNameToField
public final Map<String,SdkField<?>> sdkFieldNameToField()
- Specified by: sdkFieldNameToField in interface SdkPojo
-