@Generated(value="software.amazon.awssdk:codegen") public final class ProductionVariant extends Object implements SdkPojo, Serializable, ToCopyableBuilder<ProductionVariant.Builder,ProductionVariant>
Identifies a model that you want to host and the resources chosen to deploy for hosting it. If you are deploying multiple models, tell SageMaker how to distribute traffic among the models by specifying variant weights.
Modifier and Type | Class and Description |
---|---|
static interface |
ProductionVariant.Builder |
Modifier and Type | Method and Description |
---|---|
ProductionVariantAcceleratorType |
acceleratorType()
The size of the Elastic Inference (EI) instance to use for the production variant.
|
String |
acceleratorTypeAsString()
The size of the Elastic Inference (EI) instance to use for the production variant.
|
static ProductionVariant.Builder |
builder() |
ProductionVariantCoreDumpConfig |
coreDumpConfig()
Specifies configuration for a core dump from the model container when the process crashes.
|
boolean |
equals(Object obj) |
boolean |
equalsBySdkFields(Object obj) |
<T> Optional<T> |
getValueForField(String fieldName,
Class<T> clazz) |
int |
hashCode() |
Integer |
initialInstanceCount()
Number of instances to launch initially.
|
Float |
initialVariantWeight()
Determines initial traffic distribution among all of the models that you specify in the endpoint configuration.
|
ProductionVariantInstanceType |
instanceType()
The ML compute instance type.
|
String |
instanceTypeAsString()
The ML compute instance type.
|
String |
modelName()
The name of the model that you want to host.
|
List<SdkField<?>> |
sdkFields() |
static Class<? extends ProductionVariant.Builder> |
serializableBuilderClass() |
ProductionVariantServerlessConfig |
serverlessConfig()
The serverless configuration for an endpoint.
|
ProductionVariant.Builder |
toBuilder() |
String |
toString()
Returns a string representation of this object.
|
String |
variantName()
The name of the production variant.
|
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
copy
public final String variantName()
The name of the production variant.
public final String modelName()
The name of the model that you want to host. This is the name that you specified when creating the model.
public final Integer initialInstanceCount()
Number of instances to launch initially.
public final ProductionVariantInstanceType instanceType()
The ML compute instance type.
If the service returns an enum value that is not available in the current SDK version, instanceType
will
return ProductionVariantInstanceType.UNKNOWN_TO_SDK_VERSION
. The raw value returned by the service is
available from instanceTypeAsString()
.
ProductionVariantInstanceType
public final String instanceTypeAsString()
The ML compute instance type.
If the service returns an enum value that is not available in the current SDK version, instanceType
will
return ProductionVariantInstanceType.UNKNOWN_TO_SDK_VERSION
. The raw value returned by the service is
available from instanceTypeAsString()
.
ProductionVariantInstanceType
public final Float initialVariantWeight()
Determines initial traffic distribution among all of the models that you specify in the endpoint configuration.
The traffic to a production variant is determined by the ratio of the VariantWeight
to the sum of
all VariantWeight
values across all ProductionVariants. If unspecified, it defaults to 1.0.
VariantWeight
to the sum of all VariantWeight
values across all
ProductionVariants. If unspecified, it defaults to 1.0.public final ProductionVariantAcceleratorType acceleratorType()
The size of the Elastic Inference (EI) instance to use for the production variant. EI instances provide on-demand GPU computing for inference. For more information, see Using Elastic Inference in Amazon SageMaker.
If the service returns an enum value that is not available in the current SDK version, acceleratorType
will return ProductionVariantAcceleratorType.UNKNOWN_TO_SDK_VERSION
. The raw value returned by the
service is available from acceleratorTypeAsString()
.
ProductionVariantAcceleratorType
public final String acceleratorTypeAsString()
The size of the Elastic Inference (EI) instance to use for the production variant. EI instances provide on-demand GPU computing for inference. For more information, see Using Elastic Inference in Amazon SageMaker.
If the service returns an enum value that is not available in the current SDK version, acceleratorType
will return ProductionVariantAcceleratorType.UNKNOWN_TO_SDK_VERSION
. The raw value returned by the
service is available from acceleratorTypeAsString()
.
ProductionVariantAcceleratorType
public final ProductionVariantCoreDumpConfig coreDumpConfig()
Specifies configuration for a core dump from the model container when the process crashes.
public final ProductionVariantServerlessConfig serverlessConfig()
The serverless configuration for an endpoint. Specifies a serverless endpoint configuration instead of an instance-based endpoint configuration.
public ProductionVariant.Builder toBuilder()
toBuilder
in interface ToCopyableBuilder<ProductionVariant.Builder,ProductionVariant>
public static ProductionVariant.Builder builder()
public static Class<? extends ProductionVariant.Builder> serializableBuilderClass()
public final boolean equalsBySdkFields(Object obj)
equalsBySdkFields
in interface SdkPojo
public final String toString()
Copyright © 2022. All rights reserved.