Package com.pulumi.aws.sagemaker.inputs
Class EndpointConfigurationShadowProductionVariantArgs
- java.lang.Object
-
- com.pulumi.resources.InputArgs
-
- com.pulumi.resources.ResourceArgs
-
- com.pulumi.aws.sagemaker.inputs.EndpointConfigurationShadowProductionVariantArgs
-
public final class EndpointConfigurationShadowProductionVariantArgs extends com.pulumi.resources.ResourceArgs
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static class
EndpointConfigurationShadowProductionVariantArgs.Builder
-
Field Summary
Fields Modifier and Type Field Description static EndpointConfigurationShadowProductionVariantArgs
Empty
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description java.util.Optional<com.pulumi.core.Output<java.lang.String>>
acceleratorType()
static EndpointConfigurationShadowProductionVariantArgs.Builder
builder()
static EndpointConfigurationShadowProductionVariantArgs.Builder
builder(EndpointConfigurationShadowProductionVariantArgs defaults)
java.util.Optional<com.pulumi.core.Output<java.lang.Integer>>
containerStartupHealthCheckTimeoutInSeconds()
java.util.Optional<com.pulumi.core.Output<EndpointConfigurationShadowProductionVariantCoreDumpConfigArgs>>
coreDumpConfig()
java.util.Optional<com.pulumi.core.Output<java.lang.Boolean>>
enableSsmAccess()
java.util.Optional<com.pulumi.core.Output<java.lang.Integer>>
initialInstanceCount()
java.util.Optional<com.pulumi.core.Output<java.lang.Double>>
initialVariantWeight()
java.util.Optional<com.pulumi.core.Output<java.lang.String>>
instanceType()
java.util.Optional<com.pulumi.core.Output<java.lang.Integer>>
modelDataDownloadTimeoutInSeconds()
com.pulumi.core.Output<java.lang.String>
modelName()
java.util.Optional<com.pulumi.core.Output<java.util.List<EndpointConfigurationShadowProductionVariantRoutingConfigArgs>>>
routingConfigs()
java.util.Optional<com.pulumi.core.Output<EndpointConfigurationShadowProductionVariantServerlessConfigArgs>>
serverlessConfig()
java.util.Optional<com.pulumi.core.Output<java.lang.String>>
variantName()
java.util.Optional<com.pulumi.core.Output<java.lang.Integer>>
volumeSizeInGb()
-
-
-
Field Detail
-
Empty
public static final EndpointConfigurationShadowProductionVariantArgs Empty
-
-
Method Detail
-
acceleratorType
public java.util.Optional<com.pulumi.core.Output<java.lang.String>> acceleratorType()
- Returns:
- The size of the Elastic Inference (EI) instance to use for the production variant.
-
containerStartupHealthCheckTimeoutInSeconds
public java.util.Optional<com.pulumi.core.Output<java.lang.Integer>> containerStartupHealthCheckTimeoutInSeconds()
- Returns:
- The timeout value, in seconds, for your inference container to pass health check by SageMaker Hosting. For more information about health check, see [How Your Container Should Respond to Health Check (Ping) Requests](https://docs.aws.amazon.com/sagemaker/latest/dg/your-algorithms-inference-code.html#your-algorithms-inference-algo-ping-requests). Valid values between `60` and `3600`.
-
coreDumpConfig
public java.util.Optional<com.pulumi.core.Output<EndpointConfigurationShadowProductionVariantCoreDumpConfigArgs>> coreDumpConfig()
- Returns:
- Specifies configuration for a core dump from the model container when the process crashes. Fields are documented below.
-
enableSsmAccess
public java.util.Optional<com.pulumi.core.Output<java.lang.Boolean>> enableSsmAccess()
- Returns:
- You can use this parameter to turn on native Amazon Web Services Systems Manager (SSM) access for a production variant behind an endpoint. By default, SSM access is disabled for all production variants behind an endpoints.
-
initialInstanceCount
public java.util.Optional<com.pulumi.core.Output<java.lang.Integer>> initialInstanceCount()
- Returns:
- Initial number of instances used for auto-scaling.
-
initialVariantWeight
public java.util.Optional<com.pulumi.core.Output<java.lang.Double>> initialVariantWeight()
- Returns:
- Determines initial traffic distribution among all of the models that you specify in the endpoint configuration. If unspecified, it defaults to `1.0`.
-
instanceType
public java.util.Optional<com.pulumi.core.Output<java.lang.String>> instanceType()
- Returns:
- The type of instance to start.
-
modelDataDownloadTimeoutInSeconds
public java.util.Optional<com.pulumi.core.Output<java.lang.Integer>> modelDataDownloadTimeoutInSeconds()
- Returns:
- The timeout value, in seconds, to download and extract the model that you want to host from Amazon S3 to the individual inference instance associated with this production variant. Valid values between `60` and `3600`.
-
modelName
public com.pulumi.core.Output<java.lang.String> modelName()
- Returns:
- The name of the model to use.
-
routingConfigs
public java.util.Optional<com.pulumi.core.Output<java.util.List<EndpointConfigurationShadowProductionVariantRoutingConfigArgs>>> routingConfigs()
- Returns:
- Sets how the endpoint routes incoming traffic. See routing_config below.
-
serverlessConfig
public java.util.Optional<com.pulumi.core.Output<EndpointConfigurationShadowProductionVariantServerlessConfigArgs>> serverlessConfig()
- Returns:
- Specifies configuration for how an endpoint performs asynchronous inference.
-
variantName
public java.util.Optional<com.pulumi.core.Output<java.lang.String>> variantName()
- Returns:
- The name of the variant. If omitted, this provider will assign a random, unique name.
-
volumeSizeInGb
public java.util.Optional<com.pulumi.core.Output<java.lang.Integer>> volumeSizeInGb()
- Returns:
- The size, in GB, of the ML storage volume attached to individual inference instance associated with the production variant. Valid values between `1` and `512`.
-
builder
public static EndpointConfigurationShadowProductionVariantArgs.Builder builder()
-
builder
public static EndpointConfigurationShadowProductionVariantArgs.Builder builder(EndpointConfigurationShadowProductionVariantArgs defaults)
-
-