Interface InferenceComponentComputeResourceRequirements.Builder
- All Superinterfaces:
  Buildable, CopyableBuilder<InferenceComponentComputeResourceRequirements.Builder,InferenceComponentComputeResourceRequirements>, SdkBuilder<InferenceComponentComputeResourceRequirements.Builder,InferenceComponentComputeResourceRequirements>, SdkPojo
- Enclosing class:
  InferenceComponentComputeResourceRequirements
public static interface InferenceComponentComputeResourceRequirements.Builder extends SdkPojo, CopyableBuilder<InferenceComponentComputeResourceRequirements.Builder,InferenceComponentComputeResourceRequirements>
-
Method Summary
Modifier and Type | Method | Description
InferenceComponentComputeResourceRequirements.Builder | maxMemoryRequiredInMb(Integer maxMemoryRequiredInMb) | The maximum MB of memory to allocate to run a model that you assign to an inference component.
InferenceComponentComputeResourceRequirements.Builder | minMemoryRequiredInMb(Integer minMemoryRequiredInMb) | The minimum MB of memory to allocate to run a model that you assign to an inference component.
InferenceComponentComputeResourceRequirements.Builder | numberOfAcceleratorDevicesRequired(Float numberOfAcceleratorDevicesRequired) | The number of accelerators to allocate to run a model that you assign to an inference component.
InferenceComponentComputeResourceRequirements.Builder | numberOfCpuCoresRequired(Float numberOfCpuCoresRequired) | The number of CPU cores to allocate to run a model that you assign to an inference component.
-
Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
copy
-
Methods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
applyMutation, build
-
Methods inherited from interface software.amazon.awssdk.core.SdkPojo
equalsBySdkFields, sdkFieldNameToField, sdkFields
-
Method Detail
-
numberOfCpuCoresRequired
InferenceComponentComputeResourceRequirements.Builder numberOfCpuCoresRequired(Float numberOfCpuCoresRequired)
The number of CPU cores to allocate to run a model that you assign to an inference component.
- Parameters:
numberOfCpuCoresRequired - The number of CPU cores to allocate to run a model that you assign to an inference component.
- Returns:
Returns a reference to this object so that method calls can be chained together.
-
numberOfAcceleratorDevicesRequired
InferenceComponentComputeResourceRequirements.Builder numberOfAcceleratorDevicesRequired(Float numberOfAcceleratorDevicesRequired)
The number of accelerators to allocate to run a model that you assign to an inference component. Accelerators include GPUs and Amazon Web Services Inferentia.
- Parameters:
numberOfAcceleratorDevicesRequired - The number of accelerators to allocate to run a model that you assign to an inference component. Accelerators include GPUs and Amazon Web Services Inferentia.
- Returns:
Returns a reference to this object so that method calls can be chained together.
-
minMemoryRequiredInMb
InferenceComponentComputeResourceRequirements.Builder minMemoryRequiredInMb(Integer minMemoryRequiredInMb)
The minimum MB of memory to allocate to run a model that you assign to an inference component.
- Parameters:
minMemoryRequiredInMb - The minimum MB of memory to allocate to run a model that you assign to an inference component.
- Returns:
Returns a reference to this object so that method calls can be chained together.
-
maxMemoryRequiredInMb
InferenceComponentComputeResourceRequirements.Builder maxMemoryRequiredInMb(Integer maxMemoryRequiredInMb)
The maximum MB of memory to allocate to run a model that you assign to an inference component.
- Parameters:
maxMemoryRequiredInMb - The maximum MB of memory to allocate to run a model that you assign to an inference component.
- Returns:
Returns a reference to this object so that method calls can be chained together.
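Because each setter returns a reference to the builder, the four methods above can be chained into a single expression. The sketch below illustrates that pattern with a stand-in class, ComputeRequirementsSketch, which is hypothetical and is not part of the SDK; it only mirrors the documented method names and parameter types so the example compiles without the AWS SDK on the classpath. The numeric values are arbitrary placeholders, not recommendations. With the SDK available, the same chain would presumably start from InferenceComponentComputeResourceRequirements.builder() instead.

```java
// Hypothetical stand-in for illustration only; NOT the SDK class.
// It mirrors the four documented setters so the chaining pattern can be run
// without the AWS SDK dependency.
final class ComputeRequirementsSketch {
    private final Float numberOfCpuCoresRequired;
    private final Float numberOfAcceleratorDevicesRequired;
    private final Integer minMemoryRequiredInMb;
    private final Integer maxMemoryRequiredInMb;

    private ComputeRequirementsSketch(Builder b) {
        this.numberOfCpuCoresRequired = b.numberOfCpuCoresRequired;
        this.numberOfAcceleratorDevicesRequired = b.numberOfAcceleratorDevicesRequired;
        this.minMemoryRequiredInMb = b.minMemoryRequiredInMb;
        this.maxMemoryRequiredInMb = b.maxMemoryRequiredInMb;
    }

    static Builder builder() {
        return new Builder();
    }

    // Accessors return null for any value that was never set on the builder.
    Float numberOfCpuCoresRequired() { return numberOfCpuCoresRequired; }
    Float numberOfAcceleratorDevicesRequired() { return numberOfAcceleratorDevicesRequired; }
    Integer minMemoryRequiredInMb() { return minMemoryRequiredInMb; }
    Integer maxMemoryRequiredInMb() { return maxMemoryRequiredInMb; }

    static final class Builder {
        private Float numberOfCpuCoresRequired;
        private Float numberOfAcceleratorDevicesRequired;
        private Integer minMemoryRequiredInMb;
        private Integer maxMemoryRequiredInMb;

        // Each setter stores the value and returns this builder,
        // which is what makes chaining possible.
        Builder numberOfCpuCoresRequired(Float value) {
            this.numberOfCpuCoresRequired = value;
            return this;
        }

        Builder numberOfAcceleratorDevicesRequired(Float value) {
            this.numberOfAcceleratorDevicesRequired = value;
            return this;
        }

        Builder minMemoryRequiredInMb(Integer value) {
            this.minMemoryRequiredInMb = value;
            return this;
        }

        Builder maxMemoryRequiredInMb(Integer value) {
            this.maxMemoryRequiredInMb = value;
            return this;
        }

        ComputeRequirementsSketch build() {
            return new ComputeRequirementsSketch(this);
        }
    }

    public static void main(String[] args) {
        // Placeholder values chosen only to show the call shape.
        ComputeRequirementsSketch requirements = ComputeRequirementsSketch.builder()
                .numberOfCpuCoresRequired(2.0f)
                .numberOfAcceleratorDevicesRequired(1.0f)
                .minMemoryRequiredInMb(1024)
                .maxMemoryRequiredInMb(4096)
                .build();

        // prints: 1024 MB to 4096 MB
        System.out.println(requirements.minMemoryRequiredInMb()
                + " MB to " + requirements.maxMemoryRequiredInMb() + " MB");
    }
}
```

Note that the CPU core and accelerator counts are typed as Float while the memory bounds are typed as Integer, so literals such as 2.0f and 1024 are boxed automatically when passed to the setters.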