@ThreadSafe @Generated(value="com.amazonaws:aws-java-sdk-code-generator") public class AmazonBedrockRuntimeClient extends AmazonWebServiceClient implements AmazonBedrockRuntime
Describes the API operations for running inference using Bedrock models.
LOGGING_AWS_REQUEST_METRIC
ENDPOINT_PREFIX
Modifier and Type | Method and Description |
---|---|
static AmazonBedrockRuntimeClientBuilder |
builder() |
ResponseMetadata |
getCachedResponseMetadata(AmazonWebServiceRequest request)
Returns additional metadata for a previously executed successful, request, typically used for debugging issues
where a service isn't acting as expected.
|
InvokeModelResult |
invokeModel(InvokeModelRequest request)
Invokes the specified Bedrock model to run inference using the input provided in the request body.
|
void |
shutdown()
Shuts down this client object, releasing any resources that might be held
open.
|
addRequestHandler, addRequestHandler, configureRegion, getClientConfiguration, getEndpointPrefix, getMonitoringListeners, getRequestMetricsCollector, getServiceName, getSignerByURI, getSignerOverride, getSignerRegionOverride, getTimeOffset, makeImmutable, removeRequestHandler, removeRequestHandler, setEndpoint, setEndpoint, setRegion, setServiceNameIntern, setSignerRegionOverride, setTimeOffset, withEndpoint, withRegion, withRegion, withTimeOffset
public static AmazonBedrockRuntimeClientBuilder builder()
public InvokeModelResult invokeModel(InvokeModelRequest request)
Invokes the specified Bedrock model to run inference using the input provided in the request body. You use InvokeModel to run inference for text models, image models, and embedding models.
For more information, see Run inference in the Bedrock User Guide.
For example requests, see Examples (after the Errors section).
invokeModel
in interface AmazonBedrockRuntime
invokeModelRequest
- AccessDeniedException
- The request is denied because of missing access permissions.ResourceNotFoundException
- The specified resource ARN was not found. Check the ARN and try your request again.ThrottlingException
- The number of requests exceeds the limit. Resubmit your request later.ModelTimeoutException
- The request took too long to process. Processing time exceeded the model timeout length.InternalServerException
- An internal server error occurred. Retry your request.ValidationException
- Input validation failed. Check your request parameters and retry the request.ModelNotReadyException
- The model specified in the request is not ready to serve inference requests.ServiceQuotaExceededException
- The number of requests exceeds the service quota. Resubmit your request later.ModelErrorException
- The request failed due to an error while processing the model.public ResponseMetadata getCachedResponseMetadata(AmazonWebServiceRequest request)
Response metadata is only cached for a limited period of time, so if you need to access this extra diagnostic information for an executed request, you should use this method to retrieve it as soon as possible after executing the request.
getCachedResponseMetadata
in interface AmazonBedrockRuntime
request
- The originally executed requestpublic void shutdown()
AmazonWebServiceClient
shutdown
in interface AmazonBedrockRuntime
shutdown
in class AmazonWebServiceClient