Class PutOpenaiRequest
- All Implemented Interfaces:
JsonpSerializable
Create an inference endpoint to perform an inference task with the
openai service.
When you create an inference endpoint, the associated machine learning model
is automatically deployed if it is not already running. After creating the
endpoint, wait for the model deployment to complete before using it. To
verify the deployment status, use the get trained model statistics API. Look
for "state": "fully_allocated" in the
response and ensure that the "allocation_count"
matches the "target_allocation_count". Avoid creating
multiple endpoints for the same model unless required, as each endpoint
consumes significant resources.
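A minimal usage sketch, assuming an already-configured ElasticsearchClient; the endpoint id, model id, service-setting names (apiKey, modelId), and enum constant names are illustrative assumptions rather than values defined by this class:

import co.elastic.clients.elasticsearch.ElasticsearchClient;
import co.elastic.clients.elasticsearch.inference.OpenAIServiceType;
import co.elastic.clients.elasticsearch.inference.OpenAITaskType;
import co.elastic.clients.elasticsearch.inference.PutOpenaiResponse;
import java.io.IOException;

public class CreateOpenAiEndpoint {
    // Creates a text_embedding endpoint backed by the openai service.
    static PutOpenaiResponse create(ElasticsearchClient client) throws IOException {
        return client.inference().putOpenai(r -> r
            .openaiInferenceId("my-openai-embeddings")       // hypothetical endpoint id
            .taskType(OpenAITaskType.TextEmbedding)          // enum constant name assumed
            .service(OpenAIServiceType.Openai)               // enum constant name assumed
            .serviceSettings(s -> s
                .apiKey("<your-openai-api-key>")             // setting names assumed from the REST API
                .modelId("text-embedding-3-small")));        // hypothetical model id
    }
}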
Nested Class Summary
Nested Classes
Nested classes/interfaces inherited from class co.elastic.clients.elasticsearch._types.RequestBase:
- RequestBase.AbstractBuilder<BuilderT extends RequestBase.AbstractBuilder<BuilderT>> -
Field Summary
Fields
- static final JsonpDeserializer<PutOpenaiRequest> _DESERIALIZER
  Json deserializer for PutOpenaiRequest
- static final Endpoint<PutOpenaiRequest, PutOpenaiResponse, ErrorResponse> _ENDPOINT
  Endpoint "inference.put_openai". -
Method Summary
Methods
- chunkingSettings() The chunking configuration object.
- static PutOpenaiRequest of(Function<PutOpenaiRequest.Builder, ObjectBuilder<PutOpenaiRequest>> fn)
- final String openaiInferenceId() Required - The unique identifier of the inference endpoint.
- void serialize(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper) Serialize this object to JSON.
- protected void serializeInternal(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper)
- final OpenAIServiceType service() Required - The type of service supported for the specified task type.
- final OpenAIServiceSettings serviceSettings() Required - Settings used to install the inference model.
- protected static void setupPutOpenaiRequestDeserializer(ObjectDeserializer<PutOpenaiRequest.Builder> op)
- final OpenAITaskSettings taskSettings() Settings to configure the inference task.
- final OpenAITaskType taskType() Required - The type of the inference task that the model will perform.
Methods inherited from class co.elastic.clients.elasticsearch._types.RequestBase:
- toString
-
Field Details
-
_DESERIALIZER
Json deserializer for PutOpenaiRequest -
_ENDPOINT
Endpoint "inference.put_openai".
-
-
Method Details
-
of
public static PutOpenaiRequest of(Function<PutOpenaiRequest.Builder, ObjectBuilder<PutOpenaiRequest>> fn) -
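A minimal sketch of the of() factory, which wraps the builder lambda; identifiers and enum constant names below are illustrative assumptions:

PutOpenaiRequest request = PutOpenaiRequest.of(r -> r
    .openaiInferenceId("my-openai-completions")              // hypothetical endpoint id
    .taskType(OpenAITaskType.Completion)                     // enum constant name assumed
    .service(OpenAIServiceType.Openai)                       // enum constant name assumed
    .serviceSettings(s -> s
        .apiKey("<your-openai-api-key>")                     // setting names assumed from the REST API
        .modelId("gpt-4o-mini")));                           // hypothetical model id
// The built request can then be passed to the client, e.g. client.inference().putOpenai(request).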
chunkingSettings
The chunking configuration object.
API name: chunking_settings -
openaiInferenceId
Required - The unique identifier of the inference endpoint.
API name: openai_inference_id -
service
Required - The type of service supported for the specified task type. In this case, openai.
API name: service -
serviceSettings
Required - Settings used to install the inference model. These settings are specific to the openai service.
API name: service_settings -
taskSettings
Settings to configure the inference task. These settings are specific to the task type you specified.
API name: task_settings -
taskType
Required - The type of the inference task that the model will perform. NOTE: The chat_completion task type only supports streaming and only through the _stream API.
API name: task_type -
serialize
Serialize this object to JSON.
- Specified by:
serialize in interface JsonpSerializable
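A minimal sketch of producing the JSON body of a request, assuming the Jackson-backed JacksonJsonpMapper (co.elastic.clients.json.jackson) is on the classpath and that "request" is a previously built PutOpenaiRequest; any JsonpMapper can be used the same way, with JsonpMapper from co.elastic.clients.json and StringWriter from java.io:

JsonpMapper mapper = new JacksonJsonpMapper();
StringWriter out = new StringWriter();
try (jakarta.json.stream.JsonGenerator generator = mapper.jsonProvider().createGenerator(out)) {
    request.serialize(generator, mapper);   // writes the request body as JSON
}
String json = out.toString();               // e.g. for logging or debugging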
-
serializeInternal
-
setupPutOpenaiRequestDeserializer
protected static void setupPutOpenaiRequestDeserializer(ObjectDeserializer<PutOpenaiRequest.Builder> op)
-