Class PutLlamaRequest
java.lang.Object
co.elastic.clients.elasticsearch._types.RequestBase
co.elastic.clients.elasticsearch.inference.PutLlamaRequest
- All Implemented Interfaces:
JsonpSerializable
Create a Llama inference endpoint.
Create an inference endpoint to perform an inference task with the
llama service.
- See Also:
-
Nested Class Summary
Nested ClassesNested classes/interfaces inherited from class co.elastic.clients.elasticsearch._types.RequestBase
RequestBase.AbstractBuilder<BuilderT extends RequestBase.AbstractBuilder<BuilderT>> -
Field Summary
FieldsModifier and TypeFieldDescriptionstatic final JsonpDeserializer<PutLlamaRequest>Json deserializer forPutLlamaRequeststatic final Endpoint<PutLlamaRequest,PutLlamaResponse, ErrorResponse> Endpoint "inference.put_llama". -
Method Summary
Modifier and TypeMethodDescriptionThe chunking configuration object.final StringRequired - The unique identifier of the inference endpoint.static PutLlamaRequestvoidserialize(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper) Serialize this object to JSON.protected voidserializeInternal(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper) final LlamaServiceTypeservice()Required - The type of service supported for the specified task type.final LlamaServiceSettingsRequired - Settings used to install the inference model.protected static voidfinal LlamaTaskTypetaskType()Required - The type of the inference task that the model will perform.final Timetimeout()Specifies the amount of time to wait for the inference endpoint to be created.Methods inherited from class co.elastic.clients.elasticsearch._types.RequestBase
toString
-
Field Details
-
_DESERIALIZER
Json deserializer forPutLlamaRequest -
_ENDPOINT
Endpoint "inference.put_llama".
-
-
Method Details
-
of
public static PutLlamaRequest of(Function<PutLlamaRequest.Builder, ObjectBuilder<PutLlamaRequest>> fn) -
chunkingSettings
The chunking configuration object. Applies only to thetext_embeddingtask type. Not applicable to thecompletionorchat_completiontask types.API name:
chunking_settings -
llamaInferenceId
Required - The unique identifier of the inference endpoint.API name:
llama_inference_id -
service
Required - The type of service supported for the specified task type. In this case,llama.API name:
service -
serviceSettings
Required - Settings used to install the inference model. These settings are specific to thellamaservice.API name:
service_settings -
taskType
Required - The type of the inference task that the model will perform.API name:
task_type -
timeout
Specifies the amount of time to wait for the inference endpoint to be created.API name:
timeout -
serialize
Serialize this object to JSON.- Specified by:
serializein interfaceJsonpSerializable
-
serializeInternal
-
setupPutLlamaRequestDeserializer
protected static void setupPutLlamaRequestDeserializer(ObjectDeserializer<PutLlamaRequest.Builder> op)
-