co.elastic.clients.elasticsearch._types.RequestBase

co.elastic.clients.elasticsearch.inference.PutHuggingFaceRequest

All Implemented Interfaces: JsonpSerializable

@JsonpDeserializable public class PutHuggingFaceRequest extends RequestBase implements JsonpSerializable

Create a Hugging Face inference endpoint.

Create an inference endpoint to perform an inference task with the hugging_face service.

You must first create an inference endpoint on the Hugging Face endpoint page to get an endpoint URL. On the new endpoint creation page, select the model you want to use (for example, intfloat/e5-small-v2), then select the sentence embeddings task under the advanced configuration section. Create the endpoint and copy its URL once initialization has finished.

The following models are recommended for the Hugging Face service:

all-MiniLM-L6-v2
all-MiniLM-L12-v2
all-mpnet-base-v2
e5-base-v2
e5-small-v2
multilingual-e5-base
multilingual-e5-small

When you create an inference endpoint, the associated machine learning model is automatically deployed if it is not already running. After creating the endpoint, wait for the model deployment to complete before using it. To verify the deployment status, use the get trained model statistics API. Look for "state": "fully_allocated" in the response and ensure that the "allocation_count" matches the "target_allocation_count". Avoid creating multiple endpoints for the same model unless required, as each endpoint consumes significant resources.
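
The readiness condition described above can be sketched in plain Java. This is a minimal illustration, not part of the client: it assumes the three values have already been extracted from the get trained model statistics response, and the class and method names are hypothetical.

```java
public class DeploymentReadiness {

    /**
     * Returns true when the deployment reports "fully_allocated" and the
     * current allocation count has reached the target allocation count.
     * The parameters are assumed to come from the statistics response
     * fields "state", "allocation_count" and "target_allocation_count".
     */
    public static boolean isReady(String state, int allocationCount, int targetAllocationCount) {
        return "fully_allocated".equals(state) && allocationCount == targetAllocationCount;
    }

    public static void main(String[] args) {
        System.out.println(isReady("fully_allocated", 2, 2)); // prints "true"
        System.out.println(isReady("starting", 0, 2));        // prints "false"
    }
}
```

A caller would typically poll the statistics API and use the endpoint only once a check like this returns true.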

See Also:

API specification

Nested Class Summary

Nested Classes

| Modifier and Type | Class | Description |
| --- | --- | --- |
| static class | PutHuggingFaceRequest.Builder | Builder for PutHuggingFaceRequest. |

Nested classes/interfaces inherited from class co.elastic.clients.elasticsearch._types.RequestBase

RequestBase.AbstractBuilder<BuilderT extends RequestBase.AbstractBuilder<BuilderT>>

Field Summary

Fields

| Modifier and Type | Field | Description |
| --- | --- | --- |
| static final JsonpDeserializer<PutHuggingFaceRequest> | _DESERIALIZER | JSON deserializer for PutHuggingFaceRequest |
| static final Endpoint<PutHuggingFaceRequest,PutHuggingFaceResponse,ErrorResponse> | _ENDPOINT | Endpoint "inference.put_hugging_face". |

Method Summary

| Modifier and Type | Method | Description |
| --- | --- | --- |
| final InferenceChunkingSettings | chunkingSettings() | The chunking configuration object. |
| final String | huggingfaceInferenceId() | Required - The unique identifier of the inference endpoint. |
| static PutHuggingFaceRequest | of(Function<PutHuggingFaceRequest.Builder,ObjectBuilder<PutHuggingFaceRequest>> fn) | |
| void | serialize(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper) | Serialize this object to JSON. |
| protected void | serializeInternal(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper) | |
| final HuggingFaceServiceType | service() | Required - The type of service supported for the specified task type. |
| final HuggingFaceServiceSettings | serviceSettings() | Required - Settings used to install the inference model. |
| protected static void | setupPutHuggingFaceRequestDeserializer(ObjectDeserializer<PutHuggingFaceRequest.Builder> op) | |
| final HuggingFaceTaskType | taskType() | Required - The type of the inference task that the model will perform. |

Methods inherited from class co.elastic.clients.elasticsearch._types.RequestBase
toString

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait

Field Details
- _DESERIALIZER
  
  public static final JsonpDeserializer<PutHuggingFaceRequest> _DESERIALIZER
  
  JSON deserializer for PutHuggingFaceRequest
- _ENDPOINT
  
  public static final Endpoint<PutHuggingFaceRequest,PutHuggingFaceResponse,ErrorResponse> _ENDPOINT
  
  Endpoint "inference.put_hugging_face".

Method Details
- of
  
  public static PutHuggingFaceRequest of(Function<PutHuggingFaceRequest.Builder,ObjectBuilder<PutHuggingFaceRequest>> fn)
- chunkingSettings
  
  @Nullable public final InferenceChunkingSettings chunkingSettings()
  
  The chunking configuration object.
  API name: chunking_settings
- huggingfaceInferenceId
  
  public final String huggingfaceInferenceId()
  
  Required - The unique identifier of the inference endpoint.
  API name: huggingface_inference_id
- service
  
  public final HuggingFaceServiceType service()
  
  Required - The type of service supported for the specified task type. In this case, hugging_face.
  API name: service
- serviceSettings
  
  public final HuggingFaceServiceSettings serviceSettings()
  
  Required - Settings used to install the inference model. These settings are specific to the hugging_face service.
  API name: service_settings
- taskType
  
  public final HuggingFaceTaskType taskType()
  
  Required - The type of the inference task that the model will perform.
  API name: task_type
- serialize
  
  public void serialize(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper)
  
  Serialize this object to JSON.
  
  Specified by:
  
  serialize in interface JsonpSerializable
- serializeInternal
  
  protected void serializeInternal(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper)
- setupPutHuggingFaceRequestDeserializer
  
  protected static void setupPutHuggingFaceRequestDeserializer(ObjectDeserializer<PutHuggingFaceRequest.Builder> op)
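
To show how the API names listed above fit together in a request, here is a minimal sketch that uses only the Java standard library rather than the client classes. It rests on assumptions not stated on this page: that task_type and huggingface_inference_id are path parameters of PUT /_inference/{task_type}/{huggingface_inference_id}, and that service_settings carries url and api_key fields. The class name, the identifier my-hf-endpoint, and the URL and key values are hypothetical placeholders.

```java
public class HuggingFaceRequestSketch {

    /**
     * Builds the request path. Assumption: task_type and
     * huggingface_inference_id travel in the path, not in the body.
     */
    public static String path(String taskType, String inferenceId) {
        return "/_inference/" + taskType + "/" + inferenceId;
    }

    /**
     * Builds a minimal JSON body with the required service and
     * service_settings fields. Assumption: the settings hold "url" and
     * "api_key". No JSON escaping is performed, so the inputs must not
     * contain quotes or backslashes.
     */
    public static String body(String url, String apiKey) {
        return "{\"service\":\"hugging_face\","
            + "\"service_settings\":{\"url\":\"" + url + "\",\"api_key\":\"" + apiKey + "\"}}";
    }

    public static void main(String[] args) {
        // Placeholder values for illustration only.
        System.out.println("PUT " + path("text_embedding", "my-hf-endpoint"));
        System.out.println(body("https://example.invalid/endpoint", "hf_xxx"));
    }
}
```

In real code the client's PutHuggingFaceRequest.Builder would assemble and serialize these fields; the sketch only makes the resulting request shape visible.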
