Class LlamaServiceSettings.Builder

java.lang.Object
  co.elastic.clients.util.ObjectBuilderBase
    co.elastic.clients.util.WithJsonObjectBuilderBase<LlamaServiceSettings.Builder>
      co.elastic.clients.elasticsearch.inference.LlamaServiceSettings.Builder
- All Implemented Interfaces:
  WithJson<LlamaServiceSettings.Builder>, ObjectBuilder<LlamaServiceSettings>
- Enclosing class:
  LlamaServiceSettings
public static class LlamaServiceSettings.Builder
extends WithJsonObjectBuilderBase<LlamaServiceSettings.Builder>
implements ObjectBuilder<LlamaServiceSettings>
Builder for LlamaServiceSettings.
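A minimal usage sketch, not taken from this page: it constructs a LlamaServiceSettings for a chat_completion task using only the constructor and builder methods documented below. The endpoint URL and model name follow the values given in the url and modelId descriptions; the host and port are placeholders.

import co.elastic.clients.elasticsearch.inference.LlamaServiceSettings;

// Hypothetical example: service settings for a chat_completion task.
LlamaServiceSettings settings = new LlamaServiceSettings.Builder()
        .url("http://localhost:8321/v1/openai/v1/chat/completions") // host and port are placeholders
        .modelId("llama3.2:3b")                                     // model listed below as tested for chat_completion
        .build();                                                   // required fields (url, model_id) must be set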
Constructor Summary

Constructors
Builder()

Method Summary

- build() - Builds a LlamaServiceSettings.
- maxInputTokens(Integer value) - For a text_embedding task, the maximum number of tokens per input before chunking occurs.
- modelId(String value) - Required - The name of the model to use for the inference task.
- rateLimit(RateLimitSetting value) - This setting helps to minimize the number of rate limit errors returned from the Llama API.
- rateLimit(Function<RateLimitSetting.Builder, ObjectBuilder<RateLimitSetting>> fn) - This setting helps to minimize the number of rate limit errors returned from the Llama API.
- protected LlamaServiceSettings.Builder self()
- similarity(LlamaSimilarityType value) - For a text_embedding task, the similarity measure.
- url(String value) - Required - The URL of the Llama stack endpoint.

Methods inherited from class co.elastic.clients.util.WithJsonObjectBuilderBase
withJson

Methods inherited from class co.elastic.clients.util.ObjectBuilderBase
_checkSingleUse, _listAdd, _listAddAll, _mapPut, _mapPutAll
Constructor Details

- Builder
public Builder()
Method Details
- url
Required - The URL of the Llama stack endpoint. The URL must contain:
  - For the text_embedding task: /v1/inference/embeddings.
  - For the completion and chat_completion tasks: /v1/openai/v1/chat/completions.
API name: url
- modelId
Required - The name of the model to use for the inference task. Refer to the Llama documentation on downloading models for the different ways of getting a list of available models and downloading them. The service has been tested and confirmed to work with the following models:
  - For the text_embedding task: all-MiniLM-L6-v2.
  - For the completion and chat_completion tasks: llama3.2:3b.
API name: model_id
- maxInputTokens
For a text_embedding task, the maximum number of tokens per input before chunking occurs.
API name: max_input_tokens
- similarity
For a text_embedding task, the similarity measure. One of cosine, dot_product, or l2_norm.
API name: similarity
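A hedged sketch of a text_embedding configuration that combines the optional settings above with the required url and modelId. The maxInputTokens value is illustrative, and the Cosine constant name on LlamaSimilarityType is an assumption not confirmed by this page.

import co.elastic.clients.elasticsearch.inference.LlamaServiceSettings;
import co.elastic.clients.elasticsearch.inference.LlamaSimilarityType;

// Hypothetical example: text_embedding settings with optional chunking and similarity options.
LlamaServiceSettings embeddingSettings = new LlamaServiceSettings.Builder()
        .url("http://localhost:8321/v1/inference/embeddings")   // embeddings path required for text_embedding
        .modelId("all-MiniLM-L6-v2")                             // model listed above as tested for text_embedding
        .maxInputTokens(512)                                     // illustrative limit before chunking occurs
        .similarity(LlamaSimilarityType.Cosine)                  // assumed enum constant for the cosine measure
        .build();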
- rateLimit
This setting helps to minimize the number of rate limit errors returned from the Llama API. By default, the llama service sets the number of requests allowed per minute to 3000.
API name: rate_limit
- rateLimit
public final LlamaServiceSettings.Builder rateLimit(Function<RateLimitSetting.Builder, ObjectBuilder<RateLimitSetting>> fn)
This setting helps to minimize the number of rate limit errors returned from the Llama API. By default, the llama service sets the number of requests allowed per minute to 3000.
API name: rate_limit
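A sketch of the function-style overload, which takes a lambda that configures a nested RateLimitSetting.Builder. The requestsPerMinute setter is assumed from the RateLimitSetting type and is not documented on this page; the value shown is illustrative.

// Hypothetical example: lower the rate limit from the default of 3000 requests per minute.
LlamaServiceSettings limitedSettings = new LlamaServiceSettings.Builder()
        .url("http://localhost:8321/v1/openai/v1/chat/completions")
        .modelId("llama3.2:3b")
        .rateLimit(r -> r.requestsPerMinute(500))                // assumed RateLimitSetting.Builder setter
        .build();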
- self
Specified by: self in class WithJsonObjectBuilderBase<LlamaServiceSettings.Builder>
- build
Builds a LlamaServiceSettings.
Specified by: build in interface ObjectBuilder<LlamaServiceSettings>
Throws: NullPointerException - if some of the required fields are null.