Class RateLimitSetting
java.lang.Object
co.elastic.clients.elasticsearch.inference.RateLimitSetting
- All Implemented Interfaces:
JsonpSerializable
This setting helps to minimize the number of rate limit errors returned from
the service.
Nested Class Summary
Nested Classes
RateLimitSetting.Builder
-
Field Summary
Fields
static final JsonpDeserializer<RateLimitSetting> _DESERIALIZER
    JSON deserializer for RateLimitSetting
-
Method Summary
Methods
static RateLimitSetting of(Function<RateLimitSetting.Builder, ObjectBuilder<RateLimitSetting>> fn)
final Integer requestsPerMinute()
    The number of requests allowed per minute.
void serialize(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper)
    Serialize this object to JSON.
protected void serializeInternal(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper)
protected static void setupRateLimitSettingDeserializer(ObjectDeserializer<RateLimitSetting.Builder> op)
String toString()
-
Field Details
-
_DESERIALIZER
JSON deserializer for RateLimitSetting
-
-
Method Details
-
of
public static RateLimitSetting of(Function<RateLimitSetting.Builder, ObjectBuilder<RateLimitSetting>> fn) -
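The signature above follows a builder-lambda pattern: the caller receives a fresh builder, configures it, and `of` builds the result. The sketch below reproduces that shape with hypothetical stand-in types (`RateLimitSettingOfSketch.Setting`, `RateLimitSettingOfSketch.ObjectBuilder`), so that it compiles without the client library. The fluent `requestsPerMinute(Integer)` setter on the builder is an assumption, not something this page documents; if the real builder exposes it, the equivalent call would presumably look like `RateLimitSetting.of(b -> b.requestsPerMinute(100))`.

```java
import java.util.function.Function;

// Stand-in types only: these mirror the shape of the `of` signature and are
// NOT the classes from co.elastic.clients.
class RateLimitSettingOfSketch {

    // Hypothetical equivalent of the client's ObjectBuilder<T>.
    interface ObjectBuilder<T> {
        T build();
    }

    static final class Setting {
        // null is used here to mean "no explicit value was configured".
        private final Integer requestsPerMinute;

        private Setting(Builder builder) {
            this.requestsPerMinute = builder.requestsPerMinute;
        }

        // Same shape as RateLimitSetting.of: apply the caller's lambda to a
        // new builder, then build whatever the lambda returns.
        static Setting of(Function<Builder, ObjectBuilder<Setting>> fn) {
            return fn.apply(new Builder()).build();
        }

        Integer requestsPerMinute() {
            return requestsPerMinute;
        }
    }

    static final class Builder implements ObjectBuilder<Setting> {
        private Integer requestsPerMinute;

        // Assumed fluent setter; returns the builder so calls can be chained.
        Builder requestsPerMinute(Integer value) {
            this.requestsPerMinute = value;
            return this;
        }

        @Override
        public Setting build() {
            return new Setting(this);
        }
    }

    public static void main(String[] args) {
        Setting setting = Setting.of(b -> b.requestsPerMinute(100));
        System.out.println("requests per minute: " + setting.requestsPerMinute());
    }
}
```

Because the lambda returns the builder itself, `of` can accept any chain of setter calls without the caller ever invoking `build()` directly.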
requestsPerMinute
The number of requests allowed per minute. By default, the number of requests allowed per minute is set by each service as follows:
- alibabacloud-ai-search service: 1000
- anthropic service: 50
- azureaistudio service: 240
- azureopenai service and task type text_embedding: 1440
- azureopenai service and task type completion: 120
- cohere service: 10000
- elastic service and task type chat_completion: 240
- googleaistudio service: 360
- googlevertexai service: 30000
- hugging_face service: 3000
- jinaai service: 2000
- mistral service: 240
- openai service and task type text_embedding: 3000
- openai service and task type completion: 500
- voyageai service: 2000
- watsonxai service: 120
API name: requests_per_minute
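To make the defaults listed above concrete, the following sketch stores them in a lookup table and derives the average spacing between requests that a given limit implies. The class name `DefaultRateLimitsSketch`, the `service/task_type` key format, and the fallback order (explicit value, then task-specific default, then service default) are illustrative assumptions and not part of the client API.

```java
import java.util.HashMap;
import java.util.Map;

class DefaultRateLimitsSketch {

    // Defaults copied from the list above. Keys are a hypothetical
    // "service" or "service/task_type" label chosen for this sketch.
    static final Map<String, Integer> DEFAULTS = new HashMap<>();
    static {
        DEFAULTS.put("alibabacloud-ai-search", 1000);
        DEFAULTS.put("anthropic", 50);
        DEFAULTS.put("azureaistudio", 240);
        DEFAULTS.put("azureopenai/text_embedding", 1440);
        DEFAULTS.put("azureopenai/completion", 120);
        DEFAULTS.put("cohere", 10000);
        DEFAULTS.put("elastic/chat_completion", 240);
        DEFAULTS.put("googleaistudio", 360);
        DEFAULTS.put("googlevertexai", 30000);
        DEFAULTS.put("hugging_face", 3000);
        DEFAULTS.put("jinaai", 2000);
        DEFAULTS.put("mistral", 240);
        DEFAULTS.put("openai/text_embedding", 3000);
        DEFAULTS.put("openai/completion", 500);
        DEFAULTS.put("voyageai", 2000);
        DEFAULTS.put("watsonxai", 120);
    }

    // Assumed resolution order: an explicitly configured value wins, then a
    // task-specific default, then a service-wide default. Returns null when
    // the list above has no matching entry.
    static Integer effectiveRequestsPerMinute(Integer configured, String service, String taskType) {
        if (configured != null) {
            return configured;
        }
        Integer byTask = DEFAULTS.get(service + "/" + taskType);
        return byTask != null ? byTask : DEFAULTS.get(service);
    }

    // Average spacing between requests, in milliseconds, rounded up so that
    // evenly spaced requests never exceed the per-minute limit.
    static long minIntervalMillis(int requestsPerMinute) {
        if (requestsPerMinute <= 0) {
            throw new IllegalArgumentException("requestsPerMinute must be positive");
        }
        return (60_000L + requestsPerMinute - 1) / requestsPerMinute;
    }

    public static void main(String[] args) {
        Integer rpm = effectiveRequestsPerMinute(null, "openai", "completion");
        System.out.println(rpm + " requests/minute, one request every "
                + minIntervalMillis(rpm) + " ms");
    }
}
```

For example, the anthropic default of 50 requests per minute works out to one request every 1200 ms, while the azureopenai text_embedding default of 1440 rounds up to one request every 42 ms.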
-
serialize
Serialize this object to JSON.
Specified by: serialize in interface JsonpSerializable
-
serializeInternal
-
toString
-
setupRateLimitSettingDeserializer
protected static void setupRateLimitSettingDeserializer(ObjectDeserializer<RateLimitSetting.Builder> op)
-