Class RateLimitSetting
java.lang.Object
co.elastic.clients.elasticsearch.inference.RateLimitSetting
- All Implemented Interfaces:
JsonpSerializable
This setting helps to minimize the number of rate limit errors returned from
the service.
- See Also:
-
Nested Class Summary
Nested Classes -
Field Summary
FieldsModifier and TypeFieldDescriptionstatic final JsonpDeserializer<RateLimitSetting>Json deserializer forRateLimitSetting -
Method Summary
Modifier and TypeMethodDescriptionstatic RateLimitSettingfinal IntegerThe number of requests allowed per minute.voidserialize(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper) Serialize this object to JSON.protected voidserializeInternal(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper) protected static voidtoString()
-
Field Details
-
_DESERIALIZER
Json deserializer forRateLimitSetting
-
-
Method Details
-
of
public static RateLimitSetting of(Function<RateLimitSetting.Builder, ObjectBuilder<RateLimitSetting>> fn) -
requestsPerMinute
The number of requests allowed per minute. By default, the number of requests allowed per minute is set by each service as follows:alibabacloud-ai-searchservice:1000anthropicservice:50azureaistudioservice:240azureopenaiservice and task typetext_embedding:1440azureopenaiservice and task typecompletion:120cohereservice:10000contextualaiservice:1000elasticservice and task typechat_completion:240googleaistudioservice:360googlevertexaiservice:30000hugging_faceservice:3000jinaaiservice:2000llamaservice:3000mistralservice:240openaiservice and task typetext_embedding:3000openaiservice and task typecompletion:500voyageaiservice:2000watsonxaiservice:120
API name:
requests_per_minute -
serialize
Serialize this object to JSON.- Specified by:
serializein interfaceJsonpSerializable
-
serializeInternal
-
toString
-
setupRateLimitSettingDeserializer
protected static void setupRateLimitSettingDeserializer(ObjectDeserializer<RateLimitSetting.Builder> op)
-