Interface ChatModelConfig
public interface ChatModelConfig
Method Summary

adToken()
    The Azure AD token to use for this operation.
apiKey()
    Azure OpenAI API key.
apiVersion()
    The API version to use for this operation.
deploymentName()
    This property will override the quarkus.langchain4j.azure-openai.deployment-name specifically for chat models if it is set.
domainName()
    This property will override the quarkus.langchain4j.azure-openai.domain-name specifically for chat models if it is set.
endpoint()
    This property will override the quarkus.langchain4j.azure-openai.endpoint specifically for chat models if it is set.
frequencyPenalty()
    Number between -2.0 and 2.0.
logRequests()
    Whether chat model requests should be logged.
logResponses()
    Whether chat model responses should be logged.
maxTokens()
    The maximum number of tokens to generate in the completion.
presencePenalty()
    Number between -2.0 and 2.0.
resourceName()
    This property will override the quarkus.langchain4j.azure-openai.resource-name specifically for chat models if it is set.
responseFormat()
    The response format the model should use.
seed()
    If specified, the system will make a best effort to sample deterministically, such that repeated requests with the same seed and parameters should return the same result.
temperature()
    What sampling temperature to use, with values between 0 and 2.
topP()
    An alternative to sampling with temperature, called nucleus sampling, where the model considers only the tokens comprising the topP probability mass.
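For orientation, the summary above maps to an interface shape along these lines. This is a hypothetical reconstruction: the method names come from this page, but every return type below is an assumed placeholder (the real config mapping may use primitives, default-value annotations, or other wrapper types).

import java.util.Optional;

public interface ChatModelConfig {
    // Azure resource addressing; each value overrides its
    // quarkus.langchain4j.azure-openai.* counterpart for chat models.
    Optional<String> resourceName();
    Optional<String> domainName();
    Optional<String> deploymentName();
    Optional<String> endpoint();

    // Authentication and API versioning.
    Optional<String> adToken();
    Optional<String> apiVersion();
    Optional<String> apiKey();

    // Sampling and generation parameters.
    Optional<Double> temperature();
    Optional<Double> topP();
    Optional<Integer> seed();
    Optional<Integer> maxTokens();
    Optional<Double> presencePenalty();
    Optional<Double> frequencyPenalty();

    // Diagnostics and output shape.
    Optional<Boolean> logRequests();
    Optional<Boolean> logResponses();
    Optional<String> responseFormat();
}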
Method Details
resourceName()
    This property will override the quarkus.langchain4j.azure-openai.resource-name specifically for chat models if it is set.
domainName()
    This property will override the quarkus.langchain4j.azure-openai.domain-name specifically for chat models if it is set.
deploymentName()
    This property will override the quarkus.langchain4j.azure-openai.deployment-name specifically for chat models if it is set.
endpoint()
    This property will override the quarkus.langchain4j.azure-openai.endpoint specifically for chat models if it is set.
adToken()
    The Azure AD token to use for this operation.
apiVersion()
    The API version to use for this operation.
apiKey()
    Azure OpenAI API key.
temperature()
    What sampling temperature to use, with values between 0 and 2. Higher values mean the model will take more risks. A value of 0.9 is good for more creative applications, while 0 (argmax sampling) is good for those with a well-defined answer. It is recommended to alter this or topP, but not both.
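A minimal sketch of temperature scaling, the standard mechanism behind this knob (an assumption; this page documents the parameter, not the math). Logits are divided by the temperature before the softmax, so temperatures near 0 approach argmax sampling while higher values flatten the distribution:

import java.util.Arrays;

public class TemperatureSketch {
    // Softmax over logits scaled by 1/temperature (temperature must be > 0).
    static double[] softmax(double[] logits, double temperature) {
        double max = Double.NEGATIVE_INFINITY;
        for (double l : logits) max = Math.max(max, l);
        double[] out = new double[logits.length];
        double sum = 0.0;
        for (int i = 0; i < logits.length; i++) {
            out[i] = Math.exp((logits[i] - max) / temperature); // shift by max for numeric stability
            sum += out[i];
        }
        for (int i = 0; i < out.length; i++) out[i] /= sum;
        return out;
    }

    public static void main(String[] args) {
        double[] logits = {2.0, 1.0, 0.1};
        System.out.println(Arrays.toString(softmax(logits, 0.2))); // sharply peaked at the top logit
        System.out.println(Arrays.toString(softmax(logits, 2.0))); // much flatter distribution
    }
}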
topP()
    An alternative to sampling with temperature, called nucleus sampling, where the model considers only the results of the tokens comprising the topP probability mass. A value of 0.1 means only the tokens comprising the top 10% probability mass are considered. It is recommended to alter this or temperature, but not both.
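The truncation itself is simple to illustrate. Below is a hedged sketch of nucleus filtering as conventionally defined (again an assumption; the page describes the parameter, not an implementation): tokens are taken in descending probability order until their cumulative probability reaches topP, and everything else is discarded before sampling.

import java.util.ArrayList;
import java.util.List;

public class TopPSketch {
    // probs must be sorted in descending order; returns the indices kept.
    static List<Integer> nucleus(double[] probs, double topP) {
        List<Integer> kept = new ArrayList<>();
        double cumulative = 0.0;
        for (int i = 0; i < probs.length; i++) {
            kept.add(i);
            cumulative += probs[i];
            if (cumulative >= topP) break; // top-p mass covered
        }
        return kept;
    }

    public static void main(String[] args) {
        double[] probs = {0.50, 0.20, 0.15, 0.10, 0.05};
        System.out.println(nucleus(probs, 0.1)); // [0]: the top 10% mass is a single token
        System.out.println(nucleus(probs, 0.9)); // [0, 1, 2, 3]: cumulative 0.95 >= 0.9
    }
}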
seed()
    If specified, the system will make a best effort to sample deterministically, so that repeated requests with the same seed and parameters return the same result. Determinism is not guaranteed. Support for reproducible output was first added in API version 2023-12-01-preview.
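As a loose local analogy (not the service's actual mechanism), a fixed seed makes sampling from a given distribution repeatable, which is the behavior this parameter aims for on a best-effort basis:

import java.util.Random;

public class SeededSamplingSketch {
    // Draw one index from a discrete distribution using a seeded PRNG.
    static int sample(double[] probs, long seed) {
        double r = new Random(seed).nextDouble();
        double cumulative = 0.0;
        for (int i = 0; i < probs.length; i++) {
            cumulative += probs[i];
            if (r < cumulative) return i;
        }
        return probs.length - 1; // guard against floating-point rounding
    }

    public static void main(String[] args) {
        double[] probs = {0.5, 0.3, 0.2};
        // Same seed, same distribution: the same token on every run.
        System.out.println(sample(probs, 42L) == sample(probs, 42L)); // true
    }
}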
maxTokens()
    The maximum number of tokens to generate in the completion.
presencePenalty()
    Number between -2.0 and 2.0. Positive values penalize new tokens based on whether they appear in the text so far, increasing the model's likelihood to talk about new topics. See the sketch under frequencyPenalty below.
frequencyPenalty()
    Number between -2.0 and 2.0. Positive values penalize new tokens based on their existing frequency in the text so far, decreasing the model's likelihood to repeat the same line verbatim.
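The sketch below covers both presencePenalty and frequencyPenalty, using the adjustment formula from OpenAI's documentation (an assumption here; this page states the ranges and intent, not the formula). The penalized logit is logit - count * frequencyPenalty - (count > 0 ? presencePenalty : 0): the frequency penalty grows with each repetition, while the presence penalty is a flat charge for any prior appearance.

public class PenaltySketch {
    // Apply presence and frequency penalties to a single token's logit,
    // given how many times that token has already appeared (countSoFar).
    static double penalize(double logit, int countSoFar,
                           double presencePenalty, double frequencyPenalty) {
        double adjusted = logit - countSoFar * frequencyPenalty;
        if (countSoFar > 0) {
            adjusted -= presencePenalty; // flat penalty for any prior appearance
        }
        return adjusted;
    }

    public static void main(String[] args) {
        // A token already emitted three times loses more logit mass under a
        // frequency penalty than under a presence penalty of the same magnitude.
        System.out.println(penalize(1.0, 3, 0.5, 0.0)); // 0.5  (presence only)
        System.out.println(penalize(1.0, 3, 0.0, 0.5)); // -0.5 (frequency only)
    }
}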
logRequests()
    Whether chat model requests should be logged.
logResponses()
    Whether chat model responses should be logged.
responseFormat()
    The response format the model should use.