Class TextInferenceConfig
- java.lang.Object
-
- software.amazon.awssdk.services.bedrockagentruntime.model.TextInferenceConfig
-
- All Implemented Interfaces:
Serializable
,SdkPojo
,ToCopyableBuilder<TextInferenceConfig.Builder,TextInferenceConfig>
@Generated("software.amazon.awssdk:codegen") public final class TextInferenceConfig extends Object implements SdkPojo, Serializable, ToCopyableBuilder<TextInferenceConfig.Builder,TextInferenceConfig>
Configuration settings for text generation using a language model via the RetrieveAndGenerate operation. Includes parameters like temperature, top-p, maximum token count, and stop sequences.
The valid range of
maxTokens
depends on the accepted values for your chosen model's inference parameters. To see the inference parameters for your model, see Inference parameters for foundation models.- See Also:
- Serialized Form
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static interface
TextInferenceConfig.Builder
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description static TextInferenceConfig.Builder
builder()
boolean
equals(Object obj)
boolean
equalsBySdkFields(Object obj)
<T> Optional<T>
getValueForField(String fieldName, Class<T> clazz)
int
hashCode()
boolean
hasStopSequences()
For responses, this returns true if the service returned a value for the StopSequences property.Integer
maxTokens()
The maximum number of tokens to generate in the output text.List<SdkField<?>>
sdkFields()
static Class<? extends TextInferenceConfig.Builder>
serializableBuilderClass()
List<String>
stopSequences()
A list of sequences of characters that, if generated, will cause the model to stop generating further tokens.Float
temperature()
Controls the random-ness of text generated by the language model, influencing how much the model sticks to the most predictable next words versus exploring more surprising options.TextInferenceConfig.Builder
toBuilder()
Float
topP()
A probability distribution threshold which controls what the model considers for the set of possible next tokens.String
toString()
Returns a string representation of this object.-
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
-
Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
-
-
-
Method Detail
-
maxTokens
public final Integer maxTokens()
The maximum number of tokens to generate in the output text. Do not use the minimum of 0 or the maximum of 65536. The limit values described here are arbitary values, for actual values consult the limits defined by your specific model.
- Returns:
- The maximum number of tokens to generate in the output text. Do not use the minimum of 0 or the maximum of 65536. The limit values described here are arbitary values, for actual values consult the limits defined by your specific model.
-
hasStopSequences
public final boolean hasStopSequences()
For responses, this returns true if the service returned a value for the StopSequences property. This DOES NOT check that the value is non-empty (for which, you should check theisEmpty()
method on the property). This is useful because the SDK will never return a null collection or map, but you may need to differentiate between the service returning nothing (or null) and the service returning an empty collection or map. For requests, this returns true if a value for the property was specified in the request builder, and false if a value was not specified.
-
stopSequences
public final List<String> stopSequences()
A list of sequences of characters that, if generated, will cause the model to stop generating further tokens. Do not use a minimum length of 1 or a maximum length of 1000. The limit values described here are arbitary values, for actual values consult the limits defined by your specific model.
Attempts to modify the collection returned by this method will result in an UnsupportedOperationException.
This method will never return null. If you would like to know whether the service returned this field (so that you can differentiate between null and empty), you can use the
hasStopSequences()
method.- Returns:
- A list of sequences of characters that, if generated, will cause the model to stop generating further tokens. Do not use a minimum length of 1 or a maximum length of 1000. The limit values described here are arbitary values, for actual values consult the limits defined by your specific model.
-
temperature
public final Float temperature()
Controls the random-ness of text generated by the language model, influencing how much the model sticks to the most predictable next words versus exploring more surprising options. A lower temperature value (e.g. 0.2 or 0.3) makes model outputs more deterministic or predictable, while a higher temperature (e.g. 0.8 or 0.9) makes the outputs more creative or unpredictable.
- Returns:
- Controls the random-ness of text generated by the language model, influencing how much the model sticks to the most predictable next words versus exploring more surprising options. A lower temperature value (e.g. 0.2 or 0.3) makes model outputs more deterministic or predictable, while a higher temperature (e.g. 0.8 or 0.9) makes the outputs more creative or unpredictable.
-
topP
public final Float topP()
A probability distribution threshold which controls what the model considers for the set of possible next tokens. The model will only consider the top p% of the probability distribution when generating the next token.
- Returns:
- A probability distribution threshold which controls what the model considers for the set of possible next tokens. The model will only consider the top p% of the probability distribution when generating the next token.
-
toBuilder
public TextInferenceConfig.Builder toBuilder()
- Specified by:
toBuilder
in interfaceToCopyableBuilder<TextInferenceConfig.Builder,TextInferenceConfig>
-
builder
public static TextInferenceConfig.Builder builder()
-
serializableBuilderClass
public static Class<? extends TextInferenceConfig.Builder> serializableBuilderClass()
-
equalsBySdkFields
public final boolean equalsBySdkFields(Object obj)
- Specified by:
equalsBySdkFields
in interfaceSdkPojo
-
toString
public final String toString()
Returns a string representation of this object. This is useful for testing and debugging. Sensitive data will be redacted from this string using a placeholder value.
-
-