All Implemented Interfaces:


public final class ReinforcementHyperparameters

The hyperparameters used for the reinforcement fine-tuning job.

Nested Class Summary

Nested Classes
Modifier and Type	Class	Description
`public final class`	`ReinforcementHyperparameters.Builder`	A builder for ReinforcementHyperparameters.
`public final class`	`ReinforcementHyperparameters.BatchSize`	Number of examples in each batch. A larger batch size means that model parameters are updated less frequently, but with lower variance.
`public final class`	`ReinforcementHyperparameters.ComputeMultiplier`	Multiplier on amount of compute used for exploring search space during training.
`public final class`	`ReinforcementHyperparameters.EvalInterval`	The number of training steps between evaluation runs.
`public final class`	`ReinforcementHyperparameters.EvalSamples`	Number of evaluation samples to generate per training step.
`public final class`	`ReinforcementHyperparameters.LearningRateMultiplier`	Scaling factor for the learning rate. A smaller learning rate may be useful to avoid overfitting.
`public final class`	`ReinforcementHyperparameters.NEpochs`	The number of epochs to train the model for. An epoch refers to one full cycle through the training dataset.
`public final class`	`ReinforcementHyperparameters.ReasoningEffort`	Level of reasoning effort.

Field Summary

Fields
Modifier and Type	Field	Description

Constructor Summary

Constructors
Constructor	Description

Enum Constant Summary

Enum Constants
Enum Constant	Description

Method Summary

Modifier and Type	Method	Description
`final Optional<ReinforcementHyperparameters.BatchSize>`	`batchSize()`	Number of examples in each batch.
`final Optional<ReinforcementHyperparameters.ComputeMultiplier>`	`computeMultiplier()`	Multiplier on amount of compute used for exploring search space during training.
`final Optional<ReinforcementHyperparameters.EvalInterval>`	`evalInterval()`	The number of training steps between evaluation runs.
`final Optional<ReinforcementHyperparameters.EvalSamples>`	`evalSamples()`	Number of evaluation samples to generate per training step.
`final Optional<ReinforcementHyperparameters.LearningRateMultiplier>`	`learningRateMultiplier()`	Scaling factor for the learning rate.
`final Optional<ReinforcementHyperparameters.NEpochs>`	`nEpochs()`	The number of epochs to train the model for.
`final Optional<ReinforcementHyperparameters.ReasoningEffort>`	`reasoningEffort()`	Level of reasoning effort.
`final JsonField<ReinforcementHyperparameters.BatchSize>`	`_batchSize()`	Returns the raw JSON value of batchSize.
`final JsonField<ReinforcementHyperparameters.ComputeMultiplier>`	`_computeMultiplier()`	Returns the raw JSON value of computeMultiplier.
`final JsonField<ReinforcementHyperparameters.EvalInterval>`	`_evalInterval()`	Returns the raw JSON value of evalInterval.
`final JsonField<ReinforcementHyperparameters.EvalSamples>`	`_evalSamples()`	Returns the raw JSON value of evalSamples.
`final JsonField<ReinforcementHyperparameters.LearningRateMultiplier>`	`_learningRateMultiplier()`	Returns the raw JSON value of learningRateMultiplier.
`final JsonField<ReinforcementHyperparameters.NEpochs>`	`_nEpochs()`	Returns the raw JSON value of nEpochs.
`final JsonField<ReinforcementHyperparameters.ReasoningEffort>`	`_reasoningEffort()`	Returns the raw JSON value of reasoningEffort.
`final Map<String, JsonValue>`	`_additionalProperties()`
`final ReinforcementHyperparameters.Builder`	`toBuilder()`
`final ReinforcementHyperparameters`	`validate()`
`final Boolean`	`isValid()`
`Boolean`	`equals(Object other)`
`Integer`	`hashCode()`
`String`	`toString()`
`final static ReinforcementHyperparameters.Builder`	`builder()`	Returns a mutable builder for constructing an instance of ReinforcementHyperparameters.

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Constructor Detail
- Method Detail
  - batchSize
```
 final Optional<ReinforcementHyperparameters.BatchSize> batchSize()
```
    Number of examples in each batch. A larger batch size means that model parameters are updated less frequently, but with lower variance.
  - computeMultiplier
```
 final Optional<ReinforcementHyperparameters.ComputeMultiplier> computeMultiplier()
```
    Multiplier on amount of compute used for exploring search space during training.
  - evalInterval
```
 final Optional<ReinforcementHyperparameters.EvalInterval> evalInterval()
```
    The number of training steps between evaluation runs.
  - evalSamples
```
 final Optional<ReinforcementHyperparameters.EvalSamples> evalSamples()
```
    Number of evaluation samples to generate per training step.
  - learningRateMultiplier
```
 final Optional<ReinforcementHyperparameters.LearningRateMultiplier> learningRateMultiplier()
```
    Scaling factor for the learning rate. A smaller learning rate may be useful to avoid overfitting.
  - nEpochs
```
 final Optional<ReinforcementHyperparameters.NEpochs> nEpochs()
```
    The number of epochs to train the model for. An epoch refers to one full cycle through the training dataset.
  - reasoningEffort
```
 final Optional<ReinforcementHyperparameters.ReasoningEffort> reasoningEffort()
```
    Level of reasoning effort.
  - _batchSize
```
 final JsonField<ReinforcementHyperparameters.BatchSize> _batchSize()
```
    Returns the raw JSON value of batchSize.
    Unlike batchSize, this method doesn't throw if the JSON field has an unexpected type.
  - _computeMultiplier
```
 final JsonField<ReinforcementHyperparameters.ComputeMultiplier> _computeMultiplier()
```
    Returns the raw JSON value of computeMultiplier.
    Unlike computeMultiplier, this method doesn't throw if the JSON field has an unexpected type.
  - _evalInterval
```
 final JsonField<ReinforcementHyperparameters.EvalInterval> _evalInterval()
```
    Returns the raw JSON value of evalInterval.
    Unlike evalInterval, this method doesn't throw if the JSON field has an unexpected type.
  - _evalSamples
```
 final JsonField<ReinforcementHyperparameters.EvalSamples> _evalSamples()
```
    Returns the raw JSON value of evalSamples.
    Unlike evalSamples, this method doesn't throw if the JSON field has an unexpected type.
  - _learningRateMultiplier
```
 final JsonField<ReinforcementHyperparameters.LearningRateMultiplier> _learningRateMultiplier()
```
    Returns the raw JSON value of learningRateMultiplier.
    Unlike learningRateMultiplier, this method doesn't throw if the JSON field has an unexpected type.
  - _nEpochs
```
 final JsonField<ReinforcementHyperparameters.NEpochs> _nEpochs()
```
    Returns the raw JSON value of nEpochs.
    Unlike nEpochs, this method doesn't throw if the JSON field has an unexpected type.
  - _reasoningEffort
```
 final JsonField<ReinforcementHyperparameters.ReasoningEffort> _reasoningEffort()
```
    Returns the raw JSON value of reasoningEffort.
    Unlike reasoningEffort, this method doesn't throw if the JSON field has an unexpected type.
  - _additionalProperties
```
 final Map<String, JsonValue> _additionalProperties()
```
  - toBuilder
```
 final ReinforcementHyperparameters.Builder toBuilder()
```
  - validate
```
 final ReinforcementHyperparameters validate()
```
  - isValid
```
 final Boolean isValid()
```
  - equals
```
 Boolean equals(Object other)
```
  - hashCode
```
 Integer hashCode()
```
  - toString
```
 String toString()
```
  - builder
```
 final static ReinforcementHyperparameters.Builder builder()
```
    Returns a mutable builder for constructing an instance of ReinforcementHyperparameters.

Class ReinforcementHyperparameters

Nested Class Summary

Field Summary

Constructor Summary

Enum Constant Summary

Method Summary

Methods inherited from class java.lang.Object

Constructor Detail

Method Detail

batchSize

computeMultiplier

evalInterval

evalSamples

learningRateMultiplier

nEpochs

reasoningEffort

_batchSize

_computeMultiplier

_evalInterval

_evalSamples

_learningRateMultiplier

_nEpochs

_reasoningEffort

_additionalProperties

toBuilder

validate

isValid

equals

hashCode

toString

builder