Class FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder
public final class FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder
A builder for Hyperparameters.
Method Detail
-
batchSize
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder batchSize(FineTuningJobCreateParams.Method.Dpo.Hyperparameters.BatchSize batchSize)
Number of examples in each batch. A larger batch size means that model parameters are updated less frequently, but with lower variance.
-
batchSize
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder batchSize(JsonField<FineTuningJobCreateParams.Method.Dpo.Hyperparameters.BatchSize> batchSize)
Number of examples in each batch. A larger batch size means that model parameters are updated less frequently, but with lower variance.
-
batchSize
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder batchSize(Long manual)
Sets the number of examples in each batch to an explicit value. A larger batch size means that model parameters are updated less frequently, but with lower variance.
-
batchSizeAuto
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder batchSizeAuto()
Sets the batch size to auto so that the service chooses the number of examples in each batch. A larger batch size means that model parameters are updated less frequently, but with lower variance.
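To make the trade-off concrete, the sketch below counts parameter updates per epoch for a few batch sizes. It assumes one update per batch and that a trailing partial batch still triggers an update; the class name, method name, and dataset size are hypothetical and not part of the SDK.

```java
public class BatchSizeSketch {
    // Assumption: one parameter update per batch; a trailing partial batch counts as one update.
    static long updatesPerEpoch(long numExamples, long batchSize) {
        if (numExamples < 0 || batchSize <= 0) {
            throw new IllegalArgumentException("numExamples must be >= 0 and batchSize must be > 0");
        }
        return (numExamples + batchSize - 1) / batchSize; // ceiling division
    }

    public static void main(String[] args) {
        long numExamples = 1_000; // hypothetical dataset size
        for (long batchSize : new long[] {4, 16, 64}) {
            System.out.println("batchSize=" + batchSize + " -> "
                    + updatesPerEpoch(numExamples, batchSize) + " updates per epoch");
        }
    }
}
```

Larger batches give fewer updates per pass over the data, which is the "updated less frequently" half of the description; the lower-variance half comes from averaging the gradient over more examples per update.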
-
beta
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder beta(FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Beta beta)
The beta value for the DPO method. A higher beta increases the weight of the penalty on divergence between the policy and the reference model.
-
beta
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder beta(JsonField<FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Beta> beta)
The beta value for the DPO method. A higher beta increases the weight of the penalty on divergence between the policy and the reference model.
-
beta
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder beta(Double manual)
Sets an explicit beta value for the DPO method. A higher beta increases the weight of the penalty on divergence between the policy and the reference model.
-
betaAuto
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder betaAuto()
Sets beta to auto so that the service chooses the value. A higher beta increases the weight of the penalty on divergence between the policy and the reference model.
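For intuition about what beta scales, the sketch below evaluates the per-example loss from the standard DPO formulation, in which beta multiplies the difference between the policy-versus-reference log-probability ratios of the chosen and rejected responses. That the service uses exactly this loss is an assumption; this page does not say. The class name, method names, and log-probability values are hypothetical.

```java
public class DpoBetaSketch {
    // Numerically stable log(sigmoid(z)).
    static double logSigmoid(double z) {
        return z >= 0 ? -Math.log1p(Math.exp(-z)) : z - Math.log1p(Math.exp(z));
    }

    // Standard DPO per-example loss (assumed form):
    //   -log sigmoid(beta * ((policyChosen - refChosen) - (policyRejected - refRejected)))
    // All four inputs are log-probabilities of a response under the policy or reference model.
    static double dpoLoss(double beta, double policyChosen, double policyRejected,
                          double refChosen, double refRejected) {
        double margin = (policyChosen - refChosen) - (policyRejected - refRejected);
        return -logSigmoid(beta * margin);
    }

    public static void main(String[] args) {
        // Hypothetical log-probabilities: the policy prefers the chosen response
        // slightly more than the reference model does.
        double policyChosen = -10.0, policyRejected = -12.0;
        double refChosen = -10.5, refRejected = -11.5;
        for (double beta : new double[] {0.1, 0.5, 2.0}) {
            System.out.println("beta=" + beta + " loss="
                    + dpoLoss(beta, policyChosen, policyRejected, refChosen, refRejected));
        }
    }
}
```

With a fixed margin, a larger beta amplifies how strongly a given deviation from the reference model is reflected in the loss, in either direction.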
-
learningRateMultiplier
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder learningRateMultiplier(FineTuningJobCreateParams.Method.Dpo.Hyperparameters.LearningRateMultiplier learningRateMultiplier)
Scaling factor for the learning rate. A smaller learning rate may be useful to avoid overfitting.
-
learningRateMultiplier
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder learningRateMultiplier(JsonField<FineTuningJobCreateParams.Method.Dpo.Hyperparameters.LearningRateMultiplier> learningRateMultiplier)
Scaling factor for the learning rate. A smaller learning rate may be useful to avoid overfitting.
-
learningRateMultiplier
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder learningRateMultiplier(Double manual)
Sets an explicit scaling factor for the learning rate. A smaller learning rate may be useful to avoid overfitting.
-
learningRateMultiplierAuto
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder learningRateMultiplierAuto()
Sets the learning rate scaling factor to auto so that the service chooses it. A smaller learning rate may be useful to avoid overfitting.
-
nEpochs
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder nEpochs(FineTuningJobCreateParams.Method.Dpo.Hyperparameters.NEpochs nEpochs)
The number of epochs to train the model for. An epoch refers to one full cycle through the training dataset.
-
nEpochs
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder nEpochs(JsonField<FineTuningJobCreateParams.Method.Dpo.Hyperparameters.NEpochs> nEpochs)
The number of epochs to train the model for. An epoch refers to one full cycle through the training dataset.
-
nEpochs
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder nEpochs(Long manual)
Sets an explicit number of epochs to train the model for. An epoch is one full pass through the training dataset.
-
nEpochsAuto
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder nEpochsAuto()
Sets the number of epochs to auto so that the service chooses it. An epoch is one full pass through the training dataset.
-
additionalProperties
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder additionalProperties(Map<String, JsonValue> additionalProperties)
-
putAdditionalProperty
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder putAdditionalProperty(String key, JsonValue value)
-
putAllAdditionalProperties
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder putAllAdditionalProperties(Map<String, JsonValue> additionalProperties)
-
removeAdditionalProperty
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder removeAdditionalProperty(String key)
-
removeAllAdditionalProperties
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters.Builder removeAllAdditionalProperties(Set<String> keys)
-
build
final FineTuningJobCreateParams.Method.Dpo.Hyperparameters build()
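The overload pattern on this page (a typed setter, a manual-value setter, and an auto setter per hyperparameter, finished by build()) can be pictured with a small stand-alone model. This is a sketch only: it assumes the manual and auto setters simply forward to the typed setter, and it defaults unset fields to auto for brevity. `AutoOr` and every class below are hypothetical stand-ins, not SDK code.

```java
import java.util.Objects;
import java.util.Optional;

public class HyperparametersBuilderSketch {
    // Hypothetical stand-in for a value that is either "auto" or set manually.
    static final class AutoOr<T> {
        private final T manual; // null encodes "auto"
        private AutoOr(T manual) { this.manual = manual; }
        static <T> AutoOr<T> ofAuto() { return new AutoOr<>(null); }
        static <T> AutoOr<T> ofManual(T value) { return new AutoOr<>(Objects.requireNonNull(value)); }
        boolean isAuto() { return manual == null; }
        Optional<T> manual() { return Optional.ofNullable(manual); }
    }

    // Immutable result object, modeling only two of the four hyperparameters.
    static final class Hyperparameters {
        final AutoOr<Long> batchSize;
        final AutoOr<Double> beta;
        Hyperparameters(AutoOr<Long> batchSize, AutoOr<Double> beta) {
            this.batchSize = batchSize;
            this.beta = beta;
        }
    }

    static final class Builder {
        // Simplification: unset fields default to auto in this sketch.
        private AutoOr<Long> batchSize = AutoOr.<Long>ofAuto();
        private AutoOr<Double> beta = AutoOr.<Double>ofAuto();

        Builder batchSize(AutoOr<Long> batchSize) { this.batchSize = batchSize; return this; }
        // Assumed behavior: convenience setters forward to the typed setter.
        Builder batchSize(long manual) { return batchSize(AutoOr.<Long>ofManual(manual)); }
        Builder batchSizeAuto() { return batchSize(AutoOr.<Long>ofAuto()); }

        Builder beta(AutoOr<Double> beta) { this.beta = beta; return this; }
        Builder beta(double manual) { return beta(AutoOr.<Double>ofManual(manual)); }
        Builder betaAuto() { return beta(AutoOr.<Double>ofAuto()); }

        Hyperparameters build() { return new Hyperparameters(batchSize, beta); }
    }

    public static void main(String[] args) {
        Hyperparameters hp = new Builder().batchSize(8L).betaAuto().build();
        System.out.println("batchSize auto? " + hp.batchSize.isAuto());
        System.out.println("beta auto? " + hp.beta.isAuto());
    }
}
```

Each setter returns the builder, so calls chain, and the last call for a given field wins.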