Package com.openai.models.finetuning.methods
See: Description
-
Class Summary Class Description Builder A builder for DpoHyperparameters. BatchSize Number of examples in each batch. Beta The beta value for the DPO method. LearningRateMultiplier Scaling factor for the learning rate. NEpochs The number of epochs to train the model for. DpoHyperparameters The hyperparameters used for the DPO fine-tuning job. Builder A builder for DpoMethod. DpoMethod Configuration for the DPO fine-tuning method. Builder A builder for ReinforcementHyperparameters. BatchSize Number of examples in each batch. ComputeMultiplier Multiplier on amount of compute used for exploring search space during training. EvalInterval The number of training steps between evaluation runs. EvalSamples Number of evaluation samples to generate per training step. LearningRateMultiplier Scaling factor for the learning rate. NEpochs The number of epochs to train the model for. ReasoningEffort Level of reasoning effort. ReinforcementHyperparameters The hyperparameters used for the reinforcement fine-tuning job. Builder A builder for ReinforcementMethod. Grader The grader used for the fine-tuning job. ReinforcementMethod Configuration for the reinforcement fine-tuning method. Builder A builder for SupervisedHyperparameters. BatchSize Number of examples in each batch. LearningRateMultiplier Scaling factor for the learning rate. NEpochs The number of epochs to train the model for. SupervisedHyperparameters The hyperparameters used for the fine-tuning job. Builder A builder for SupervisedMethod. SupervisedMethod Configuration for the supervised fine-tuning method. -
Interface Summary Interface Description Visitor An interface that defines how to map each variant of BatchSize to a value of type T. Visitor An interface that defines how to map each variant of Beta to a value of type T. Visitor An interface that defines how to map each variant of LearningRateMultiplier to a value of type T. Visitor An interface that defines how to map each variant of NEpochs to a value of type T. Visitor An interface that defines how to map each variant of BatchSize to a value of type T. Visitor An interface that defines how to map each variant of ComputeMultiplier to a value of type T. Visitor An interface that defines how to map each variant of EvalInterval to a value of type T. Visitor An interface that defines how to map each variant of EvalSamples to a value of type T. Visitor An interface that defines how to map each variant of LearningRateMultiplier to a value of type T. Visitor An interface that defines how to map each variant of NEpochs to a value of type T. Visitor An interface that defines how to map each variant of Grader to a value of type T. Visitor An interface that defines how to map each variant of BatchSize to a value of type T. Visitor An interface that defines how to map each variant of LearningRateMultiplier to a value of type T. Visitor An interface that defines how to map each variant of NEpochs to a value of type T. -
Enum Summary Enum Description Known An enum containing ReasoningEffort's known values. Value An enum containing ReasoningEffort's known values, as well as an _UNKNOWN member.