Class FineTuningJob.Method.Dpo.Hyperparameters.Beta

    public final class FineTuningJob.Method.Dpo.Hyperparameters.Beta

    The beta value for the DPO method. A higher beta value increases the weight of the penalty on divergence between the policy and the reference model.
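
To make the role of beta concrete, the following is a minimal sketch of the per-pair loss from the standard DPO formulation, in which beta scales the log-probability ratio between the policy and the reference model. It is an illustration only and not the service's training code: the class `DpoBetaSketch`, its methods, and the sample log-probabilities are all hypothetical and are not part of this SDK.

```java
public final class DpoBetaSketch {

    /** Numerically stable log(1 + exp(x)). */
    static double softplus(double x) {
        return Math.max(x, 0.0) + Math.log1p(Math.exp(-Math.abs(x)));
    }

    /**
     * DPO loss for one preference pair: -log(sigmoid(beta * margin)), where
     * margin = (policyChosen - refChosen) - (policyRejected - refRejected).
     * All four log-probability arguments are sequence-level log-probabilities.
     * (Hypothetical helper, assuming the standard DPO objective.)
     */
    static double dpoLoss(double beta,
                          double policyChosen, double policyRejected,
                          double refChosen, double refRejected) {
        double margin = (policyChosen - refChosen) - (policyRejected - refRejected);
        // -log(sigmoid(z)) is the same as softplus(-z).
        return softplus(-beta * margin);
    }

    public static void main(String[] args) {
        // Made-up log-probabilities for a single (chosen, rejected) pair.
        double policyChosen = -12.0, policyRejected = -15.0;
        double refChosen = -13.0, refRejected = -14.0;

        for (double beta : new double[] {0.1, 0.5, 2.0}) {
            double loss = dpoLoss(beta, policyChosen, policyRejected, refChosen, refRejected);
            System.out.printf("beta=%.1f loss=%.4f%n", beta, loss);
        }
    }
}
```

In this sketch, beta multiplies how far the policy has moved from the reference on the pair. With a larger beta, the same amount of movement is enough to drive the loss toward zero, so the policy has less incentive to drift from the reference model; with a smaller beta, larger deviations are needed before the loss saturates.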