Class DpoHyperparameters
- 
                    
                    - All Implemented Interfaces:
 
 public final class DpoHyperparametersThe hyperparameters used for the DPO fine-tuning job. 
- 
                
                    
                    - 
                                
                            
                                Nested Class SummaryNested Classes Modifier and Type Class Description public final classDpoHyperparameters.BuilderA builder for DpoHyperparameters. public final classDpoHyperparameters.BatchSizeNumber of examples in each batch. A larger batch size means that model parameters are updated less frequently, but with lower variance. public final classDpoHyperparameters.BetaThe beta value for the DPO method. A higher beta value will increase the weight of the penalty between the policy and reference model. public final classDpoHyperparameters.LearningRateMultiplierScaling factor for the learning rate. A smaller learning rate may be useful to avoid overfitting. public final classDpoHyperparameters.NEpochsThe number of epochs to train the model for. An epoch refers to one full cycle through the training dataset. 
 - 
                                
                            
                                Method Summary- 
                    
                    
                    - 
                                
                            
                                Method Detail- 
                                        batchSizefinal Optional<DpoHyperparameters.BatchSize> batchSize() Number of examples in each batch. A larger batch size means that model parameters are updated less frequently, but with lower variance. 
 - 
                                        betafinal Optional<DpoHyperparameters.Beta> beta() The beta value for the DPO method. A higher beta value will increase the weight of the penalty between the policy and reference model. 
 - 
                                        learningRateMultiplierfinal Optional<DpoHyperparameters.LearningRateMultiplier> learningRateMultiplier() Scaling factor for the learning rate. A smaller learning rate may be useful to avoid overfitting. 
 - 
                                        nEpochsfinal Optional<DpoHyperparameters.NEpochs> nEpochs() The number of epochs to train the model for. An epoch refers to one full cycle through the training dataset. 
 - 
                                        _batchSizefinal JsonField<DpoHyperparameters.BatchSize> _batchSize() Returns the raw JSON value of batchSize. Unlike batchSize, this method doesn't throw if the JSON field has an unexpected type. 
 - 
                                        _betafinal JsonField<DpoHyperparameters.Beta> _beta() Returns the raw JSON value of beta. Unlike beta, this method doesn't throw if the JSON field has an unexpected type. 
 - 
                                        _learningRateMultiplierfinal JsonField<DpoHyperparameters.LearningRateMultiplier> _learningRateMultiplier() Returns the raw JSON value of learningRateMultiplier. Unlike learningRateMultiplier, this method doesn't throw if the JSON field has an unexpected type. 
 - 
                                        _nEpochsfinal JsonField<DpoHyperparameters.NEpochs> _nEpochs() Returns the raw JSON value of nEpochs. Unlike nEpochs, this method doesn't throw if the JSON field has an unexpected type. 
 - 
                                        _additionalPropertiesfinal Map<String, JsonValue> _additionalProperties() 
 - 
                                        toBuilderfinal DpoHyperparameters.Builder toBuilder() 
 - 
                                        validatefinal DpoHyperparameters validate() 
 - 
                                        builderfinal static DpoHyperparameters.Builder builder() Returns a mutable builder for constructing an instance of DpoHyperparameters. 
 
- 
                                        
 
- 
                                
                            
                                
 
- 
                    
                    
                    
 
-