A set of data which is used in the model optimization process.
A set of data which is used in the model optimization process. The FeatureSet can be access in a random data sample sequence. In the training process, the data sequence is a looped endless sequence. While in the validation process, the data sequence is a limited length sequence. User can use the data() method to get the data sequence.
The sequence of the data is not fixed. It can be changed by the shuffle() method.
Data type
Represent a sequence of data
Wrap a RDD as a FeatureSet.
Wrap a RDD as a FeatureSet.
Wrap a RDD as a FeatureSet. RDD will be persist on local disk, and will load one slice of the data to memory for the training.
Represent a distributed data.
Represent a distributed data. Use RDD to go through all data.