Function to set parameters before passing into the validation step eg - do data balancing or dropping based on the labels
Function to set parameters before passing into the validation step eg - do data balancing or dropping based on the labels
Parameters set in examining data
Maximum size of dataset want to train on.
Maximum size of dataset want to train on. Value should be > 0. Default is 1000000.
Fraction of data to reserve for test Default is 0.1
Fraction of data to reserve for test Default is 0.1
Seed for data splitting
Seed for data splitting
Function to use to create the training set and test set.
Function to use to create the training set and test set.
(dataTrain, dataTest)
Function to use to prepare the dataset for modeling within the validation step eg - do data balancing or dropping based on the labels
Function to use to prepare the dataset for modeling within the validation step eg - do data balancing or dropping based on the labels
Training set test set
Add a splitter parameter to name the label column
Abstract class that will carry on the creation of training set + test set