Loads labeled data in the LIBSVM format into an SCollection[(Double, SparseVector)].
Loads labeled data in the LIBSVM format into an SCollection[(Double, SparseVector)]. The LIBSVM format is a text-based format used by LIBSVM and LIBLINEAR. Each line represents a labeled sparse feature vector using the following format: [label index1:value1 index2:value2 ...] where the indices are one-based and in ascending order.
file or directory path in any Hadoop-supported file system URI
number of features, which will be determined from the input data if a nonpositive value is given. This is useful when the data is split into multiple files and you want to load them separately, because some features may not present in certain files, which leads to inconsistent feature dimensions.
labeled data stored as an SCollection[(Double, SparseVector)]