:: Experimental :: Represents a classification model that predicts to which of a set of categories an example belongs.
Classification model trained using Logistic Regression.
Train a classification model for Logistic Regression using Limited-memory BFGS.
Train a classification model for Logistic Regression using Limited-memory BFGS. Standard feature scaling and L2 regularization are used by default. NOTE: Labels used in Logistic Regression should be {0, 1}
Train a classification model for Logistic Regression using Stochastic Gradient Descent.
Train a classification model for Logistic Regression using Stochastic Gradient Descent. By default L2 regularization is used, which can be changed via LogisticRegressionWithSGD.optimizer. NOTE: Labels used in Logistic Regression should be {0, 1}. Using LogisticRegressionWithLBFGS is recommended over this.
Trains a Naive Bayes model given an RDD of (label, features)
pairs.
Trains a Naive Bayes model given an RDD of (label, features)
pairs.
This is the Multinomial NB (http://tinyurl.com/lsdw6p) which can handle all kinds of discrete data. For example, by converting documents into TF-IDF vectors, it can be used for document classification. By making every vector a 0-1 vector, it can also be used as Bernoulli NB (http://tinyurl.com/p7c96j6). The input feature values must be nonnegative.
Model for Naive Bayes Classifiers.
Model for Support Vector Machines (SVMs).
Train a Support Vector Machine (SVM) using Stochastic Gradient Descent.
Train a Support Vector Machine (SVM) using Stochastic Gradient Descent. By default L2 regularization is used, which can be changed via SVMWithSGD.optimizer. NOTE: Labels used in SVM should be {0, 1}.
Top-level methods for calling Logistic Regression using Stochastic Gradient Descent.
Top-level methods for calling Logistic Regression using Stochastic Gradient Descent. NOTE: Labels used in Logistic Regression should be {0, 1}
Top-level methods for calling naive Bayes.
Top-level methods for calling SVM.
Top-level methods for calling SVM. NOTE: Labels used in SVM should be {0, 1}.
:: Experimental :: Represents a classification model that predicts to which of a set of categories an example belongs. The categories are represented by double values: 0.0, 1.0, 2.0, etc.