DataFrame-based machine learning APIs to let users quickly assemble and configure practical machine learning pipelines.
RDD-based machine learning APIs (in maintenance mode).
RDD-based machine learning APIs (in maintenance mode).
The spark.mllib
package is in maintenance mode as of the Spark 2.0.0 release to encourage
migration to the DataFrame-based APIs under the org.apache.spark.ml package.
While in maintenance mode,
spark.mllib
package will be accepted, unless they block
implementing new features in the DataFrame-based spark.ml
package;The developers will continue adding more features to the DataFrame-based APIs in the 2.x series to reach feature parity with the RDD-based APIs. And once we reach feature parity, this package will be deprecated.
SPARK-4591 to track the progress of feature parity