aggregate pipeline
Allows copying of this RDD while changing some of its properties.
Creates a DataFrame based on the provided schema.
the schema representing the DataFrame.
a DataFrame.
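
As an illustration of the schema-based conversion, here is a minimal sketch. It assumes a connector version that exposes MongoRDD (configured through spark.mongodb.input.uri), a MongoDB instance on localhost, and a hypothetical test.people collection whose documents have a string "name" and an integer "age"; none of these details come from the reference itself.

```scala
import com.mongodb.spark.MongoSpark
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.types.{IntegerType, StringType, StructField, StructType}

object ToDFWithSchemaExample {
  def main(args: Array[String]): Unit = {
    // Assumption: MongoDB runs locally and test.people exists (hypothetical names).
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("ToDFWithSchemaExample")
      .config("spark.mongodb.input.uri", "mongodb://127.0.0.1/test.people")
      .getOrCreate()

    // Hypothetical schema describing the fields we want in the DataFrame.
    val schema = StructType(Seq(
      StructField("name", StringType, nullable = true),
      StructField("age", IntegerType, nullable = true)
    ))

    // MongoSpark.load returns a MongoRDD of BSON documents by default.
    val rdd = MongoSpark.load(spark.sparkContext)

    // Convert the RDD to a DataFrame using the explicit schema.
    val df = rdd.toDF(schema)
    df.printSchema()

    spark.stop()
  }
}
```

Supplying the schema explicitly avoids sampling the collection to infer it, which can be useful when the document shape is already known.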
Creates a DataFrame based on the schema derived from the bean class.
Note: Prefer toDS[T](beanClass: Class[T]), as computations will be more efficient.
The bean class type to shape the data from MongoDB into
The bean class encapsulating the data from MongoDB
a DataFrame
Creates a DataFrame based on the schema derived from the optional type.
Note: Prefer toDS[T <: Product](), as computations will be more efficient.
The RDD must contain an _id for MongoDB versions < 3.2.
The optional type of the data from MongoDB. If not provided, the schema will be inferred from the collection.
a DataFrame
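
The two ways of calling the type-based conversion might look like the sketch below. The Person case class, the test.people collection, and the connection URI are hypothetical and only serve the illustration.

```scala
import com.mongodb.spark.MongoSpark
import org.apache.spark.sql.SparkSession

// Hypothetical case class describing the documents; defined at top level
// so that Spark can derive a schema from it.
case class Person(name: String, age: Int)

object ToDFWithTypeExample {
  def main(args: Array[String]): Unit = {
    // Assumption: MongoDB runs locally and test.people exists (hypothetical names).
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("ToDFWithTypeExample")
      .config("spark.mongodb.input.uri", "mongodb://127.0.0.1/test.people")
      .getOrCreate()

    val rdd = MongoSpark.load(spark.sparkContext)

    // Schema derived from the case class.
    val typedDF = rdd.toDF[Person]()
    typedDF.printSchema()

    // No type supplied: the schema is inferred from the collection.
    val inferredDF = rdd.toDF()
    inferredDF.printSchema()

    spark.stop()
  }
}
```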
Creates a Dataset from the RDD, strongly typed to the provided Java bean.
The type of the data from MongoDB
Creates a Dataset from the collection, strongly typed to the provided case class.
The type of the data from MongoDB
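
A strongly typed Dataset could be obtained roughly as follows. This is a sketch under assumptions: the Person case class, the collection name, and the URI are hypothetical.

```scala
import com.mongodb.spark.MongoSpark
import org.apache.spark.sql.{Dataset, SparkSession}

// Hypothetical case class; each document is mapped onto its fields.
case class Person(name: String, age: Int)

object ToDSExample {
  def main(args: Array[String]): Unit = {
    // Assumption: MongoDB runs locally and test.people exists (hypothetical names).
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("ToDSExample")
      .config("spark.mongodb.input.uri", "mongodb://127.0.0.1/test.people")
      .getOrCreate()

    // Convert the MongoRDD into a Dataset typed to the case class.
    val people: Dataset[Person] = MongoSpark.load(spark.sparkContext).toDS[Person]()

    // Typed operations can now use the case class fields directly.
    people.filter(p => p.age >= 18).show()

    spark.stop()
  }
}
```

Compared with a DataFrame, the Dataset keeps the element type, so field access is checked at compile time.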
Returns a copy with the specified aggregation pipeline.
the aggregation pipeline to use
the updated MongoRDD
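
Applying an aggregation pipeline might look like the sketch below. The $match stage, the "age" field, and the connection details are hypothetical; the pipeline is passed as a sequence of BSON documents.

```scala
import com.mongodb.spark.MongoSpark
import org.apache.spark.sql.SparkSession
import org.bson.Document

object WithPipelineExample {
  def main(args: Array[String]): Unit = {
    // Assumption: MongoDB runs locally and test.people exists (hypothetical names).
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("WithPipelineExample")
      .config("spark.mongodb.input.uri", "mongodb://127.0.0.1/test.people")
      .getOrCreate()

    val rdd = MongoSpark.load(spark.sparkContext)

    // withPipeline returns a new MongoRDD; the original rdd is left unchanged.
    // Hypothetical stage: keep only documents whose "age" is at least 18.
    val adults = rdd.withPipeline(
      Seq(Document.parse("{ $match: { age: { $gte: 18 } } }"))
    )

    println(adults.count())

    spark.stop()
  }
}
```

Because the pipeline runs inside MongoDB, filtering there can reduce the amount of data transferred to Spark.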
(Since version 1.0.0) use mapPartitionsWithIndex and filter
(Since version 1.0.0) use mapPartitionsWithIndex and flatMap
(Since version 1.0.0) use mapPartitionsWithIndex and foreach
(Since version 1.2.0) use TaskContext.get
(Since version 0.7.0) use mapPartitionsWithIndex
(Since version 1.0.0) use mapPartitionsWithIndex
(Since version 1.0.0) use collect
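
The replacements named in the deprecation notes above belong to the standard Spark RDD API. The sketch below shows one possible way to combine them; the sample data and object name are made up for illustration, and it does not reproduce the exact signatures of the deprecated methods.

```scala
import org.apache.spark.{SparkConf, SparkContext, TaskContext}

object DeprecatedReplacementsExample {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setMaster("local[2]").setAppName("DeprecatedReplacementsExample")
    val sc = new SparkContext(conf)

    // Hypothetical sample data split over two partitions.
    val numbers = sc.parallelize(1 to 10, numSlices = 2)

    // mapPartitionsWithIndex combined with filter: the partition index is
    // available while filtering the elements of each partition.
    val evensWithPartition = numbers.mapPartitionsWithIndex { (index, iter) =>
      iter.filter(_ % 2 == 0).map(n => (index, n))
    }

    // TaskContext.get provides the context of the currently running task.
    val partitionIds = numbers.mapPartitions { _ =>
      Iterator.single(TaskContext.get().partitionId())
    }

    // collect returns the RDD contents as an array on the driver.
    println(evensWithPartition.collect().toSeq)
    println(partitionIds.collect().toSeq)

    sc.stop()
  }
}
```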
MongoRDD Class
the type of the collection documents