com.mongodb.spark.rdd.partitioner
The Sample Partitioner.
Uses the average document size and random sampling of the collection to determine suitable partitions for the collection.
The prefix when using sparkConf is: spark.mongodb.input.partitionerOptions followed by the property name:
sparkConf
spark.mongodb.input.partitionerOptions
_id
64
10
*Note:* Requires MongoDB 3.2+ *Note:* Does not support views. Use MongoPaginateByCountPartitioner or create a custom partitioner.
MongoPaginateByCountPartitioner
1.0
The partition key property
The partition size MB property
Calculate the Partitions
the MongoConnector
the com.mongodb.spark.config.ReadConfig
the partitions
The number of samples for each partition
The Sample Partitioner.
Uses the average document size and random sampling of the collection to determine suitable partitions for the collection.
Configuration Properties
The prefix when using
sparkConf
is:spark.mongodb.input.partitionerOptions
followed by the property name:_id
.64
.10
.*Note:* Requires MongoDB 3.2+ *Note:* Does not support views. Use
MongoPaginateByCountPartitioner
or create a custom partitioner.1.0