com.mongodb.spark.rdd.partitioner
The Sample Partitioner.
Uses the average document size and random sampling of the collection to determine suitable partitions for the collection.
The prefix when using sparkConf is: spark.mongodb.input.partitionerOptions followed by the property name:
sparkConf
spark.mongodb.input.partitionerOptions
_id
64
10
*Note:* Requires MongoDB 3.2+
1.0
The partition key property
The partition size MB property
Calculate the Partitions
the MongoConnector
the com.mongodb.spark.config.ReadConfig
the partitions
The number of samples for each partition
The Sample Partitioner.
Uses the average document size and random sampling of the collection to determine suitable partitions for the collection.
Configuration Properties
The prefix when using
sparkConf
is:spark.mongodb.input.partitionerOptions
followed by the property name:_id
.64
.10
.*Note:* Requires MongoDB 3.2+
1.0