Package net.sansa_stack.spark.rdd.op.rdf
Class JavaRddOfNamedModelsOps
java.lang.Object
net.sansa_stack.spark.rdd.op.rdf.JavaRddOfNamedModelsOps
Operations on the RDD[(String, Model)] type.
The naming "RddOfNamedModelOps" leaves room for a future "RddOfNamedGraphOps", should there be demand
for these operations at Jena's Graph level.
-
Constructor Summary
-
Method Summary
Modifier and Type | Method | Description

static <K> org.apache.spark.api.java.JavaPairRDD<K, org.apache.jena.rdf.model.Model>
groupNamedModels(org.apache.spark.api.java.JavaPairRDD<K, org.apache.jena.rdf.model.Model> rdd, boolean distinct, boolean sortGraphsByIri, int numPartitions)
Group and/or sort named models by their graph IRI

static org.apache.spark.api.java.JavaRDD<org.aksw.jenax.arq.dataset.api.DatasetOneNg>
mapToDatasets(org.apache.spark.api.java.JavaPairRDD<String, org.apache.jena.rdf.model.Model> rdd)
Map each (name, model) pair to a dataset with the same information

static org.apache.spark.api.java.JavaRDD<org.aksw.jenax.arq.dataset.api.ResourceInDataset>
mapToResourceInDataset(org.apache.spark.api.java.JavaPairRDD<String, org.apache.jena.rdf.model.Model> rdd)

static org.apache.spark.api.java.JavaRDD<org.apache.jena.rdf.model.Resource>
mapToResources(org.apache.spark.api.java.JavaPairRDD<String, org.apache.jena.rdf.model.Model> rdd)
-
Constructor Details
-
JavaRddOfNamedModelsOps
public JavaRddOfNamedModelsOps()
-
-
Method Details
-
groupNamedModels
public static <K> org.apache.spark.api.java.JavaPairRDD<K, org.apache.jena.rdf.model.Model> groupNamedModels(org.apache.spark.api.java.JavaPairRDD<K, org.apache.jena.rdf.model.Model> rdd, boolean distinct, boolean sortGraphsByIri, int numPartitions)
Group and/or sort named models by their graph IRI.
Parameters:
rdd - The input RDD of (key, model) pairs
distinct - If false, models with the same key remain separate; otherwise they are merged
sortGraphsByIri - Whether to apply sorting in addition to grouping
numPartitions - Number of partitions to use for sorting; only applicable if sortGraphsByIri is true
Returns:
An RDD of (key, model) pairs, grouped and/or sorted according to the flags
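
The effect of the distinct and sortGraphsByIri flags can be illustrated with a minimal plain-Java sketch. It is an analogy only, not the SANSA implementation: it uses in-memory lists instead of a JavaPairRDD, represents each model as a set of triple strings, and omits numPartitions because nothing is partitioned. The class GroupNamedModelsSketch and its methods group and pair are hypothetical names introduced for this illustration.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration class; not part of SANSA, Spark, or Jena.
public class GroupNamedModelsSketch {

    // Builds a (graph IRI, model) pair; the "model" is just a set of triple strings.
    public static Map.Entry<String, Set<String>> pair(String graphIri, String... triples) {
        Set<String> model = new TreeSet<>(Arrays.asList(triples));
        return new SimpleEntry<>(graphIri, model);
    }

    // Assumed semantics, taken from the parameter descriptions:
    // distinct = true merges models that share a key, false keeps them separate;
    // sortGraphsByIri = true additionally orders the result by key.
    public static List<Map.Entry<String, Set<String>>> group(
            List<Map.Entry<String, Set<String>>> pairs,
            boolean distinct,
            boolean sortGraphsByIri) {

        List<Map.Entry<String, Set<String>>> result = new ArrayList<>();

        if (distinct) {
            // One merged model per key, in first-seen key order.
            Map<String, Set<String>> merged = new LinkedHashMap<>();
            for (Map.Entry<String, Set<String>> p : pairs) {
                merged.computeIfAbsent(p.getKey(), k -> new TreeSet<>()).addAll(p.getValue());
            }
            for (Map.Entry<String, Set<String>> e : merged.entrySet()) {
                result.add(new SimpleEntry<>(e.getKey(), e.getValue()));
            }
        } else {
            // Every input pair stays a separate entry (copied so the input is not shared).
            for (Map.Entry<String, Set<String>> p : pairs) {
                Set<String> copy = new TreeSet<>(p.getValue());
                result.add(new SimpleEntry<>(p.getKey(), copy));
            }
        }

        if (sortGraphsByIri) {
            // Stable sort by key; entries with equal keys keep their relative order.
            result.sort(Map.Entry.comparingByKey());
        }
        return result;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Set<String>>> pairs = new ArrayList<>();
        pairs.add(pair("urn:g2", "s1 p o"));
        pairs.add(pair("urn:g1", "s2 p o"));
        pairs.add(pair("urn:g2", "s3 p o"));

        // Merged and sorted: two entries, urn:g1 first, urn:g2 holding both of its triples.
        System.out.println(group(pairs, true, true));
        // Not merged but sorted: three entries, urn:g1 first, then the two urn:g2 entries.
        System.out.println(group(pairs, false, true));
    }
}
```

With distinct set to false the two urn:g2 entries stay separate, so a consumer still sees one pair per input model; with distinct set to true there is exactly one pair per key.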
-
mapToDatasets
public static org.apache.spark.api.java.JavaRDD<org.aksw.jenax.arq.dataset.api.DatasetOneNg> mapToDatasets(org.apache.spark.api.java.JavaPairRDD<String, org.apache.jena.rdf.model.Model> rdd)
Map each (name, model) pair to a dataset with the same information.
Parameters:
rdd - The input RDD of (name, model) pairs
Returns:
An RDD with one dataset per (name, model) pair
-
mapToResources
public static org.apache.spark.api.java.JavaRDD<org.apache.jena.rdf.model.Resource> mapToResources(org.apache.spark.api.java.JavaPairRDD<String, org.apache.jena.rdf.model.Model> rdd)
-
mapToResourceInDataset
public static org.apache.spark.api.java.JavaRDD<org.aksw.jenax.arq.dataset.api.ResourceInDataset> mapToResourceInDataset(org.apache.spark.api.java.JavaPairRDD<String, org.apache.jena.rdf.model.Model> rdd)
-