Package net.sansa_stack.spark.rdd.op.rdf
Class JavaRddOfQuadsOps
java.lang.Object
net.sansa_stack.spark.rdd.op.rdf.JavaRddOfQuadsOps
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionstatic org.apache.spark.api.java.JavaPairRDD<String,
org.apache.jena.rdf.model.Model> groupByNamedGraph
(org.apache.spark.api.java.JavaRDD<org.apache.jena.sparql.core.Quad> rdd) group quads by graph IRI into a pairs (graphIri, Model)static JavaRddFunction<org.apache.jena.sparql.core.Quad,
org.apache.jena.sparql.core.Quad> mapIntoGraph
(org.apache.jena.graph.Node graphNode) static org.apache.spark.api.java.JavaRDD<org.apache.jena.query.Dataset>
mapToDataset
(org.apache.spark.api.java.JavaRDD<org.apache.jena.sparql.core.Quad> rdd) static JavaRddFunction<org.apache.jena.sparql.core.Quad,
org.apache.jena.graph.Triple> mapToTriples
(org.apache.jena.graph.Node graphNode) static org.apache.spark.api.java.JavaRDD<org.apache.jena.sparql.core.Quad>
postProcess
(org.apache.spark.api.java.JavaRDD<org.apache.jena.sparql.core.Quad> rdd, boolean sort, boolean ascending, boolean distinct, int numPartitions) Post process RDF data - sort, distinct, repartition Sort quads by their string representation (relies onNodeFmtLib.str(org.apache.jena.graph.Triple)
)
-
Constructor Details
-
JavaRddOfQuadsOps
public JavaRddOfQuadsOps()
-
-
Method Details
-
groupByNamedGraph
public static org.apache.spark.api.java.JavaPairRDD<String,org.apache.jena.rdf.model.Model> groupByNamedGraph(org.apache.spark.api.java.JavaRDD<org.apache.jena.sparql.core.Quad> rdd) group quads by graph IRI into a pairs (graphIri, Model) -
mapToDataset
public static org.apache.spark.api.java.JavaRDD<org.apache.jena.query.Dataset> mapToDataset(org.apache.spark.api.java.JavaRDD<org.apache.jena.sparql.core.Quad> rdd) -
mapIntoGraph
public static JavaRddFunction<org.apache.jena.sparql.core.Quad,org.apache.jena.sparql.core.Quad> mapIntoGraph(org.apache.jena.graph.Node graphNode) -
mapToTriples
public static JavaRddFunction<org.apache.jena.sparql.core.Quad,org.apache.jena.graph.Triple> mapToTriples(org.apache.jena.graph.Node graphNode) -
postProcess
public static org.apache.spark.api.java.JavaRDD<org.apache.jena.sparql.core.Quad> postProcess(org.apache.spark.api.java.JavaRDD<org.apache.jena.sparql.core.Quad> rdd, boolean sort, boolean ascending, boolean distinct, int numPartitions) Post process RDF data - sort, distinct, repartition Sort quads by their string representation (relies onNodeFmtLib.str(org.apache.jena.graph.Triple)
)
-