Class JavaRddOfNamedModelsOps

java.lang.Object
net.sansa_stack.spark.rdd.op.rdf.JavaRddOfNamedModelsOps

public class JavaRddOfNamedModelsOps extends Object
Operations on the RDD[(String, Model)] type. The naming "RddOfNamedModelOps" allows for future introduction of "RddOfNamedGraphOps" in case there is demand for these operations on Jena's Graph level.
  • Constructor Summary

    Constructors
    Constructor
    Description
     
  • Method Summary

    Modifier and Type
    Method
    Description
    static <K> org.apache.spark.api.java.JavaPairRDD<K,org.apache.jena.rdf.model.Model>
    groupNamedModels(org.apache.spark.api.java.JavaPairRDD<K,org.apache.jena.rdf.model.Model> rdd, boolean distinct, boolean sortGraphsByIri, int numPartitions)
    Group and/or sort named models by their graph iri
    static org.apache.spark.api.java.JavaRDD<org.aksw.jenax.arq.dataset.api.DatasetOneNg>
    mapToDatasets(org.apache.spark.api.java.JavaPairRDD<String,org.apache.jena.rdf.model.Model> rdd)
    Map each (name, model) pair to a dataset with the same information
    static org.apache.spark.api.java.JavaRDD<org.aksw.jenax.arq.dataset.api.ResourceInDataset>
    mapToResourceInDataset(org.apache.spark.api.java.JavaPairRDD<String,org.apache.jena.rdf.model.Model> rdd)
     
    static org.apache.spark.api.java.JavaRDD<org.apache.jena.rdf.model.Resource>
    mapToResources(org.apache.spark.api.java.JavaPairRDD<String,org.apache.jena.rdf.model.Model> rdd)
     

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Constructor Details

    • JavaRddOfNamedModelsOps

      public JavaRddOfNamedModelsOps()
  • Method Details

    • groupNamedModels

      public static <K> org.apache.spark.api.java.JavaPairRDD<K,org.apache.jena.rdf.model.Model> groupNamedModels(org.apache.spark.api.java.JavaPairRDD<K,org.apache.jena.rdf.model.Model> rdd, boolean distinct, boolean sortGraphsByIri, int numPartitions)
      Group and/or sort named models by their graph iri
      Parameters:
      rdd -
      distinct - If false then models with the same key remain separated otherwise they become merged
      sortGraphsByIri - Whether to apply sorting in addition to grouping
      numPartitions - Number of partitions to use for sorting; only applicable if sortGraphsByIri is true.
      Returns:
    • mapToDatasets

      public static org.apache.spark.api.java.JavaRDD<org.aksw.jenax.arq.dataset.api.DatasetOneNg> mapToDatasets(org.apache.spark.api.java.JavaPairRDD<String,org.apache.jena.rdf.model.Model> rdd)
      Map each (name, model) pair to a dataset with the same information
      Parameters:
      rdd -
      Returns:
    • mapToResources

      public static org.apache.spark.api.java.JavaRDD<org.apache.jena.rdf.model.Resource> mapToResources(org.apache.spark.api.java.JavaPairRDD<String,org.apache.jena.rdf.model.Model> rdd)
    • mapToResourceInDataset

      public static org.apache.spark.api.java.JavaRDD<org.aksw.jenax.arq.dataset.api.ResourceInDataset> mapToResourceInDataset(org.apache.spark.api.java.JavaPairRDD<String,org.apache.jena.rdf.model.Model> rdd)