Class RddRdfWriterFactory
java.lang.Object
net.sansa_stack.spark.io.rdf.output.RddWriterSettings<SELF>
net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings<RddRdfWriterFactory>
net.sansa_stack.spark.io.rdf.output.RddRdfWriterFactory
A factory for RddRdfWriter instances that enables early validation of settings via validate().
Using RddRdfWriter directly may cause misconfigurations to be detected only near the end of a long-running Spark job, such as when attempting to write out the result of a long-running sort operation.
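The fail-fast idea described above can be illustrated with a minimal, self-contained sketch. It does not use the SANSA classes: WriterFactorySketch, its two setters, and the specific checks performed are hypothetical stand-ins chosen for illustration, and the actual checks made by RddRdfWriterFactory.validate() are not documented on this page.

```java
import java.util.ArrayList;
import java.util.List;

public class EarlyValidationSketch {

    /** Hypothetical stand-in for a settings-carrying factory; NOT the SANSA class. */
    static class WriterFactorySketch {
        private String outputFormat;
        private String targetFile;

        WriterFactorySketch setOutputFormat(String outputFormat) {
            this.outputFormat = outputFormat;
            return this;
        }

        WriterFactorySketch setTargetFile(String targetFile) {
            this.targetFile = targetFile;
            return this;
        }

        /** Checks the settings up front (assumed rules) and fails fast if they are unusable. */
        WriterFactorySketch validate() {
            List<String> problems = new ArrayList<>();
            if (outputFormat == null || outputFormat.isEmpty()) {
                problems.add("no output format set");
            }
            if (targetFile == null || targetFile.isEmpty()) {
                problems.add("no target file set");
            }
            if (!problems.isEmpty()) {
                throw new IllegalStateException(
                        "Invalid writer settings: " + String.join("; ", problems));
            }
            return this;
        }
    }

    /** Returns true if validate() rejected the given settings. */
    static boolean failsValidation(WriterFactorySketch factory) {
        try {
            factory.validate();
            return false;
        } catch (IllegalStateException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        // The target file is deliberately left unset to show the early failure.
        WriterFactorySketch factory = new WriterFactorySketch().setOutputFormat("ntriples");
        try {
            factory.validate();
            // Only after validation succeeds would the expensive job be started.
        } catch (IllegalStateException e) {
            System.out.println("Rejected before any long-running work: " + e.getMessage());
        }
    }
}
```

With the real class, the analogous order would presumably be to call create(), apply the inherited setters, call validate(), and only then obtain a writer through one of the for... methods such as forTriple(rdd).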
Field Summary
Fields inherited from class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
deferOutputForUsedPrefixes, globalPrefixMapping, mapQuadsToTriplesForTripleLangs, outputFormat
Fields inherited from class net.sansa_stack.spark.io.rdf.output.RddWriterSettings
allowOverwriteFiles, consoleOutSupplier, deletePartitionFolderAfterMerge, hadoopConfiguration, partitionFolder, partitionFolderFs, partitionsAsIndependentFiles, postProcessingSettings, targetFile, targetFileFs, useCoalesceOne
Constructor Summary
RddRdfWriterFactory()
Method Summary
Modifier and Type, Method
static RddRdfWriterFactory create()
RddRdfWriter<org.aksw.jenax.arq.dataset.api.DatasetOneNg> forDataset(org.apache.spark.api.java.JavaRDD<? extends org.aksw.jenax.arq.dataset.api.DatasetOneNg> rdd)
RddRdfWriter<org.aksw.jenax.arq.dataset.api.DatasetGraphOneNg> forDatasetGraph(org.apache.spark.api.java.JavaRDD<? extends org.aksw.jenax.arq.dataset.api.DatasetGraphOneNg> rdd)
RddRdfWriter<org.apache.jena.graph.Graph> forGraph(org.apache.spark.api.java.JavaRDD<? extends org.apache.jena.graph.Graph> rdd)
RddRdfWriter<org.apache.jena.rdf.model.Model> forModel(org.apache.spark.api.java.JavaRDD<? extends org.apache.jena.rdf.model.Model> rdd)
RddRdfWriter<org.apache.jena.sparql.core.Quad> forQuad(org.apache.spark.api.java.JavaRDD<? extends org.apache.jena.sparql.core.Quad> rdd)
RddRdfWriter<org.apache.jena.graph.Triple> forTriple(org.apache.spark.api.java.JavaRDD<? extends org.apache.jena.graph.Triple> rdd)
protected RddRdfWriterFactory self()
validate()
Methods inherited from class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
configureFrom, getFallbackOutputFormat, getGlobalPrefixMapping, getOutputFormat, isMapQuadsToTriplesForTripleLangs, isPartitionsAsIndependentFiles, mutate, setDeferOutputForUsedPrefixes, setGlobalPrefixMapping, setGlobalPrefixMapping, setMapQuadsToTriplesForTripleLangs, setOutputFormat, setOutputFormat, setPartitionsAsIndependentFiles
Methods inherited from class net.sansa_stack.spark.io.rdf.output.RddWriterSettings
configureFrom, getConsoleOutSupplier, getHadoopConfiguration, getPartitionFolder, getPartitionFolderFs, getPostProcessingSettings, getTargetFile, getTargetFileFs, isAllowOverwriteFiles, isConsoleOutput, isDeletePartitionFolderAfterMerge, isUseCoalesceOne, setAllowOverwriteFiles, setConsoleOutput, setConsoleOutSupplier, setDeletePartitionFolderAfterMerge, setHadoopConfiguration, setPartitionFolder, setPartitionFolder, setPartitionFolderFs, setPostProcessingSettings, setTargetFile, setTargetFile, setTargetFileFs, setUseCoalesceOne
Constructor Details
RddRdfWriterFactory
public RddRdfWriterFactory()
Method Details
create
self
Overrides: self in class RddRdfWriterSettings<RddRdfWriterFactory>
validate
forTriple
public RddRdfWriter<org.apache.jena.graph.Triple> forTriple(org.apache.spark.api.java.JavaRDD<? extends org.apache.jena.graph.Triple> rdd)
forQuad
public RddRdfWriter<org.apache.jena.sparql.core.Quad> forQuad(org.apache.spark.api.java.JavaRDD<? extends org.apache.jena.sparql.core.Quad> rdd)
forGraph
public RddRdfWriter<org.apache.jena.graph.Graph> forGraph(org.apache.spark.api.java.JavaRDD<? extends org.apache.jena.graph.Graph> rdd)
forDatasetGraph
public RddRdfWriter<org.aksw.jenax.arq.dataset.api.DatasetGraphOneNg> forDatasetGraph(org.apache.spark.api.java.JavaRDD<? extends org.aksw.jenax.arq.dataset.api.DatasetGraphOneNg> rdd)
forModel
public RddRdfWriter<org.apache.jena.rdf.model.Model> forModel(org.apache.spark.api.java.JavaRDD<? extends org.apache.jena.rdf.model.Model> rdd)
forDataset
public RddRdfWriter<org.aksw.jenax.arq.dataset.api.DatasetOneNg> forDataset(org.apache.spark.api.java.JavaRDD<? extends org.aksw.jenax.arq.dataset.api.DatasetOneNg> rdd)