Class AccumuloFileOutputFormat


  • public class AccumuloFileOutputFormat
    extends org.apache.hadoop.mapreduce.lib.output.FileOutputFormat<Key,​Value>
    This class allows MapReduce jobs to write output in the Accumulo data file format.
    Care should be taken to write only sorted data (sorted by Key), as this is an important requirement of Accumulo data files. The output path to be created must be specified via configure(), which uses a fluent API. For Example:
     AccumuloFileOutputFormat.configure()
          .outputPath(path)
          .fileBlockSize(b)
          .compression(type)
          .summarizers(sc1, sc2).store(job));
     
    For all available options see FileOutputFormatBuilder. Methods inherited from FileOutputFormat are not supported and may be ignored or cause failures. Using other Hadoop configuration options that affect the behavior of the underlying files directly in the Job's configuration may work, but are not directly supported at this time.
    Since:
    2.0
    • Nested Class Summary

      • Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.lib.output.FileOutputFormat

        org.apache.hadoop.mapreduce.lib.output.FileOutputFormat.Counter
    • Field Summary

      • Fields inherited from class org.apache.hadoop.mapreduce.lib.output.FileOutputFormat

        BASE_OUTPUT_NAME, COMPRESS, COMPRESS_CODEC, COMPRESS_TYPE, OUTDIR, PART
    • Constructor Detail

      • AccumuloFileOutputFormat

        public AccumuloFileOutputFormat()
    • Method Detail

      • getRecordWriter

        public org.apache.hadoop.mapreduce.RecordWriter<Key,​Value> getRecordWriter​(org.apache.hadoop.mapreduce.TaskAttemptContext context)
                                                                                  throws IOException
        Specified by:
        getRecordWriter in class org.apache.hadoop.mapreduce.lib.output.FileOutputFormat<Key,​Value>
        Throws:
        IOException