Class GroupReadSupport


  • public class GroupReadSupport
    extends ReadSupport<org.apache.parquet.example.data.Group>
    • Constructor Detail

      • GroupReadSupport

        public GroupReadSupport()
    • Method Detail

      • init

        public ReadSupport.ReadContext init​(org.apache.hadoop.conf.Configuration configuration,
                                            Map<String,​String> keyValueMetaData,
                                            org.apache.parquet.schema.MessageType fileSchema)
        Description copied from class: ReadSupport
        called in InputFormat.getSplits(org.apache.hadoop.mapreduce.JobContext) in the front end
        Overrides:
        init in class ReadSupport<org.apache.parquet.example.data.Group>
        Parameters:
        configuration - the job configuration
        keyValueMetaData - the app specific metadata from the file
        fileSchema - the schema of the file
        Returns:
        the readContext that defines how to read the file
      • prepareForRead

        public org.apache.parquet.io.api.RecordMaterializer<org.apache.parquet.example.data.Group> prepareForRead​(org.apache.hadoop.conf.Configuration configuration,
                                                                                                                  Map<String,​String> keyValueMetaData,
                                                                                                                  org.apache.parquet.schema.MessageType fileSchema,
                                                                                                                  ReadSupport.ReadContext readContext)
        Description copied from class: ReadSupport
        called in RecordReader.initialize(org.apache.hadoop.mapreduce.InputSplit, org.apache.hadoop.mapreduce.TaskAttemptContext) in the back end the returned RecordMaterializer will materialize the records and add them to the destination
        Specified by:
        prepareForRead in class ReadSupport<org.apache.parquet.example.data.Group>
        Parameters:
        configuration - the job configuration
        keyValueMetaData - the app specific metadata from the file
        fileSchema - the schema of the file
        readContext - returned by the init method
        Returns:
        the recordMaterializer that will materialize the records