Class ParquetWriter<T>

    • Constructor Summary

      Constructors 
      Constructor Description
      ParquetWriter​(org.apache.hadoop.fs.Path file, org.apache.hadoop.conf.Configuration conf, WriteSupport<T> writeSupport)
      Deprecated.
      ParquetWriter​(org.apache.hadoop.fs.Path file, WriteSupport<T> writeSupport)
      Deprecated.
      will be removed in 2.0.0
      ParquetWriter​(org.apache.hadoop.fs.Path file, WriteSupport<T> writeSupport, org.apache.parquet.hadoop.metadata.CompressionCodecName compressionCodecName, int blockSize, int pageSize)
      Deprecated.
      will be removed in 2.0.0
      ParquetWriter​(org.apache.hadoop.fs.Path file, WriteSupport<T> writeSupport, org.apache.parquet.hadoop.metadata.CompressionCodecName compressionCodecName, int blockSize, int pageSize, boolean enableDictionary, boolean validating)
      Deprecated.
      will be removed in 2.0.0
      ParquetWriter​(org.apache.hadoop.fs.Path file, WriteSupport<T> writeSupport, org.apache.parquet.hadoop.metadata.CompressionCodecName compressionCodecName, int blockSize, int pageSize, int dictionaryPageSize, boolean enableDictionary, boolean validating)
      Deprecated.
      will be removed in 2.0.0
      ParquetWriter​(org.apache.hadoop.fs.Path file, WriteSupport<T> writeSupport, org.apache.parquet.hadoop.metadata.CompressionCodecName compressionCodecName, int blockSize, int pageSize, int dictionaryPageSize, boolean enableDictionary, boolean validating, org.apache.parquet.column.ParquetProperties.WriterVersion writerVersion)
      Deprecated.
      will be removed in 2.0.0
      ParquetWriter​(org.apache.hadoop.fs.Path file, WriteSupport<T> writeSupport, org.apache.parquet.hadoop.metadata.CompressionCodecName compressionCodecName, int blockSize, int pageSize, int dictionaryPageSize, boolean enableDictionary, boolean validating, org.apache.parquet.column.ParquetProperties.WriterVersion writerVersion, org.apache.hadoop.conf.Configuration conf)
      Deprecated.
      will be removed in 2.0.0
      ParquetWriter​(org.apache.hadoop.fs.Path file, ParquetFileWriter.Mode mode, WriteSupport<T> writeSupport, org.apache.parquet.hadoop.metadata.CompressionCodecName compressionCodecName, int blockSize, int pageSize, int dictionaryPageSize, boolean enableDictionary, boolean validating, org.apache.parquet.column.ParquetProperties.WriterVersion writerVersion, org.apache.hadoop.conf.Configuration conf)
      Deprecated.
      will be removed in 2.0.0
    • Field Detail

      • DEFAULT_COMPRESSION_CODEC_NAME

        public static final org.apache.parquet.hadoop.metadata.CompressionCodecName DEFAULT_COMPRESSION_CODEC_NAME
      • DEFAULT_IS_DICTIONARY_ENABLED

        public static final boolean DEFAULT_IS_DICTIONARY_ENABLED
        See Also:
        Constant Field Values
      • DEFAULT_IS_VALIDATING_ENABLED

        public static final boolean DEFAULT_IS_VALIDATING_ENABLED
        See Also:
        Constant Field Values
      • DEFAULT_WRITER_VERSION

        public static final org.apache.parquet.column.ParquetProperties.WriterVersion DEFAULT_WRITER_VERSION
      • MAX_PADDING_SIZE_DEFAULT

        public static final int MAX_PADDING_SIZE_DEFAULT
        See Also:
        Constant Field Values
    • Constructor Detail

      • ParquetWriter

        @Deprecated
        public ParquetWriter​(org.apache.hadoop.fs.Path file,
                             WriteSupport<T> writeSupport,
                             org.apache.parquet.hadoop.metadata.CompressionCodecName compressionCodecName,
                             int blockSize,
                             int pageSize)
                      throws IOException
        Deprecated.
        will be removed in 2.0.0
        Create a new ParquetWriter with dictionary encoding enabled and validation off.
        Parameters:
        file - the file to create
        writeSupport - the implementation to write a record to a RecordConsumer
        compressionCodecName - the compression codec to use
        blockSize - the block size threshold
        pageSize - the page size threshold
        Throws:
        IOException - if there is an error while writing
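        The blockSize and pageSize arguments are plain int values, so a size expressed in megabytes has to be converted before the call. The following is a minimal sketch, assuming both thresholds are byte counts (the parameter descriptions above do not state the unit). SizeArgs and mebibytes are hypothetical names introduced for illustration and are not part of Parquet.

```java
/** Hypothetical helper, not part of Parquet: converts a MiB count to an int byte count. */
final class SizeArgs {
    private static final int BYTES_PER_MIB = 1024 * 1024;

    private SizeArgs() {}

    /**
     * Returns mib * 1024 * 1024.
     * Throws ArithmeticException if the result does not fit in an int,
     * which is the type the constructor parameters use.
     */
    static int mebibytes(int mib) {
        if (mib < 0) {
            throw new IllegalArgumentException("size must be non-negative: " + mib);
        }
        return Math.multiplyExact(mib, BYTES_PER_MIB);
    }

    public static void main(String[] args) {
        // Assumption: thresholds are passed in bytes.
        int blockSize = mebibytes(128);
        int pageSize = mebibytes(1);
        System.out.println("blockSize=" + blockSize + " pageSize=" + pageSize);
    }
}
```

        Using Math.multiplyExact means a value that overflows int fails loudly instead of silently wrapping to a negative threshold.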
      • ParquetWriter

        @Deprecated
        public ParquetWriter​(org.apache.hadoop.fs.Path file,
                             WriteSupport<T> writeSupport,
                             org.apache.parquet.hadoop.metadata.CompressionCodecName compressionCodecName,
                             int blockSize,
                             int pageSize,
                             boolean enableDictionary,
                             boolean validating)
                      throws IOException
        Deprecated.
        will be removed in 2.0.0
        Create a new ParquetWriter.
        Parameters:
        file - the file to create
        writeSupport - the implementation to write a record to a RecordConsumer
        compressionCodecName - the compression codec to use
        blockSize - the block size threshold
        pageSize - the page size threshold, applied to both data pages and dictionary pages
        enableDictionary - whether to enable dictionary encoding
        validating - whether to validate records against the schema
        Throws:
        IOException - if there is an error while writing
      • ParquetWriter

        @Deprecated
        public ParquetWriter​(org.apache.hadoop.fs.Path file,
                             WriteSupport<T> writeSupport,
                             org.apache.parquet.hadoop.metadata.CompressionCodecName compressionCodecName,
                             int blockSize,
                             int pageSize,
                             int dictionaryPageSize,
                             boolean enableDictionary,
                             boolean validating)
                      throws IOException
        Deprecated.
        will be removed in 2.0.0
        Create a new ParquetWriter.
        Parameters:
        file - the file to create
        writeSupport - the implementation to write a record to a RecordConsumer
        compressionCodecName - the compression codec to use
        blockSize - the block size threshold
        pageSize - the page size threshold
        dictionaryPageSize - the page size threshold for the dictionary pages
        enableDictionary - whether to enable dictionary encoding
        validating - whether to validate records against the schema
        Throws:
        IOException - if there is an error while writing
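        The three overloads above differ only in how many settings the caller supplies. The following sketch models how the shorter forms could fill in the missing values, based on the descriptions above: the 5-argument form enables dictionary encoding and disables validation, and the 7-argument form uses pageSize for dictionary pages as well. WriterSettings is a hypothetical stand-in, not a Parquet class, and the delegation chain and the two default values are assumptions drawn from those descriptions rather than from the implementation.

```java
/** Hypothetical stand-in for the settings these constructors collect; not a Parquet class. */
final class WriterSettings {
    // Assumed defaults: the 5-argument form is described as
    // "dictionary encoding enabled and validation off".
    static final boolean ASSUMED_DICTIONARY_ENABLED = true;
    static final boolean ASSUMED_VALIDATING_ENABLED = false;

    final int blockSize;
    final int pageSize;
    final int dictionaryPageSize;
    final boolean enableDictionary;
    final boolean validating;

    /** Mirrors the 5-argument form: only the size thresholds are given. */
    WriterSettings(int blockSize, int pageSize) {
        this(blockSize, pageSize, ASSUMED_DICTIONARY_ENABLED, ASSUMED_VALIDATING_ENABLED);
    }

    /** Mirrors the 7-argument form: pageSize covers data and dictionary pages. */
    WriterSettings(int blockSize, int pageSize, boolean enableDictionary, boolean validating) {
        this(blockSize, pageSize, pageSize, enableDictionary, validating);
    }

    /** Mirrors the 8-argument form: the dictionary page size is given separately. */
    WriterSettings(int blockSize, int pageSize, int dictionaryPageSize,
                   boolean enableDictionary, boolean validating) {
        this.blockSize = blockSize;
        this.pageSize = pageSize;
        this.dictionaryPageSize = dictionaryPageSize;
        this.enableDictionary = enableDictionary;
        this.validating = validating;
    }

    @Override
    public String toString() {
        return "blockSize=" + blockSize
            + ", pageSize=" + pageSize
            + ", dictionaryPageSize=" + dictionaryPageSize
            + ", enableDictionary=" + enableDictionary
            + ", validating=" + validating;
    }

    public static void main(String[] args) {
        // Shortest form: every optional setting falls back to an assumed default.
        System.out.println(new WriterSettings(128 * 1024 * 1024, 1024 * 1024));
        // Longest form: a smaller dictionary page size and validation switched on.
        System.out.println(new WriterSettings(128 * 1024 * 1024, 1024 * 1024, 64 * 1024, true, true));
    }
}
```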
      • ParquetWriter

        @Deprecated
        public ParquetWriter​(org.apache.hadoop.fs.Path file,
                             WriteSupport<T> writeSupport,
                             org.apache.parquet.hadoop.metadata.CompressionCodecName compressionCodecName,
                             int blockSize,
                             int pageSize,
                             int dictionaryPageSize,
                             boolean enableDictionary,
                             boolean validating,
                             org.apache.parquet.column.ParquetProperties.WriterVersion writerVersion)
                      throws IOException
        Deprecated.
        will be removed in 2.0.0
        Create a new ParquetWriter. This constructor instantiates a new Hadoop Configuration, which reads its settings from the classpath.
        Parameters:
        file - the file to create
        writeSupport - the implementation to write a record to a RecordConsumer
        compressionCodecName - the compression codec to use
        blockSize - the block size threshold
        pageSize - the page size threshold
        dictionaryPageSize - the page size threshold for the dictionary pages
        enableDictionary - whether to enable dictionary encoding
        validating - whether to validate records against the schema
        writerVersion - the writer version to use, a ParquetProperties.WriterVersion value
        Throws:
        IOException - if there is an error while writing
      • ParquetWriter

        @Deprecated
        public ParquetWriter​(org.apache.hadoop.fs.Path file,
                             WriteSupport<T> writeSupport,
                             org.apache.parquet.hadoop.metadata.CompressionCodecName compressionCodecName,
                             int blockSize,
                             int pageSize,
                             int dictionaryPageSize,
                             boolean enableDictionary,
                             boolean validating,
                             org.apache.parquet.column.ParquetProperties.WriterVersion writerVersion,
                             org.apache.hadoop.conf.Configuration conf)
                      throws IOException
        Deprecated.
        will be removed in 2.0.0
        Create a new ParquetWriter.
        Parameters:
        file - the file to create
        writeSupport - the implementation to write a record to a RecordConsumer
        compressionCodecName - the compression codec to use
        blockSize - the block size threshold
        pageSize - the page size threshold
        dictionaryPageSize - the page size threshold for the dictionary pages
        enableDictionary - whether to enable dictionary encoding
        validating - whether to validate records against the schema
        writerVersion - the writer version to use, a ParquetProperties.WriterVersion value
        conf - Hadoop configuration to use while accessing the filesystem
        Throws:
        IOException - if there is an error while writing
      • ParquetWriter

        @Deprecated
        public ParquetWriter​(org.apache.hadoop.fs.Path file,
                             ParquetFileWriter.Mode mode,
                             WriteSupport<T> writeSupport,
                             org.apache.parquet.hadoop.metadata.CompressionCodecName compressionCodecName,
                             int blockSize,
                             int pageSize,
                             int dictionaryPageSize,
                             boolean enableDictionary,
                             boolean validating,
                             org.apache.parquet.column.ParquetProperties.WriterVersion writerVersion,
                             org.apache.hadoop.conf.Configuration conf)
                      throws IOException
        Deprecated.
        will be removed in 2.0.0
        Create a new ParquetWriter.
        Parameters:
        file - the file to create
        mode - file creation mode
        writeSupport - the implementation to write a record to a RecordConsumer
        compressionCodecName - the compression codec to use
        blockSize - the block size threshold
        pageSize - the page size threshold
        dictionaryPageSize - the page size threshold for the dictionary pages
        enableDictionary - whether to enable dictionary encoding
        validating - whether to validate records against the schema
        writerVersion - the writer version to use, a ParquetProperties.WriterVersion value
        conf - Hadoop configuration to use while accessing the filesystem
        Throws:
        IOException - if there is an error while writing
      • ParquetWriter

        @Deprecated
        public ParquetWriter​(org.apache.hadoop.fs.Path file,
                             WriteSupport<T> writeSupport)
                      throws IOException
        Deprecated.
        will be removed in 2.0.0
        Create a new ParquetWriter with default settings: a 128 MB block size, a 1 MB page size, and no compression. Dictionary encoding follows DEFAULT_IS_DICTIONARY_ENABLED.
        Parameters:
        file - the file to create
        writeSupport - the implementation to write a record to a RecordConsumer
        Throws:
        IOException - if there is an error while writing