public static interface ParquetSerDe.Builder extends SdkPojo, CopyableBuilder<ParquetSerDe.Builder,ParquetSerDe>
Modifier and Type | Method and Description |
---|---|
ParquetSerDe.Builder |
blockSizeBytes(Integer blockSizeBytes)
The Hadoop Distributed File System (HDFS) block size.
|
ParquetSerDe.Builder |
compression(ParquetCompression compression)
The compression code to use over data blocks.
|
ParquetSerDe.Builder |
compression(String compression)
The compression code to use over data blocks.
|
ParquetSerDe.Builder |
enableDictionaryCompression(Boolean enableDictionaryCompression)
Indicates whether to enable dictionary compression.
|
ParquetSerDe.Builder |
maxPaddingBytes(Integer maxPaddingBytes)
The maximum amount of padding to apply.
|
ParquetSerDe.Builder |
pageSizeBytes(Integer pageSizeBytes)
The Parquet page size.
|
ParquetSerDe.Builder |
writerVersion(ParquetWriterVersion writerVersion)
Indicates the version of row format to output.
|
ParquetSerDe.Builder |
writerVersion(String writerVersion)
Indicates the version of row format to output.
|
equalsBySdkFields, sdkFields
copy
applyMutation, build
ParquetSerDe.Builder blockSizeBytes(Integer blockSizeBytes)
The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Kinesis Data Firehose uses this value for padding calculations.
blockSizeBytes
- The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data
from Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Kinesis Data
Firehose uses this value for padding calculations.ParquetSerDe.Builder pageSizeBytes(Integer pageSizeBytes)
The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.
pageSizeBytes
- The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible
unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.ParquetSerDe.Builder compression(String compression)
The compression code to use over data blocks. The possible values are UNCOMPRESSED
,
SNAPPY
, and GZIP
, with the default being SNAPPY
. Use
SNAPPY
for higher decompression speed. Use GZIP
if the compression ratio is more
important than speed.
compression
- The compression code to use over data blocks. The possible values are UNCOMPRESSED
,
SNAPPY
, and GZIP
, with the default being SNAPPY
. Use
SNAPPY
for higher decompression speed. Use GZIP
if the compression ratio is
more important than speed.ParquetCompression
,
ParquetCompression
ParquetSerDe.Builder compression(ParquetCompression compression)
The compression code to use over data blocks. The possible values are UNCOMPRESSED
,
SNAPPY
, and GZIP
, with the default being SNAPPY
. Use
SNAPPY
for higher decompression speed. Use GZIP
if the compression ratio is more
important than speed.
compression
- The compression code to use over data blocks. The possible values are UNCOMPRESSED
,
SNAPPY
, and GZIP
, with the default being SNAPPY
. Use
SNAPPY
for higher decompression speed. Use GZIP
if the compression ratio is
more important than speed.ParquetCompression
,
ParquetCompression
ParquetSerDe.Builder enableDictionaryCompression(Boolean enableDictionaryCompression)
Indicates whether to enable dictionary compression.
enableDictionaryCompression
- Indicates whether to enable dictionary compression.ParquetSerDe.Builder maxPaddingBytes(Integer maxPaddingBytes)
The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 0.
maxPaddingBytes
- The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3
to HDFS before querying. The default is 0.ParquetSerDe.Builder writerVersion(String writerVersion)
Indicates the version of row format to output. The possible values are V1
and V2
.
The default is V1
.
writerVersion
- Indicates the version of row format to output. The possible values are V1
and
V2
. The default is V1
.ParquetWriterVersion
,
ParquetWriterVersion
ParquetSerDe.Builder writerVersion(ParquetWriterVersion writerVersion)
Indicates the version of row format to output. The possible values are V1
and V2
.
The default is V1
.
writerVersion
- Indicates the version of row format to output. The possible values are V1
and
V2
. The default is V1
.ParquetWriterVersion
,
ParquetWriterVersion
Copyright © 2022. All rights reserved.