@Generated(value="software.amazon.awssdk:codegen") public final class ParquetSerDe extends Object implements SdkPojo, Serializable, ToCopyableBuilder<ParquetSerDe.Builder,ParquetSerDe>
A serializer to use for converting data to the Parquet format before storing it in Amazon S3. For more information, see Apache Parquet.
Modifier and Type | Class and Description |
---|---|
static interface | ParquetSerDe.Builder |
Modifier and Type | Method and Description |
---|---|
Integer | blockSizeBytes(): The Hadoop Distributed File System (HDFS) block size. |
static ParquetSerDe.Builder | builder() |
ParquetCompression | compression(): The compression code to use over data blocks. |
String | compressionAsString(): The compression code to use over data blocks. |
Boolean | enableDictionaryCompression(): Indicates whether to enable dictionary compression. |
boolean | equals(Object obj) |
boolean | equalsBySdkFields(Object obj) |
<T> Optional<T> | getValueForField(String fieldName, Class<T> clazz) |
int | hashCode() |
Integer | maxPaddingBytes(): The maximum amount of padding to apply. |
Integer | pageSizeBytes(): The Parquet page size. |
List<SdkField<?>> | sdkFields() |
static Class<? extends ParquetSerDe.Builder> | serializableBuilderClass() |
ParquetSerDe.Builder | toBuilder() |
String | toString(): Returns a string representation of this object. |
ParquetWriterVersion | writerVersion(): Indicates the version of row format to output. |
String | writerVersionAsString(): Indicates the version of row format to output. |
Methods inherited from class Object: clone, finalize, getClass, notify, notifyAll, wait, wait, wait
Methods inherited from interface ToCopyableBuilder: copy
public final Integer blockSizeBytes()
The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Kinesis Data Firehose uses this value for padding calculations.
public final Integer pageSizeBytes()
The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.
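The defaults and minimums stated for blockSizeBytes() and pageSizeBytes() can be applied on the client side before a value is used. The following is a minimal sketch, not part of the SDK: the class ParquetSizeDefaults and its methods are hypothetical, and it assumes that the getters return null when the field was never set. The service performs its own validation, so this kind of check is only a convenience.

```java
// Hypothetical helper, not part of the AWS SDK. It resolves the nullable
// Integer returned by blockSizeBytes() / pageSizeBytes() to an effective
// value using the defaults and minimums documented above.
final class ParquetSizeDefaults {
    static final int KIB = 1024;
    static final int MIB = 1024 * KIB;

    static final int DEFAULT_BLOCK_SIZE_BYTES = 256 * MIB; // documented default
    static final int MIN_BLOCK_SIZE_BYTES = 64 * MIB;      // documented minimum
    static final int DEFAULT_PAGE_SIZE_BYTES = MIB;        // documented default
    static final int MIN_PAGE_SIZE_BYTES = 64 * KIB;       // documented minimum

    private ParquetSizeDefaults() {}

    static int effectiveBlockSize(Integer blockSizeBytes) {
        return resolve(blockSizeBytes, DEFAULT_BLOCK_SIZE_BYTES, MIN_BLOCK_SIZE_BYTES, "blockSizeBytes");
    }

    static int effectivePageSize(Integer pageSizeBytes) {
        return resolve(pageSizeBytes, DEFAULT_PAGE_SIZE_BYTES, MIN_PAGE_SIZE_BYTES, "pageSizeBytes");
    }

    // Assumption: a null value means "not set", so the documented default applies.
    private static int resolve(Integer value, int defaultValue, int minimum, String name) {
        if (value == null) {
            return defaultValue;
        }
        if (value < minimum) {
            throw new IllegalArgumentException(
                    name + " must be at least " + minimum + " bytes, got " + value);
        }
        return value;
    }

    public static void main(String[] args) {
        System.out.println(effectiveBlockSize(null));
        System.out.println(effectivePageSize(128 * KIB));
    }
}
```

With the real class, the calls would look like ParquetSizeDefaults.effectiveBlockSize(serDe.blockSizeBytes()), where serDe is a ParquetSerDe instance.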
public final ParquetCompression compression()
The compression code to use over data blocks. The possible values are UNCOMPRESSED, SNAPPY, and GZIP, with the default being SNAPPY. Use SNAPPY for higher decompression speed. Use GZIP if the compression ratio is more important than speed.
If the service returns an enum value that is not available in the current SDK version, compression() returns ParquetCompression.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available from compressionAsString().
See also: ParquetCompression
public final String compressionAsString()
The compression code to use over data blocks. The possible values are UNCOMPRESSED, SNAPPY, and GZIP, with the default being SNAPPY. Use SNAPPY for higher decompression speed. Use GZIP if the compression ratio is more important than speed.
If the service returns an enum value that is not available in the current SDK version, compression() returns ParquetCompression.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available from compressionAsString().
See also: ParquetCompression
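The relationship between compression() and compressionAsString() can be illustrated with a small stand-in. The sketch below does not use the SDK: ParquetCompressionStandIn and CompressionDescriber are hypothetical names declared locally so the example compiles on its own, and the stand-in only imitates the documented behavior of mapping an unrecognized raw value to UNKNOWN_TO_SDK_VERSION while the raw string stays available.

```java
// Stand-in for the SDK enum, declared locally for illustration only.
enum ParquetCompressionStandIn {
    UNCOMPRESSED("UNCOMPRESSED"),
    GZIP("GZIP"),
    SNAPPY("SNAPPY"),
    UNKNOWN_TO_SDK_VERSION(null);

    private final String value;

    ParquetCompressionStandIn(String value) {
        this.value = value;
    }

    // Maps a raw service value to a constant; unrecognized values map to
    // UNKNOWN_TO_SDK_VERSION, and null (field not set) stays null.
    static ParquetCompressionStandIn fromValue(String raw) {
        if (raw == null) {
            return null;
        }
        for (ParquetCompressionStandIn c : values()) {
            if (raw.equals(c.value)) {
                return c;
            }
        }
        return UNKNOWN_TO_SDK_VERSION;
    }
}

// Hypothetical caller-side handling of the three cases.
final class CompressionDescriber {
    private CompressionDescriber() {}

    static String describe(String rawFromService) {
        ParquetCompressionStandIn parsed = ParquetCompressionStandIn.fromValue(rawFromService);
        if (parsed == null) {
            // Field not set: the documented default applies.
            return "SNAPPY (default)";
        }
        if (parsed == ParquetCompressionStandIn.UNKNOWN_TO_SDK_VERSION) {
            // Fall back to the raw string, as compressionAsString() would expose it.
            return "unrecognized value from service: " + rawFromService;
        }
        return parsed.name();
    }

    public static void main(String[] args) {
        System.out.println(describe("GZIP"));
        System.out.println(describe("ZSTD"));
        System.out.println(describe(null));
    }
}
```

Checking for UNKNOWN_TO_SDK_VERSION before switching on the enum keeps code forward-compatible with values added to the service after the SDK version in use was released.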
public final Boolean enableDictionaryCompression()
Indicates whether to enable dictionary compression.
public final Integer maxPaddingBytes()
The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 0.
public final ParquetWriterVersion writerVersion()
Indicates the version of row format to output. The possible values are V1 and V2. The default is V1.
If the service returns an enum value that is not available in the current SDK version, writerVersion() returns ParquetWriterVersion.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available from writerVersionAsString().
See also: ParquetWriterVersion
public final String writerVersionAsString()
Indicates the version of row format to output. The possible values are V1 and V2. The default is V1.
If the service returns an enum value that is not available in the current SDK version, writerVersion() returns ParquetWriterVersion.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available from writerVersionAsString().
See also: ParquetWriterVersion
public ParquetSerDe.Builder toBuilder()
Specified by: toBuilder in interface ToCopyableBuilder<ParquetSerDe.Builder,ParquetSerDe>
public static ParquetSerDe.Builder builder()
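The builder() and toBuilder() pair follows the usual immutable-object pattern: builder() starts from empty values, while toBuilder() starts from a copy of the current ones, so a modified instance can be built without touching the original. The sketch below illustrates that pattern with a local stand-in. SerDeSketch and its two fields are hypothetical and greatly simplified, and this is not the generated SDK code.

```java
// Simplified stand-in for an immutable SDK model class; illustration only.
final class SerDeSketch {
    private final Integer pageSizeBytes;
    private final String compression;

    private SerDeSketch(Builder b) {
        this.pageSizeBytes = b.pageSizeBytes;
        this.compression = b.compression;
    }

    // Starts a builder with no values set.
    static Builder builder() {
        return new Builder();
    }

    // Starts a builder pre-populated with this object's current values.
    Builder toBuilder() {
        return new Builder().pageSizeBytes(pageSizeBytes).compression(compression);
    }

    Integer pageSizeBytes() {
        return pageSizeBytes;
    }

    String compression() {
        return compression;
    }

    static final class Builder {
        private Integer pageSizeBytes;
        private String compression;

        Builder pageSizeBytes(Integer pageSizeBytes) {
            this.pageSizeBytes = pageSizeBytes;
            return this;
        }

        Builder compression(String compression) {
            this.compression = compression;
            return this;
        }

        SerDeSketch build() {
            return new SerDeSketch(this);
        }
    }

    public static void main(String[] args) {
        SerDeSketch base = SerDeSketch.builder()
                .pageSizeBytes(1024 * 1024)
                .compression("SNAPPY")
                .build();
        // Copy, change one field, and leave the original untouched.
        SerDeSketch gzip = base.toBuilder().compression("GZIP").build();
        System.out.println(base.compression() + " -> " + gzip.compression());
    }
}
```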
public static Class<? extends ParquetSerDe.Builder> serializableBuilderClass()
public final boolean equalsBySdkFields(Object obj)
Specified by: equalsBySdkFields in interface SdkPojo
public final String toString()
Copyright © 2022. All rights reserved.