Class Coder<T>
- java.lang.Object
-
- org.apache.beam.sdk.coders.Coder<T>
-
- Type Parameters:
T
- the type of values being encoded and decoded
- All Implemented Interfaces:
java.io.Serializable
- Direct Known Subclasses:
CustomCoder
,StructuredCoder
,ZstdCoder
public abstract class Coder<T> extends java.lang.Object implements java.io.Serializable
ACoder<T>
defines how to encode and decode values of typeT
into byte streams.Coder
instances are serialized during job creation and deserialized before use. This will generally be performed by serializing the object via Java Serialization.Coder
classes for compound types are often composed from coder classes for types contains therein. The composition ofCoder
instances into a coder for the compound class is the subject of theCoderProvider
type, which enables automatic generic composition ofCoder
classes within theCoderRegistry
. SeeCoderProvider
andCoderRegistry
for more information about how coders are inferred.All methods of a
Coder
are required to be thread safe.- See Also:
- Serialized Form
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static class
Coder.Context
Deprecated.To implement a coder, do not use anyCoder.Context
.static class
Coder.NonDeterministicException
Exception thrown byverifyDeterministic()
if the encoding is not deterministic, including details of why the encoding is not deterministic.
-
Constructor Summary
Constructors Constructor Description Coder()
-
Method Summary
All Methods Static Methods Instance Methods Abstract Methods Concrete Methods Deprecated Methods Modifier and Type Method Description boolean
consistentWithEquals()
abstract T
decode(java.io.InputStream inStream)
Decodes a value of typeT
from the given input stream in the given context.T
decode(java.io.InputStream inStream, Coder.Context context)
Deprecated.only implement and calldecode(InputStream)
abstract void
encode(T value, java.io.OutputStream outStream)
Encodes the given value of typeT
onto the given output stream.void
encode(T value, java.io.OutputStream outStream, Coder.Context context)
Deprecated.only implement and callencode(Object value, OutputStream)
abstract java.util.List<? extends Coder<?>>
getCoderArguments()
protected long
getEncodedElementByteSize(T value)
Returns the size in bytes of the encoded value using this coder.TypeDescriptor<T>
getEncodedTypeDescriptor()
Returns theTypeDescriptor
for the type encoded.boolean
isRegisterByteSizeObserverCheap(T value)
Returns whetherregisterByteSizeObserver(T, org.apache.beam.sdk.util.common.ElementByteSizeObserver)
cheap enough to call for every element, that is, if thisCoder
can calculate the byte size of the element to be coded in roughly constant time (or lazily).void
registerByteSizeObserver(T value, ElementByteSizeObserver observer)
Notifies theElementByteSizeObserver
about the byte size of the encoded value using thisCoder
.java.lang.Object
structuralValue(T value)
Returns an object with anObject.equals()
method that represents structural equality on the argument.abstract void
verifyDeterministic()
ThrowCoder.NonDeterministicException
if the coding is not deterministic.static void
verifyDeterministic(Coder<?> target, java.lang.String message, java.lang.Iterable<Coder<?>> coders)
Verifies all of the provided coders are deterministic.static void
verifyDeterministic(Coder<?> target, java.lang.String message, Coder<?>... coders)
Verifies all of the provided coders are deterministic.
-
-
-
Method Detail
-
encode
public abstract void encode(T value, java.io.OutputStream outStream) throws CoderException, java.io.IOException
Encodes the given value of typeT
onto the given output stream.- Throws:
java.io.IOException
- if writing to theOutputStream
fails for some reasonCoderException
- if the value could not be encoded for some reason
-
encode
@Deprecated @Experimental(CODER_CONTEXT) public void encode(T value, java.io.OutputStream outStream, Coder.Context context) throws CoderException, java.io.IOException
Deprecated.only implement and callencode(Object value, OutputStream)
Encodes the given value of typeT
onto the given output stream in the given context.- Throws:
java.io.IOException
- if writing to theOutputStream
fails for some reasonCoderException
- if the value could not be encoded for some reason
-
decode
public abstract T decode(java.io.InputStream inStream) throws CoderException, java.io.IOException
Decodes a value of typeT
from the given input stream in the given context. Returns the decoded value.- Throws:
java.io.IOException
- if reading from theInputStream
fails for some reasonCoderException
- if the value could not be decoded for some reason
-
decode
@Deprecated @Experimental(CODER_CONTEXT) public T decode(java.io.InputStream inStream, Coder.Context context) throws CoderException, java.io.IOException
Deprecated.only implement and calldecode(InputStream)
Decodes a value of typeT
from the given input stream in the given context. Returns the decoded value.- Throws:
java.io.IOException
- if reading from theInputStream
fails for some reasonCoderException
- if the value could not be decoded for some reason
-
getCoderArguments
public abstract java.util.List<? extends Coder<?>> getCoderArguments()
-
verifyDeterministic
public abstract void verifyDeterministic() throws Coder.NonDeterministicException
ThrowCoder.NonDeterministicException
if the coding is not deterministic.In order for a
Coder
to be considered deterministic, the following must be true:- two values that compare as equal (via
Object.equals()
orComparable.compareTo()
, if supported) have the same encoding. - the
Coder
always produces a canonical encoding, which is the same for an instance of an object even if produced on different computers at different times.
- Throws:
Coder.NonDeterministicException
- if this coder is not deterministic.
- two values that compare as equal (via
-
verifyDeterministic
public static void verifyDeterministic(Coder<?> target, java.lang.String message, java.lang.Iterable<Coder<?>> coders) throws Coder.NonDeterministicException
Verifies all of the provided coders are deterministic. If any are not, throws aCoder.NonDeterministicException
for thetarget
Coder
.- Throws:
Coder.NonDeterministicException
-
verifyDeterministic
public static void verifyDeterministic(Coder<?> target, java.lang.String message, Coder<?>... coders) throws Coder.NonDeterministicException
Verifies all of the provided coders are deterministic. If any are not, throws aCoder.NonDeterministicException
for thetarget
Coder
.- Throws:
Coder.NonDeterministicException
-
consistentWithEquals
public boolean consistentWithEquals()
Returnstrue
if thisCoder
is injective with respect toObject.equals(java.lang.Object)
.Whenever the encoded bytes of two values are equal, then the original values are equal according to
Objects.equals()
. Note that this is well-defined fornull
.This condition is most notably false for arrays. More generally, this condition is false whenever
equals()
compares object identity, rather than performing a semantic/structural comparison.By default, returns false.
-
structuralValue
public java.lang.Object structuralValue(T value)
Returns an object with anObject.equals()
method that represents structural equality on the argument.For any two values
x
andy
of typeT
, if their encoded bytes are the same, then it must be the case thatstructuralValue(x).equals(structuralValue(y))
.Most notably:
- The structural value for an array coder should perform a structural comparison of the contents of the arrays, rather than the default behavior of comparing according to object identity.
- The structural value for a coder accepting
null
should be a proper object with anequals()
method, even if the input value isnull
.
See also
consistentWithEquals()
.By default, if this coder is
consistentWithEquals()
, and the value is not null, returns the provided object. Otherwise, encodes the value into abyte[]
, and returns an object that performs array equality on the encoded bytes.
-
isRegisterByteSizeObserverCheap
public boolean isRegisterByteSizeObserverCheap(T value)
Returns whetherregisterByteSizeObserver(T, org.apache.beam.sdk.util.common.ElementByteSizeObserver)
cheap enough to call for every element, that is, if thisCoder
can calculate the byte size of the element to be coded in roughly constant time (or lazily).Not intended to be called by user code, but instead by
PipelineRunner
implementations.By default, returns false. The default
registerByteSizeObserver(T, org.apache.beam.sdk.util.common.ElementByteSizeObserver)
implementation invokesgetEncodedElementByteSize(T)
which requires re-encoding an element unless it is overridden. This is considered expensive.
-
registerByteSizeObserver
public void registerByteSizeObserver(T value, ElementByteSizeObserver observer) throws java.lang.Exception
Notifies theElementByteSizeObserver
about the byte size of the encoded value using thisCoder
.Not intended to be called by user code, but instead by
PipelineRunner
implementations.By default, this notifies
observer
about the byte size of the encoded value using this coder as returned bygetEncodedElementByteSize(T)
.- Throws:
java.lang.Exception
-
getEncodedElementByteSize
protected long getEncodedElementByteSize(T value) throws java.lang.Exception
Returns the size in bytes of the encoded value using this coder.- Throws:
java.lang.Exception
-
getEncodedTypeDescriptor
@Experimental(CODER_TYPE_ENCODING) public TypeDescriptor<T> getEncodedTypeDescriptor()
Returns theTypeDescriptor
for the type encoded.
-
-