- add(Long) - Method in class com.google.cloud.dataflow.sdk.transforms.ApproximateUnique.ApproximateUniqueCombineFn.LargestUnique
-
Adds a value to the heap, returning whether the value is (large enough
to be) in the heap.
- addCollectionToSingletonOutput(String, PValue, PValue) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Adds an output with the given name to the previously added
CollectionToSingleton Dataflow step, consuming the specified
input PValue and producing the specified output
PValue.
- addComposite(TransformTreeNode) - Method in class com.google.cloud.dataflow.sdk.runners.TransformTreeNode
-
Adds a composite operation to the transform node.
- addEncodingInput(Coder<?>) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Sets the encoding for the current Dataflow step.
- addInput(String, Boolean) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Adds an input with the given name and value to the current
Dataflow step.
- addInput(String, String) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Adds an input with the given name and value to the current
Dataflow step.
- addInput(String, Long) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Adds an input with the given name and value to the current
Dataflow step.
- addInput(String, PInput) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Adds an input with the given name to the previously added Dataflow
step, coming from the specified input PValue.
- addInput(String, Map<String, Object>) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Adds an input which is a dictionary of strings to objects.
- addInput(String, List<? extends Map<String, Object>>) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Adds an input which is a list of objects.
- addInput(TransformTreeNode, PInput) - Method in class com.google.cloud.dataflow.sdk.runners.TransformHierarchy
-
Adds an input to the given node.
- addInput(ApproximateUnique.ApproximateUniqueCombineFn.LargestUnique, T) - Method in class com.google.cloud.dataflow.sdk.transforms.ApproximateUnique.ApproximateUniqueCombineFn
-
- addInput(VI) - Method in interface com.google.cloud.dataflow.sdk.transforms.Combine.AccumulatingCombineFn.Accumulator
-
Adds the given input value to this accumulator, modifying
this accumulator.
- addInput(VA, VI) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.AccumulatingCombineFn
-
- addInput(VA, VI) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.CombineFn
-
Adds the given input value to the given accumulator,
modifying the accumulator.
- addInput(K, VA, VI) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.KeyedCombineFn
-
Adds the given input value to the given accumulator,
modifying the accumulator.
- addInput(Top.TopCombineFn<KV<Integer, T>>.Heap, T) - Method in class com.google.cloud.dataflow.sdk.transforms.Sample.FixedSizedSampleFn
-
- addInputProducer(PValue, TransformTreeNode) - Method in class com.google.cloud.dataflow.sdk.runners.TransformTreeNode
-
Adds an input to the transform node.
- addInputsRandomly(Combine.KeyedCombineFn<? super K, ? super VI, VA, ?>, K, Iterable<VI>, Random) - Static method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.TestCombineDoFn
-
- addOutput(String, PValue) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Adds an output with the given name to the previously added
Dataflow step, producing the specified output PValue,
including its Coder if a TypedPValue.
- addStep(PTransform<?, ?>, String) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Adds a step to the Dataflow workflow for the given transform, with
the given Dataflow step type.
- addStep(PTransform<?, ? extends PValue>, Step) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Adds a pre-defined step to the Dataflow workflow.
- addValue(VI) - Method in interface com.google.cloud.dataflow.sdk.transforms.Aggregator
-
Adds a new value into the Aggregator.
- addValueInternal(PValue) - Method in class com.google.cloud.dataflow.sdk.Pipeline
-
Adds the given PValue to this Pipeline.
- addValueOnlyOutput(String, PValue) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Adds an output with the given name to the previously added
Dataflow step, producing the specified output PValue,
including its Coder if a TypedPValue.
- advance() - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.DatastoreReader
-
- advance() - Method in interface com.google.cloud.dataflow.sdk.io.Source.Reader
-
Advances the iterator to the next valid record.
- Aggregator<VI> - Interface in com.google.cloud.dataflow.sdk.transforms
-
An Aggregator enables arbitrary monitoring in user code.
- and(TupleTag<V>, List<V>) - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult
-
Returns a new CoGbkResult based on this, with the given tag and given data
added to it.
- and(TupleTag<V>, PCollection<KV<K, V>>) - Method in class com.google.cloud.dataflow.sdk.transforms.join.KeyedPCollectionTuple
-
Returns a new KeyedPCollections<K> that is the same as this,
appended with the given PCollection.
- and(PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.values.PCollectionList
-
Returns a new PCollectionList that has all the PCollections of
this PCollectionList plus the given PCollection appended to the end.
- and(Iterable<PCollection<T>>) - Method in class com.google.cloud.dataflow.sdk.values.PCollectionList
-
Returns a new PCollectionList that has all the PCollections of
this PCollectionList plus the given PCollections appended to the end,
in order.
- and(TupleTag<T>, PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.values.PCollectionTuple
-
Returns a new PCollectionTuple that has all the PCollections and
tags of this PCollectionTuple plus the given PCollection and tag.
- and(TupleTag<?>) - Method in class com.google.cloud.dataflow.sdk.values.TupleTagList
-
Returns a new TupleTagList that has all the TupleTags of
this TupleTagList plus the given TupleTag appended to the end.
- and(List<TupleTag<?>>) - Method in class com.google.cloud.dataflow.sdk.values.TupleTagList
-
Returns a new TupleTagList that has all the TupleTags of
this TupleTagList plus the given TupleTags appended to the end,
in order.
- ApplicationNameOptions - Interface in com.google.cloud.dataflow.sdk.options
-
Options that allow setting the application name.
- apply(Input) - Method in interface com.google.cloud.dataflow.sdk.coders.DelegateCoder.CodingFunction
-
- apply(PInput) - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read.Bound
-
- apply(PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
- apply(PInput) - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Read.Bound
-
- apply(PCollection<TableRow>) - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
- apply(PCollection<DatastoreV1.Entity>) - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Sink
-
- apply(PInput) - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
- apply(PCollection<String>) - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Write.Bound
-
- apply(PInput) - Method in class com.google.cloud.dataflow.sdk.io.ReadSource.Bound
-
- apply(PInput) - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Read.Bound
-
- apply(PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
- apply(PTransform<? super PBegin, Output>) - Method in class com.google.cloud.dataflow.sdk.Pipeline
-
Starts using this pipeline with a root PTransform such as
TextIO.Read or
Create.
- apply(PTransform<Input, Output>, Input) - Method in class com.google.cloud.dataflow.sdk.runners.BlockingDataflowPipelineRunner
-
- apply(PTransform<Input, Output>, Input) - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineRunner
-
- apply(PTransform<Input, Output>, Input) - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner
-
- apply(PTransform<Input, Output>, Input) - Method in class com.google.cloud.dataflow.sdk.runners.PipelineRunner
-
Applies a transform to the given input, returning the output.
- apply(Actual) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.AssertThat
-
- apply(Iterable<? extends VI>) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.CombineFn
-
Applies this CombineFn to a collection of input values
to produce a combined output value.
- apply(PCollection<VI>) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.Globally
-
- apply(PCollection<? extends KV<K, ? extends Iterable<VI>>>) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.GroupedValues
-
- apply(K, Iterable<? extends VI>) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.KeyedCombineFn
-
Applies this KeyedCombineFn to a key and a collection
of input values to produce a combined output value.
- apply(PCollection<KV<K, VI>>) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.PerKey
-
- apply(PCollection<KV<K, VI>>) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.PerKeyWithHotKeys
-
- apply(PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.Count.Globally
-
- apply(PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.Count.PerElement
-
- apply(PInput) - Method in class com.google.cloud.dataflow.sdk.transforms.Create
-
- apply(PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.First
-
- apply(PCollection<Iterable<T>>) - Method in class com.google.cloud.dataflow.sdk.transforms.Flatten.FlattenIterables
-
- apply(PCollectionList<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.Flatten.FlattenPCollectionList
-
- apply(PCollection<KV<K, V>>) - Method in class com.google.cloud.dataflow.sdk.transforms.GroupByKey
-
- apply(PCollection<KV<K, Iterable<WindowedValue<V>>>>) - Method in class com.google.cloud.dataflow.sdk.transforms.GroupByKey.GroupAlsoByWindow
-
- apply(PCollection<KV<K, V>>) - Method in class com.google.cloud.dataflow.sdk.transforms.GroupByKey.GroupByKeyOnly
-
- apply(PCollection<KV<K, V>>) - Method in class com.google.cloud.dataflow.sdk.transforms.GroupByKey.ReifyTimestampsAndWindows
-
- apply(PCollection<KV<K, Iterable<WindowedValue<V>>>>) - Method in class com.google.cloud.dataflow.sdk.transforms.GroupByKey.SortValuesByTimestamp
-
- apply(KeyedPCollectionTuple<K>) - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGroupByKey
-
- apply(PTransform<KeyedPCollectionTuple<K>, O>) - Method in class com.google.cloud.dataflow.sdk.transforms.join.KeyedPCollectionTuple
-
Applies the given transform to this input.
- apply(PCollection<? extends KV<K, ?>>) - Method in class com.google.cloud.dataflow.sdk.transforms.Keys
-
- apply(PCollection<KV<A, B>>) - Method in class com.google.cloud.dataflow.sdk.transforms.KvSwap
-
- apply(Iterable<N>) - Method in class com.google.cloud.dataflow.sdk.transforms.Max.MaxFn
-
- apply(Iterable<N>) - Method in class com.google.cloud.dataflow.sdk.transforms.Min.MinFn
-
- apply(PCollection<? extends I>) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Bound
-
- apply(PCollection<? extends I>) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.BoundMulti
-
- apply(PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.Partition
-
- apply(Input) - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Applies this PTransform on the given Input, and returns its
Output.
- apply(PCollection<? extends I>) - Method in class com.google.cloud.dataflow.sdk.transforms.RateLimiting.RateLimitingTransform
-
- apply(PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.RemoveDuplicates
-
- apply(I) - Method in interface com.google.cloud.dataflow.sdk.transforms.SerializableFunction
-
Returns the result of invoking this function on the given input.
- apply(Iterable<Double>) - Method in class com.google.cloud.dataflow.sdk.transforms.Sum.SumDoubleFn
-
- apply(Iterable<Integer>) - Method in class com.google.cloud.dataflow.sdk.transforms.Sum.SumIntegerFn
-
- apply(Iterable<Long>) - Method in class com.google.cloud.dataflow.sdk.transforms.Sum.SumLongFn
-
- apply(PCollection<? extends KV<?, V>>) - Method in class com.google.cloud.dataflow.sdk.transforms.Values
-
- apply(PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.View.AsIterable
-
- apply(PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.View.AsSingleton
-
- apply(PCollection<R>) - Method in class com.google.cloud.dataflow.sdk.transforms.View.CreatePCollectionView
-
- apply(PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.Window.Bound
-
- apply(PCollection<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.Window.Remerge
-
- apply(PCollection<V>) - Method in class com.google.cloud.dataflow.sdk.transforms.WithKeys
-
- apply(PTransform<? super PBegin, Output>) - Method in class com.google.cloud.dataflow.sdk.values.PBegin
-
Applies the given PTransform to this input PBegin, and
returns the PTransform's Output.
- apply(PTransform<? super PCollection<T>, Output>) - Method in class com.google.cloud.dataflow.sdk.values.PCollection
-
Applies the given PTransform to this input PCollection, and
returns the PTransform's Output.
- apply(PTransform<PCollectionList<T>, Output>) - Method in class com.google.cloud.dataflow.sdk.values.PCollectionList
-
Applies the given PTransform to this input PCollectionList<T>,
and returns the PTransform's Output.
- apply(PTransform<PCollectionTuple, Output>) - Method in class com.google.cloud.dataflow.sdk.values.PCollectionTuple
-
Applies the given PTransform to this input PCollectionTuple, and
returns the PTransform's Output.
- applyHelper(PCollection<KV<K, V>>, boolean, boolean) - Method in class com.google.cloud.dataflow.sdk.transforms.GroupByKey
-
- applyTransform(Input, PTransform<? super Input, Output>) - Static method in class com.google.cloud.dataflow.sdk.Pipeline
-
Applies the given PTransform to the given Input,
and returns its Output.
- ApproximateQuantiles - Class in com.google.cloud.dataflow.sdk.transforms
-
PTransforms for getting an idea of a PCollection's
data distribution using approximate N-tiles, either
globally or per-key.
- ApproximateQuantiles() - Constructor for class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles
-
- ApproximateQuantiles.ApproximateQuantilesCombineFn<T,C extends java.util.Comparator<T> & java.io.Serializable> - Class in com.google.cloud.dataflow.sdk.transforms
-
The ApproximateQuantilesCombineFn combiner gives an idea
of the distribution of a collection of values using approximate
N-tiles.
- ApproximateUnique - Class in com.google.cloud.dataflow.sdk.transforms
-
PTransforms for estimating the number of distinct elements
in a PCollection, or the number of distinct values
associated with each key in a PCollection of KVs.
- ApproximateUnique() - Constructor for class com.google.cloud.dataflow.sdk.transforms.ApproximateUnique
-
- ApproximateUnique.ApproximateUniqueCombineFn<T> - Class in com.google.cloud.dataflow.sdk.transforms
-
CombineFn that computes an estimate of the number of
distinct values that were combined.
- ApproximateUnique.ApproximateUniqueCombineFn(long, Coder<T>) - Constructor for class com.google.cloud.dataflow.sdk.transforms.ApproximateUnique.ApproximateUniqueCombineFn
-
- ApproximateUnique.ApproximateUniqueCombineFn.LargestUnique - Class in com.google.cloud.dataflow.sdk.transforms
-
A heap utility class to efficiently track the largest added elements.
- ApproximateUnique.ApproximateUniqueCombineFn.LargestUnique(long) - Constructor for class com.google.cloud.dataflow.sdk.transforms.ApproximateUnique.ApproximateUniqueCombineFn.LargestUnique
-
Creates a heap to track the largest sampleSize elements.
- as(Class<T>) - Method in interface com.google.cloud.dataflow.sdk.options.PipelineOptions
-
Transforms this object into an object of type .
- as(Class<T>) - Static method in class com.google.cloud.dataflow.sdk.options.PipelineOptionsFactory
-
Creates and returns an object which implements <T>.
- as(Class<T>) - Method in class com.google.cloud.dataflow.sdk.options.PipelineOptionsFactory.Builder
-
Creates and returns an object which implements <T> using the values configured on
this builder during construction.
- asCloudObject() - Method in class com.google.cloud.dataflow.sdk.coders.AvroCoder
-
- asCloudObject() - Method in interface com.google.cloud.dataflow.sdk.coders.Coder
-
Returns the CloudObject that represents this Coder.
- asCloudObject() - Method in class com.google.cloud.dataflow.sdk.coders.CustomCoder
-
- asCloudObject() - Method in class com.google.cloud.dataflow.sdk.coders.IterableCoder
-
- asCloudObject() - Method in class com.google.cloud.dataflow.sdk.coders.KvCoder
-
- asCloudObject() - Method in class com.google.cloud.dataflow.sdk.coders.SerializableCoder
-
- asCloudObject() - Method in class com.google.cloud.dataflow.sdk.coders.StandardCoder
-
- asCloudObject() - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult.CoGbkResultCoder
-
- asCloudObject() - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResultSchema
-
- asCloudObject() - Method in class com.google.cloud.dataflow.sdk.values.TupleTag
-
- asIterable() - Static method in class com.google.cloud.dataflow.sdk.transforms.View
-
- asKeyedFn() - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.CombineFn
-
Converts this
CombineFn into an equivalent
Combine.KeyedCombineFn, which ignores the keys passed to it and
combines the values according to this
CombineFn.
- asOutputReference(PValue) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Encode a PValue reference as an output reference.
- assertFor(Expected) - Method in interface com.google.cloud.dataflow.sdk.testing.DataflowAssert.AssertRelation
-
- assignWindow(Instant) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.DaysWindows
-
- assignWindow(Instant) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.MonthsWindows
-
- assignWindow(Instant) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.YearsWindows
-
- assignWindow(Instant) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.FixedWindows
-
- assignWindow(Instant) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.PartitioningWindowFn
-
Returns the single window to which elements with this timestamp belong.
- assignWindows(WindowFn<Object, GlobalWindow>.AssignContext) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.GlobalWindows
-
- assignWindows(WindowFn<Object, W>.AssignContext) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.InvalidWindows
-
- assignWindows(WindowFn<T, W>.AssignContext) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.PartitioningWindowFn
-
- assignWindows(WindowFn<Object, IntervalWindow>.AssignContext) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.Sessions
-
- assignWindows(WindowFn<Object, IntervalWindow>.AssignContext) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.SlidingWindows
-
- assignWindows(WindowFn<T, W>.AssignContext) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.WindowFn
-
Given a timestamp and element, returns the set of windows into which it
should be placed.
- asSingleton() - Static method in class com.google.cloud.dataflow.sdk.transforms.View
-
- AtomicCoder<T> - Class in com.google.cloud.dataflow.sdk.coders
-
An AtomicCoder is one that has no component Coders or other state.
- AtomicCoder() - Constructor for class com.google.cloud.dataflow.sdk.coders.AtomicCoder
-
- AvroCoder<T> - Class in com.google.cloud.dataflow.sdk.coders
-
An encoder using Avro binary format.
- AvroCoder(Class<T>, Schema) - Constructor for class com.google.cloud.dataflow.sdk.coders.AvroCoder
-
- AvroCoder.AvroDeterminismChecker - Class in com.google.cloud.dataflow.sdk.coders
-
Helper class encapsulating the various pieces of state maintained by the
recursive walk used for checking if the encoding will be deterministic.
- AvroCoder.AvroDeterminismChecker() - Constructor for class com.google.cloud.dataflow.sdk.coders.AvroCoder.AvroDeterminismChecker
-
- AvroIO - Class in com.google.cloud.dataflow.sdk.io
-
Transforms for reading and writing Avro files.
- AvroIO() - Constructor for class com.google.cloud.dataflow.sdk.io.AvroIO
-
- AvroIO.Read - Class in com.google.cloud.dataflow.sdk.io
-
A root PTransform that reads from an Avro file (or multiple Avro
files matching a pattern) and returns a PCollection containing
the decoding of each record.
- AvroIO.Read() - Constructor for class com.google.cloud.dataflow.sdk.io.AvroIO.Read
-
- AvroIO.Read.Bound<T> - Class in com.google.cloud.dataflow.sdk.io
-
A PTransform that reads from an Avro file (or multiple Avro
files matching a pattern) and returns a bounded PCollection containing
the decoding of each record.
- AvroIO.Write - Class in com.google.cloud.dataflow.sdk.io
-
A root PTransform that writes a PCollection to an Avro file (or
multiple Avro files matching a sharding pattern).
- AvroIO.Write() - Constructor for class com.google.cloud.dataflow.sdk.io.AvroIO.Write
-
- AvroIO.Write.Bound<T> - Class in com.google.cloud.dataflow.sdk.io
-
A PTransform that writes a bounded PCollection to an Avro file (or
multiple Avro files matching a sharding pattern).
- CalendarWindows - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
A collection of
WindowFns that windows values into calendar-based
windows such as spans of days, months, or years.
- CalendarWindows() - Constructor for class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows
-
- CalendarWindows.DaysWindows - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
A
WindowFn that windows elements into periods measured by days.
- CalendarWindows.MonthsWindows - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
A
WindowFn that windows elements into periods measured by months.
- CalendarWindows.YearsWindows - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
A
WindowFn that windows elements into periods measured by years.
- check(TypeToken<?>, Schema) - Method in class com.google.cloud.dataflow.sdk.coders.AvroCoder.AvroDeterminismChecker
-
- clearOutputElements() - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
Clears the record of the elements output so far to the main output.
- clearSideOutputElements(TupleTag<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
Clears the record of the elements output so far to the side
output with the given tag.
- close() - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.DatastoreReader
-
- close() - Method in interface com.google.cloud.dataflow.sdk.io.Source.Reader
-
Closes the iterator.
- CloudDebuggerOptions - Interface in com.google.cloud.dataflow.sdk.options
-
Options for controlling Cloud Debugger.
- CloudDebuggerOptions.DebuggerConfig - Class in com.google.cloud.dataflow.sdk.options
-
Return a JSON string for the Debugger metadata item.
- CloudDebuggerOptions.DebuggerConfig() - Constructor for class com.google.cloud.dataflow.sdk.options.CloudDebuggerOptions.DebuggerConfig
-
- CodedTupleTag<T> - Class in com.google.cloud.dataflow.sdk.values
-
A
TupleTag combined with the
Coder to use for
values associated with the tag.
- CodedTupleTagMap - Class in com.google.cloud.dataflow.sdk.values
-
- Coder<T> - Interface in com.google.cloud.dataflow.sdk.coders
-
A Coder<T> defines how to encode and decode values of type T into byte streams.
- Coder.Context - Class in com.google.cloud.dataflow.sdk.coders
-
The context in which encoding or decoding is being done.
- Coder.Context(boolean) - Constructor for class com.google.cloud.dataflow.sdk.coders.Coder.Context
-
- Coder.NonDeterministicException - Class in com.google.cloud.dataflow.sdk.coders
-
- Coder.NonDeterministicException(Coder<?>, String, Coder.NonDeterministicException) - Constructor for class com.google.cloud.dataflow.sdk.coders.Coder.NonDeterministicException
-
- Coder.NonDeterministicException(Coder<?>, String) - Constructor for class com.google.cloud.dataflow.sdk.coders.Coder.NonDeterministicException
-
- Coder.NonDeterministicException(Coder<?>, List<String>) - Constructor for class com.google.cloud.dataflow.sdk.coders.Coder.NonDeterministicException
-
- Coder.NonDeterministicException(Coder<?>, List<String>, Coder.NonDeterministicException) - Constructor for class com.google.cloud.dataflow.sdk.coders.Coder.NonDeterministicException
-
- CoderException - Exception in com.google.cloud.dataflow.sdk.coders
-
A CoderException is thrown if there is a problem encoding or
decoding a value.
- CoderException(String) - Constructor for exception com.google.cloud.dataflow.sdk.coders.CoderException
-
- CoderException(String, Throwable) - Constructor for exception com.google.cloud.dataflow.sdk.coders.CoderException
-
- CoderException(Throwable) - Constructor for exception com.google.cloud.dataflow.sdk.coders.CoderException
-
- CoderRegistry - Class in com.google.cloud.dataflow.sdk.coders
-
A CoderRegistry allows registering the default Coder to use for a Java class,
and looking up and instantiating the default Coder for a Java type.
- CoderRegistry() - Constructor for class com.google.cloud.dataflow.sdk.coders.CoderRegistry
-
- CoderRegistry.CoderFactory - Class in com.google.cloud.dataflow.sdk.coders
-
A factory for default Coders for values of a particular class.
- CoderRegistry.CoderFactory() - Constructor for class com.google.cloud.dataflow.sdk.coders.CoderRegistry.CoderFactory
-
- CoderRegistry.ConstantCoderFactory - Class in com.google.cloud.dataflow.sdk.coders
-
A factory that always returns the coder with which it is instantiated.
- CoderRegistry.ConstantCoderFactory(Coder<?>) - Constructor for class com.google.cloud.dataflow.sdk.coders.CoderRegistry.ConstantCoderFactory
-
- CoGbkResult - Class in com.google.cloud.dataflow.sdk.transforms.join
-
A row result of a CoGroupByKey.
- CoGbkResult(CoGbkResultSchema, Iterable<RawUnionValue>) - Constructor for class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult
-
A row in the PCollection resulting from a CoGroupByKey transform.
- CoGbkResult.CoGbkResultCoder - Class in com.google.cloud.dataflow.sdk.transforms.join
-
A coder for CoGbkResults.
- CoGbkResultSchema - Class in com.google.cloud.dataflow.sdk.transforms.join
-
A schema for the results of a CoGroupByKey.
- CoGbkResultSchema(TupleTagList) - Constructor for class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResultSchema
-
Builds a schema from a tuple of TupleTag<?>s.
- CoGroupByKey<K> - Class in com.google.cloud.dataflow.sdk.transforms.join
-
A transform that performs a CoGroupByKey on a tuple of tables.
- CollectionCoder<T> - Class in com.google.cloud.dataflow.sdk.coders
-
A CollectionCoder encodes Collections.
- CollectionCoder(Coder<T>) - Constructor for class com.google.cloud.dataflow.sdk.coders.CollectionCoder
-
- com.google.cloud.dataflow.sdk - package com.google.cloud.dataflow.sdk
-
Provides a simple, powerful model for building both batch and
streaming parallel data processing
Pipelines.
- com.google.cloud.dataflow.sdk.coders - package com.google.cloud.dataflow.sdk.coders
-
Defines
Coders
to specify how data is encoded to and decoded from byte strings.
- com.google.cloud.dataflow.sdk.io - package com.google.cloud.dataflow.sdk.io
-
- com.google.cloud.dataflow.sdk.options - package com.google.cloud.dataflow.sdk.options
-
- com.google.cloud.dataflow.sdk.runners - package com.google.cloud.dataflow.sdk.runners
-
- com.google.cloud.dataflow.sdk.testing - package com.google.cloud.dataflow.sdk.testing
-
Defines utilities for unit testing Dataflow pipelines.
- com.google.cloud.dataflow.sdk.transforms - package com.google.cloud.dataflow.sdk.transforms
-
Defines
PTransforms for transforming
data in a pipeline.
- com.google.cloud.dataflow.sdk.transforms.join - package com.google.cloud.dataflow.sdk.transforms.join
-
Defines the
CoGroupByKey transform
for joining multiple PCollections.
- com.google.cloud.dataflow.sdk.transforms.windowing - package com.google.cloud.dataflow.sdk.transforms.windowing
-
Defines the
Window transform
for dividing the elements in a PCollection into windows.
- com.google.cloud.dataflow.sdk.values - package com.google.cloud.dataflow.sdk.values
-
- Combine - Class in com.google.cloud.dataflow.sdk.transforms
-
PTransforms for combining PCollection elements
globally and per-key.
- Combine() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Combine
-
- Combine.AccumulatingCombineFn<VI,VA extends Combine.AccumulatingCombineFn.Accumulator<VI,VA,VO>,VO> - Class in com.google.cloud.dataflow.sdk.transforms
-
- Combine.AccumulatingCombineFn() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Combine.AccumulatingCombineFn
-
- Combine.AccumulatingCombineFn.Accumulator<VI,VA,VO> - Interface in com.google.cloud.dataflow.sdk.transforms
-
The type of mutable accumulator values used by this
AccumulatingCombineFn.
- Combine.CombineFn<VI,VA,VO> - Class in com.google.cloud.dataflow.sdk.transforms
-
A CombineFn<VI, VA, VO> specifies how to combine a
collection of input values of type VI into a single
output value of type VO.
- Combine.CombineFn() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Combine.CombineFn
-
- Combine.Globally<VI,VO> - Class in com.google.cloud.dataflow.sdk.transforms
-
Combine.Globally<VI, VO> takes a
PCollection<VI>
and returns a
PCollection<VO> whose single element is the result of
combining all the elements of the input
PCollection,
using a specified
CombineFn.
- Combine.GroupedValues<K,VI,VO> - Class in com.google.cloud.dataflow.sdk.transforms
-
GroupedValues<K, VI, VO> takes a
PCollection<KV<K, Iterable<VI>>>, such as the result of
GroupByKey, applies a specified
KeyedCombineFn
to each of the input
KV<K, Iterable<VI>> elements to
produce a combined output
KV<K, VO> element, and returns a
PCollection<KV<K, VO>> containing all the combined output
elements.
- Combine.KeyedCombineFn<K,VI,VA,VO> - Class in com.google.cloud.dataflow.sdk.transforms
-
A KeyedCombineFn<K, VI, VA, VO> specifies how to combine
a collection of input values of type VI, associated with
a key of type K, into a single output value of type
VO.
- Combine.KeyedCombineFn() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Combine.KeyedCombineFn
-
- Combine.PerKey<K,VI,VO> - Class in com.google.cloud.dataflow.sdk.transforms
-
PerKey<K, VI, VO> takes a
PCollection<KV<K, VI>>, groups it by key, applies a
combining function to the VI values associated with each
key to produce a combined VO value, and returns a
PCollection<KV<K, VO>> representing a map from each
distinct key of the input PCollection to the corresponding
combined value.
- Combine.PerKeyWithHotKeys<K,VI,VO> - Class in com.google.cloud.dataflow.sdk.transforms
-
- compare(T, T) - Method in class com.google.cloud.dataflow.sdk.transforms.Top.Largest
-
- compare(T, T) - Method in class com.google.cloud.dataflow.sdk.transforms.Top.Smallest
-
- compare(KV<K, V>, KV<K, V>) - Method in class com.google.cloud.dataflow.sdk.values.KV.OrderByKey
-
- compare(KV<K, V>, KV<K, V>) - Method in class com.google.cloud.dataflow.sdk.values.KV.OrderByValue
-
- compareTo(IntervalWindow) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.IntervalWindow
-
- computeMetadataString() - Method in class com.google.cloud.dataflow.sdk.options.CloudDebuggerOptions.DebuggerConfig
-
Compute the string of Debugger config.
- contains(IntervalWindow) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.IntervalWindow
-
Returns whether this window contains the given window.
- containsInAnyOrder(Iterable<T>) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.IterableAssert
-
Checks that the Iterable contains the expected elements, in any
order.
- containsInAnyOrder(T...) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.IterableAssert
-
Checks that the Iterable contains the expected elements, in any
order.
- containsInOrder(T...) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.IterableAssert
-
Checks that the Iterable contains the expected elements, in the
specified order.
- containsInOrder(Iterable<T>) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.IterableAssert
-
Checks that the Iterable contains the expected elements, in the
specified order.
- Count - Class in com.google.cloud.dataflow.sdk.transforms
-
Count<T> takes a PCollection<T> and returns a
PCollection<KV<T, Long>> representing a map from each
distinct element of the input PCollection to the number of times
that element occurs in the input.
- Count() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Count
-
- Count.Globally<T> - Class in com.google.cloud.dataflow.sdk.transforms
-
Count.Globally<T> takes a PCollection<T> and returns a
PCollection<Long> containing a single element which is the total
number of elements in the PCollection.
- Count.Globally() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Count.Globally
-
- Count.PerElement<T> - Class in com.google.cloud.dataflow.sdk.transforms
-
Count.PerElement<T> takes a PCollection<T> and returns a
PCollection<KV<T, Long>> representing a map from each
distinct element of the input PCollection to the number of times
that element occurs in the input.
- Count.PerElement() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Count.PerElement
-
- create(List<? extends Coder<?>>) - Method in class com.google.cloud.dataflow.sdk.coders.CoderRegistry.CoderFactory
-
Returns the default Coder to use for values of a particular type,
given the Coders for each of the type's generic parameter types.
- create(List<? extends Coder<?>>) - Method in class com.google.cloud.dataflow.sdk.coders.CoderRegistry.ConstantCoderFactory
-
- create(PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.options.BlockingDataflowPipelineOptions.StandardOutputFactory
-
- create(PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions.PathValidatorFactory
-
- create(PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions.StagerFactory
-
- create(PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.options.DataflowPipelineOptions.DataflowClientFactory
-
- create(PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.options.DataflowPipelineOptions.JobNameFactory
-
- create(String) - Static method in class com.google.cloud.dataflow.sdk.options.DataflowWorkerLoggingOptions.WorkerLogLevelOverride
-
Expects a value of the form Name#Level.
- create(PipelineOptions) - Method in interface com.google.cloud.dataflow.sdk.options.DefaultValueFactory
-
- create(PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.options.GcpOptions.CredentialDirFactory
-
- create(PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.options.GcpOptions.GcpUserCredentialsFactory
-
- create(PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.options.GcsOptions.ExecutorServiceFactory
-
- create(AbstractGoogleClient, String) - Static method in class com.google.cloud.dataflow.sdk.options.GoogleApiDebugOptions.GoogleApiTracer
-
- create(AbstractGoogleClientRequest<?>, String) - Static method in class com.google.cloud.dataflow.sdk.options.GoogleApiDebugOptions.GoogleApiTracer
-
- create(String) - Static method in class com.google.cloud.dataflow.sdk.options.GoogleApiDebugOptions.GoogleApiTracer
-
Creates a GoogleClientRequestInitializer which adds the trace token
based upon the passed in value.
- create() - Method in class com.google.cloud.dataflow.sdk.options.PipelineOptionsFactory.Builder
-
Creates and returns an object which implements
PipelineOptions using the values
configured on this builder during construction.
- create() - Static method in class com.google.cloud.dataflow.sdk.options.PipelineOptionsFactory
-
- create(PipelineOptions) - Static method in class com.google.cloud.dataflow.sdk.Pipeline
-
Constructs a pipeline from the provided options.
- create(DataflowPipelineOptions) - Static method in class com.google.cloud.dataflow.sdk.runners.DataflowPipeline
-
Creates and returns a new DataflowPipeline instance for tests.
- create(Combine.GroupedValues<K, VI, VO>, boolean) - Static method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.TestCombineDoFn
-
- create() - Static method in class com.google.cloud.dataflow.sdk.testing.TestPipeline
-
Creates and returns a new test pipeline.
- create(int, C) - Static method in class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles.ApproximateQuantilesCombineFn
-
Returns an approximate quantiles combiner with the given
compareFn and desired number of quantiles.
- create(int) - Static method in class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles.ApproximateQuantilesCombineFn
-
- create(int, C, long, double) - Static method in class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles.ApproximateQuantilesCombineFn
-
Creates an approximate quantiles combiner with the given
compareFn and desired number of quantiles.
- Create<T> - Class in com.google.cloud.dataflow.sdk.transforms
-
Create<T> takes a collection of elements of type T
known when the pipeline is constructed and returns a
PCollection<T> containing the elements.
- create() - Static method in class com.google.cloud.dataflow.sdk.transforms.Flatten
-
Deprecated.
- create() - Static method in class com.google.cloud.dataflow.sdk.transforms.GroupByKey
-
Returns a GroupByKey<K, V> PTransform.
- create() - Static method in class com.google.cloud.dataflow.sdk.transforms.join.CoGroupByKey
-
Returns a CoGroupByKey<K> PTransform.
- create() - Static method in class com.google.cloud.dataflow.sdk.transforms.Keys
-
Returns a Keys<K> PTransform.
- create() - Static method in class com.google.cloud.dataflow.sdk.transforms.KvSwap
-
Returns a KvSwap<A, B> PTransform.
- create() - Static method in class com.google.cloud.dataflow.sdk.transforms.RemoveDuplicates
-
Returns a RemoveDuplicates<T> PTransform.
- create() - Static method in class com.google.cloud.dataflow.sdk.transforms.Values
-
Returns a Values<V> PTransform.
- createAccumulator() - Method in class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles.ApproximateQuantilesCombineFn
-
- createAccumulator() - Method in class com.google.cloud.dataflow.sdk.transforms.ApproximateUnique.ApproximateUniqueCombineFn
-
- createAccumulator() - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.CombineFn
-
Returns a new, mutable accumulator value, representing the
accumulation of zero input values.
- createAccumulator(K) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.KeyedCombineFn
-
Returns a new, mutable accumulator value representing the
accumulation of zero input values.
- createAccumulator() - Method in class com.google.cloud.dataflow.sdk.transforms.Mean.MeanFn
-
- createAccumulator() - Method in class com.google.cloud.dataflow.sdk.transforms.Sample.FixedSizedSampleFn
-
- createAccumulator() - Method in class com.google.cloud.dataflow.sdk.transforms.Top.TopCombineFn
-
- createAggregator(String, Combine.CombineFn<? super AI, AA, AO>) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn.Context
-
Returns an aggregator with aggregation logic specified by the CombineFn
argument.
- createAggregator(String, SerializableFunction<Iterable<AI>, AO>) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn.Context
-
Returns an aggregator with aggregation logic specified by the
SerializableFunction argument.
- createBasicReader(PipelineOptions, Coder<DatastoreV1.Entity>, ExecutionContext) - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Source
-
- createBasicReader(PipelineOptions, Coder<T>, ExecutionContext) - Method in class com.google.cloud.dataflow.sdk.io.Source
-
Creates a basic (non-windowed) reader for this source.
- createDatumReader() - Method in class com.google.cloud.dataflow.sdk.coders.AvroCoder
-
Returns a new DatumReader that can be used to read from
an Avro file directly.
- createDatumWriter() - Method in class com.google.cloud.dataflow.sdk.coders.AvroCoder
-
Returns a new DatumWriter that can be used to write to
an Avro file directly.
- createForTest() - Static method in class com.google.cloud.dataflow.sdk.runners.DirectPipeline
-
Creates and returns a new DirectPipeline instance for tests.
- createForTest() - Static method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner
-
Constructs a runner with default properties for testing.
- createFromSystemProperties() - Static method in class com.google.cloud.dataflow.sdk.options.PipelineOptionsFactory
-
Deprecated.
- createInputStream(InputStream) - Method in enum com.google.cloud.dataflow.sdk.io.TextIO.CompressionType
-
- createPrimitiveOutputInternal(WindowFn<?, ?>) - Static method in class com.google.cloud.dataflow.sdk.values.PCollection
-
Creates and returns a new PCollection for a primitive output.
- createWindowedReader(PipelineOptions, Coder<WindowedValue<T>>, ExecutionContext) - Method in class com.google.cloud.dataflow.sdk.io.Source
-
Creates a windowed reader for this source.
- CustomCoder<T> - Class in com.google.cloud.dataflow.sdk.coders
-
An abstract base class for writing Coders that encodes itself via java
serialization.
- CustomCoder() - Constructor for class com.google.cloud.dataflow.sdk.coders.CustomCoder
-
- DataflowAssert - Class in com.google.cloud.dataflow.sdk.testing
-
An assertion on the contents of a
PCollection
incorporated into the pipeline.
- DataflowAssert.AssertRelation<Actual,Expected> - Interface in com.google.cloud.dataflow.sdk.testing
-
A serializable function implementing a binary predicate
between types Actual and Expected.
- DataflowAssert.AssertThat<Actual,Expected> - Class in com.google.cloud.dataflow.sdk.testing
-
- DataflowAssert.IterableAssert<T> - Class in com.google.cloud.dataflow.sdk.testing
-
- DataflowAssert.IterableAssert(PCollectionView<Iterable<T>, ?>) - Constructor for class com.google.cloud.dataflow.sdk.testing.DataflowAssert.IterableAssert
-
- DataflowAssert.SingletonAssert<T> - Class in com.google.cloud.dataflow.sdk.testing
-
An assertion about the single value of type
T
associated with a
PCollectionView.
- DataflowAssert.SingletonAssert(PCollectionView<T, ?>) - Constructor for class com.google.cloud.dataflow.sdk.testing.DataflowAssert.SingletonAssert
-
- DataflowPipeline - Class in com.google.cloud.dataflow.sdk.runners
-
- DataflowPipelineDebugOptions - Interface in com.google.cloud.dataflow.sdk.options
-
Options used for testing and debugging the Dataflow SDK.
- DataflowPipelineDebugOptions.PathValidatorFactory - Class in com.google.cloud.dataflow.sdk.options
-
- DataflowPipelineDebugOptions.PathValidatorFactory() - Constructor for class com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions.PathValidatorFactory
-
- DataflowPipelineDebugOptions.StagerFactory - Class in com.google.cloud.dataflow.sdk.options
-
- DataflowPipelineDebugOptions.StagerFactory() - Constructor for class com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions.StagerFactory
-
- DataflowPipelineJob - Class in com.google.cloud.dataflow.sdk.runners
-
- DataflowPipelineJob(String, String, Dataflow) - Constructor for class com.google.cloud.dataflow.sdk.runners.DataflowPipelineJob
-
Construct the job.
- DataflowPipelineOptions - Interface in com.google.cloud.dataflow.sdk.options
-
- DataflowPipelineOptions.DataflowClientFactory - Class in com.google.cloud.dataflow.sdk.options
-
Returns the default Dataflow client built from the passed in PipelineOptions.
- DataflowPipelineOptions.DataflowClientFactory() - Constructor for class com.google.cloud.dataflow.sdk.options.DataflowPipelineOptions.DataflowClientFactory
-
- DataflowPipelineOptions.JobNameFactory - Class in com.google.cloud.dataflow.sdk.options
-
- DataflowPipelineOptions.JobNameFactory() - Constructor for class com.google.cloud.dataflow.sdk.options.DataflowPipelineOptions.JobNameFactory
-
- DataflowPipelineRegistrar - Class in com.google.cloud.dataflow.sdk.runners
-
- DataflowPipelineRegistrar.Options - Class in com.google.cloud.dataflow.sdk.runners
-
- DataflowPipelineRegistrar.Options() - Constructor for class com.google.cloud.dataflow.sdk.runners.DataflowPipelineRegistrar.Options
-
- DataflowPipelineRegistrar.Runner - Class in com.google.cloud.dataflow.sdk.runners
-
- DataflowPipelineRegistrar.Runner() - Constructor for class com.google.cloud.dataflow.sdk.runners.DataflowPipelineRegistrar.Runner
-
- DataflowPipelineRunner - Class in com.google.cloud.dataflow.sdk.runners
-
A
PipelineRunner that executes the operations in the
pipeline by first translating them to the Dataflow representation
using the
DataflowPipelineTranslator and then submitting
them to a Dataflow service for execution.
- DataflowPipelineRunnerHooks - Class in com.google.cloud.dataflow.sdk.runners
-
An instance of this class can be passed to the
DataflowPipeline runner to add user defined hooks to be
invoked at various times during pipeline execution.
- DataflowPipelineRunnerHooks() - Constructor for class com.google.cloud.dataflow.sdk.runners.DataflowPipelineRunnerHooks
-
- DataflowPipelineTranslator - Class in com.google.cloud.dataflow.sdk.runners
-
DataflowPipelineTranslator knows how to translate Pipeline objects
into Dataflow API Jobs.
- DataflowPipelineTranslator.TransformTranslator<PT extends PTransform> - Interface in com.google.cloud.dataflow.sdk.runners
-
- DataflowPipelineTranslator.TranslationContext - Interface in com.google.cloud.dataflow.sdk.runners
-
- DataflowPipelineWorkerPoolOptions - Interface in com.google.cloud.dataflow.sdk.options
-
Options which are used to configure the Dataflow pipeline worker pool.
- DataflowPipelineWorkerPoolOptions.AutoscalingAlgorithmType - Enum in com.google.cloud.dataflow.sdk.options
-
Type of autoscaling algorithm to use.
- DataflowPipelineWorkerPoolOptions.ClusterManagerApiType - Enum in com.google.cloud.dataflow.sdk.options
-
Type of API for handling cluster management,i.e.
- DataflowPipelineWorkerPoolOptions.TeardownPolicy - Enum in com.google.cloud.dataflow.sdk.options
-
The policy for tearing down the workers spun up by the service.
- DataflowWorkerHarnessOptions - Interface in com.google.cloud.dataflow.sdk.options
-
Options which are used exclusively within the Dataflow worker harness.
- DataflowWorkerLoggingOptions - Interface in com.google.cloud.dataflow.sdk.options
-
Options which are used to control logging configuration on the Dataflow worker.
- DataflowWorkerLoggingOptions.Level - Enum in com.google.cloud.dataflow.sdk.options
-
The set of log levels which can be used on the Dataflow worker.
- DataflowWorkerLoggingOptions.WorkerLogLevelOverride - Class in com.google.cloud.dataflow.sdk.options
-
Defines a log level override for a specific class, package, or name.
- DATASTORE_BATCH_UPDATE_LIMIT - Static variable in class com.google.cloud.dataflow.sdk.io.DatastoreIO
-
Datastore has a limit of 500 mutations per batch operation, so we flush
changes to Datastore every 500 entities.
- DatastoreIO - Class in com.google.cloud.dataflow.sdk.io
-
- DatastoreIO() - Constructor for class com.google.cloud.dataflow.sdk.io.DatastoreIO
-
- DatastoreIO.DatastoreReader - Class in com.google.cloud.dataflow.sdk.io
-
An iterator over the records from a query of the datastore.
- DatastoreIO.DatastoreReader(DatastoreV1.Query, Datastore) - Constructor for class com.google.cloud.dataflow.sdk.io.DatastoreIO.DatastoreReader
-
Returns a DatastoreIterator with query and Datastore object set.
- DatastoreIO.Sink - Class in com.google.cloud.dataflow.sdk.io
-
A
PTransform that writes a
PCollection<Entity> containing
entities to a Datastore kind.
- DatastoreIO.Source - Class in com.google.cloud.dataflow.sdk.io
-
A source that reads the result rows of a Datastore query as Entity objects.
- days(int) - Static method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows
-
Returns a
WindowFn that windows elements into periods measured by days.
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.AvroCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.BigEndianIntegerCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.BigEndianLongCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.ByteArrayCoder
-
- decode(InputStream, Coder.Context) - Method in interface com.google.cloud.dataflow.sdk.coders.Coder
-
Decodes a value of type T from the given input stream in
the given context.
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.DelegateCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.DoubleCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.EntityCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.InstantCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.IterableLikeCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.KvCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.MapCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.SerializableCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.StringUtf8Coder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.TableRowJsonCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.TextualIntegerCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.VarIntCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.VarLongCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.VoidCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult.CoGbkResultCoder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.GlobalWindow.Coder
-
- decode(InputStream, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.values.TimestampedValue.TimestampedValueCoder
-
- decodeToIterable(List<T>) - Method in class com.google.cloud.dataflow.sdk.coders.CollectionCoder
-
- decodeToIterable(List<T>) - Method in class com.google.cloud.dataflow.sdk.coders.IterableCoder
-
- decodeToIterable(List<T>) - Method in class com.google.cloud.dataflow.sdk.coders.IterableLikeCoder
-
Builds an instance of the coder's associated Iterable from a list
of decoded elements.
- decodeToIterable(List<T>) - Method in class com.google.cloud.dataflow.sdk.coders.ListCoder
-
- decodeToIterable(List<T>) - Method in class com.google.cloud.dataflow.sdk.coders.SetCoder
-
- Default - Annotation Type in com.google.cloud.dataflow.sdk.options
-
Default represents a set of annotations which can be used to annotate getter properties
on
PipelineOptions with information representing the default value to be returned
if no value is specified.
- Default.Boolean - Annotation Type in com.google.cloud.dataflow.sdk.options
-
This represents that the default of the option is the specified boolean primitive value.
- Default.Byte - Annotation Type in com.google.cloud.dataflow.sdk.options
-
This represents that the default of the option is the specified byte primitive value.
- Default.Character - Annotation Type in com.google.cloud.dataflow.sdk.options
-
This represents that the default of the option is the specified char primitive value.
- Default.Class - Annotation Type in com.google.cloud.dataflow.sdk.options
-
This represents that the default of the option is the specified Class value.
- Default.Double - Annotation Type in com.google.cloud.dataflow.sdk.options
-
This represents that the default of the option is the specified double primitive value.
- Default.Enum - Annotation Type in com.google.cloud.dataflow.sdk.options
-
This represents that the default of the option is the specified enum.
- Default.Float - Annotation Type in com.google.cloud.dataflow.sdk.options
-
This represents that the default of the option is the specified float primitive value.
- Default.InstanceFactory - Annotation Type in com.google.cloud.dataflow.sdk.options
-
- Default.Integer - Annotation Type in com.google.cloud.dataflow.sdk.options
-
This represents that the default of the option is the specified int primitive value.
- Default.Long - Annotation Type in com.google.cloud.dataflow.sdk.options
-
This represents that the default of the option is the specified long primitive value.
- Default.Short - Annotation Type in com.google.cloud.dataflow.sdk.options
-
This represents that the default of the option is the specified short primitive value.
- Default.String - Annotation Type in com.google.cloud.dataflow.sdk.options
-
This represents that the default of the option is the specified String
value.
- DEFAULT_HOST - Static variable in class com.google.cloud.dataflow.sdk.io.DatastoreIO
-
- DEFAULT_MAX_NUM_ELEMENTS - Static variable in class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles.ApproximateQuantilesCombineFn
-
The cost (in time and space) to compute quantiles to a given
accuracy is a function of the total number of elements in the
data set.
- DEFAULT_TEXT_CODER - Static variable in class com.google.cloud.dataflow.sdk.io.TextIO
-
- DefaultCoder - Annotation Type in com.google.cloud.dataflow.sdk.coders
-
Specifies a default
Coder class to handle encoding and decoding
instances of the annotated class.
- DefaultValueFactory<T> - Interface in com.google.cloud.dataflow.sdk.options
-
- DelegateCoder<T,DT> - Class in com.google.cloud.dataflow.sdk.coders
-
A
DelegateCoder<T, DT> wraps a
Coder and
encodes/decodes values of type
Ts by converting
to/from
DT and then encoding/decoding using the underlying
Coder.
- DelegateCoder(Coder<DT>, DelegateCoder.CodingFunction<T, DT>, DelegateCoder.CodingFunction<DT, T>) - Constructor for class com.google.cloud.dataflow.sdk.coders.DelegateCoder
-
- DelegateCoder.CodingFunction<Input,Output> - Interface in com.google.cloud.dataflow.sdk.coders
-
A CodingFunction<Input, Output> is a serializable function
from Input to Output that
may throw any Exception.
- Description - Annotation Type in com.google.cloud.dataflow.sdk.options
-
- detectClassPathResourcesToStage(ClassLoader) - Static method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineRunner
-
Attempts to detect all the resources the class loader has access to.
- DIRECTORY_CONTAINER - Static variable in class com.google.cloud.dataflow.sdk.io.ShardNameTemplate
-
Shard is a file within a directory.
- DirectPipeline - Class in com.google.cloud.dataflow.sdk.runners
-
- DirectPipelineOptions - Interface in com.google.cloud.dataflow.sdk.options
-
- DirectPipelineRegistrar - Class in com.google.cloud.dataflow.sdk.runners
-
- DirectPipelineRegistrar.Options - Class in com.google.cloud.dataflow.sdk.runners
-
- DirectPipelineRegistrar.Options() - Constructor for class com.google.cloud.dataflow.sdk.runners.DirectPipelineRegistrar.Options
-
- DirectPipelineRegistrar.Runner - Class in com.google.cloud.dataflow.sdk.runners
-
- DirectPipelineRegistrar.Runner() - Constructor for class com.google.cloud.dataflow.sdk.runners.DirectPipelineRegistrar.Runner
-
- DirectPipelineRunner - Class in com.google.cloud.dataflow.sdk.runners
-
Executes the operations in the pipeline directly, in this process, without
any optimization.
- DirectPipelineRunner.EvaluationContext - Interface in com.google.cloud.dataflow.sdk.runners
-
The interface provided to registered callbacks for interacting
with the
DirectPipelineRunner, including reading and writing the
values of
PCollections and
PCollectionViews.
- DirectPipelineRunner.EvaluationResults - Interface in com.google.cloud.dataflow.sdk.runners
-
The interface provided to registered callbacks for interacting
with the
DirectPipelineRunner, including reading and writing the
values of
PCollections and
PCollectionViews.
- DirectPipelineRunner.TestCombineDoFn<K,VI,VA,VO> - Class in com.google.cloud.dataflow.sdk.runners
-
The implementation may split the KeyedCombineFn into ADD, MERGE
and EXTRACT phases (see CombineValuesFn).
- DirectPipelineRunner.TestCombineDoFn(Combine.KeyedCombineFn<? super K, ? super VI, VA, VO>, Coder<VA>, boolean) - Constructor for class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.TestCombineDoFn
-
- DirectPipelineRunner.TransformEvaluator<PT extends PTransform> - Interface in com.google.cloud.dataflow.sdk.runners
-
An evaluator of a PTransform.
- DirectPipelineRunner.ValueWithMetadata<V> - Class in com.google.cloud.dataflow.sdk.runners
-
An immutable (value, timestamp) pair, along with other metadata necessary
for the implementation of DirectPipelineRunner.
- DoFn<I,O> - Class in com.google.cloud.dataflow.sdk.transforms
-
The argument to
ParDo providing the code to use to process
elements of the input
PCollection.
- DoFn() - Constructor for class com.google.cloud.dataflow.sdk.transforms.DoFn
-
- DoFn.Context - Class in com.google.cloud.dataflow.sdk.transforms
-
Information accessible to all methods in this DoFn.
- DoFn.Context() - Constructor for class com.google.cloud.dataflow.sdk.transforms.DoFn.Context
-
- DoFn.KeyedState - Interface in com.google.cloud.dataflow.sdk.transforms
-
Interface for interacting with keyed state.
- DoFn.ProcessContext - Class in com.google.cloud.dataflow.sdk.transforms
-
- DoFn.ProcessContext() - Constructor for class com.google.cloud.dataflow.sdk.transforms.DoFn.ProcessContext
-
- DoFn.RequiresKeyedState - Interface in com.google.cloud.dataflow.sdk.transforms
-
- DoFnTester<I,O> - Class in com.google.cloud.dataflow.sdk.transforms
-
A harness for unit-testing a
DoFn.
- DoubleCoder - Class in com.google.cloud.dataflow.sdk.coders
-
A DoubleCoder encodes Doubles in 8 bytes.
- doublesGlobally() - Static method in class com.google.cloud.dataflow.sdk.transforms.Max
-
Returns a PTransform that takes an input
PCollection<Double> and returns a
PCollection<Double> whose contents is the maximum of the
input PCollection's elements, or
Double.MIN_VALUE if there are no elements.
- doublesGlobally() - Static method in class com.google.cloud.dataflow.sdk.transforms.Min
-
Returns a PTransform that takes an input
PCollection<Double> and returns a
PCollection<Double> whose contents is the minimum of the
input PCollection's elements, or
Double.MAX_VALUE if there are no elements.
- doublesGlobally() - Static method in class com.google.cloud.dataflow.sdk.transforms.Sum
-
Returns a PTransform that takes an input
PCollection<Double> and returns a
PCollection<Double> whose contents is the sum of the
input PCollection's elements, or
0 if there are no elements.
- doublesPerKey() - Static method in class com.google.cloud.dataflow.sdk.transforms.Max
-
Returns a PTransform that takes an input
PCollection<KV<K, Double>> and returns a
PCollection<KV<K, Double>> that contains an output
element mapping each distinct key in the input
PCollection to the maximum of the values associated with
that key in the input PCollection.
- doublesPerKey() - Static method in class com.google.cloud.dataflow.sdk.transforms.Min
-
Returns a PTransform that takes an input
PCollection<KV<K, Double>> and returns a
PCollection<KV<K, Double>> that contains an output
element mapping each distinct key in the input
PCollection to the minimum of the values associated with
that key in the input PCollection.
- doublesPerKey() - Static method in class com.google.cloud.dataflow.sdk.transforms.Sum
-
Returns a PTransform that takes an input
PCollection<KV<K, Double>> and returns a
PCollection<KV<K, Double>> that contains an output
element mapping each distinct key in the input
PCollection to the sum of the values associated with
that key in the input PCollection.
- dropLateData(boolean) - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
Returns a new TextIO.Read PTransform that's like this one but with the specified
setting for dropLateData.
- dropLateData(boolean) - Static method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read
-
If true, then late-arriving data from this source will be dropped.
- GcpOptions - Interface in com.google.cloud.dataflow.sdk.options
-
Options used to configure Google Cloud Platform project and credentials.
- GcpOptions.CredentialDirFactory - Class in com.google.cloud.dataflow.sdk.options
-
Returns the default credential directory of ${user.home}/.store/data-flow.
- GcpOptions.CredentialDirFactory() - Constructor for class com.google.cloud.dataflow.sdk.options.GcpOptions.CredentialDirFactory
-
- GcpOptions.GcpUserCredentialsFactory - Class in com.google.cloud.dataflow.sdk.options
-
Attempts to load the GCP credentials.
- GcpOptions.GcpUserCredentialsFactory() - Constructor for class com.google.cloud.dataflow.sdk.options.GcpOptions.GcpUserCredentialsFactory
-
- GcsOptions - Interface in com.google.cloud.dataflow.sdk.options
-
Options used to configure Google Cloud Storage.
- GcsOptions.ExecutorServiceFactory - Class in com.google.cloud.dataflow.sdk.options
-
Returns the default ExecutorService to use within the Dataflow SDK.
- GcsOptions.ExecutorServiceFactory() - Constructor for class com.google.cloud.dataflow.sdk.options.GcsOptions.ExecutorServiceFactory
-
- get(CodedTupleTag<T>) - Method in class com.google.cloud.dataflow.sdk.values.CodedTupleTagMap
-
Returns the value associated with the given tag in this
CodedTupleTagMap, or null if the tag has no
asssociated value.
- get(int) - Method in class com.google.cloud.dataflow.sdk.values.PCollectionList
-
Returns the PCollection at the given index (origin zero).
- get(TupleTag<T>) - Method in class com.google.cloud.dataflow.sdk.values.PCollectionTuple
-
Returns the PCollection with the given tag in this
PCollectionTuple.
- get(int) - Method in class com.google.cloud.dataflow.sdk.values.TupleTagList
-
Returns the TupleTag at the given index (origin zero).
- getAccumulatorCoder(CoderRegistry, Coder<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles.ApproximateQuantilesCombineFn
-
- getAccumulatorCoder(CoderRegistry, Coder<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.ApproximateUnique.ApproximateUniqueCombineFn
-
- getAccumulatorCoder(CoderRegistry, Coder<VI>) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.CombineFn
-
Returns the Coder to use for accumulator VA
values, or null if it is not able to be inferred.
- getAccumulatorCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.GroupedValues
-
- getAccumulatorCoder(CoderRegistry, Coder<K>, Coder<VI>) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.KeyedCombineFn
-
Returns the Coder to use for accumulator VA
values, or null if it is not able to be inferred.
- getAccumulatorCoder(CoderRegistry, Coder<N>) - Method in class com.google.cloud.dataflow.sdk.transforms.Mean.MeanFn
-
- getAccumulatorCoder(CoderRegistry, Coder<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.Sample.FixedSizedSampleFn
-
- getAccumulatorCoder(CoderRegistry, Coder<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.Top.TopCombineFn
-
- getAddCounterMutator() - Method in interface com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.EvaluationContext
-
Returns a mutator, which can be used to add additional counters to
this EvaluationContext.
- getAlgorithm() - Method in enum com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions.AutoscalingAlgorithmType
-
Returns the string representation of this type.
- getAll(TupleTag<V>) - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult
-
Returns the values from the table represented by the given
TupleTag<V> as an Iterable<V> (which may be empty if there
are no results).
- getAll() - Method in class com.google.cloud.dataflow.sdk.values.PCollectionList
-
Returns an immutable List of all the PCollections in this PCollectionList.
- getAll() - Method in class com.google.cloud.dataflow.sdk.values.PCollectionTuple
-
Returns an immutable Map from TupleTag to corresponding
PCollection, for all the members of this PCollectionTuple.
- getAll() - Method in class com.google.cloud.dataflow.sdk.values.TupleTagList
-
Returns an immutable List of all the TupleTags in this TupleTagList.
- getAllowedTimestampSkew() - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn
-
- getApiRootUrl() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
The API endpoint to use when communicating with the Dataflow service.
- getApiServiceName() - Method in enum com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions.ClusterManagerApiType
-
- getAppName() - Method in interface com.google.cloud.dataflow.sdk.options.ApplicationNameOptions
-
Name of application, for display purposes.
- getAutoscalingAlgorithm() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- getCause() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.InvalidWindows
-
Returns the reason that this WindowFn is invalid.
- getCdbgVersion() - Method in interface com.google.cloud.dataflow.sdk.options.CloudDebuggerOptions
-
User defined application version.
- getClusterManagerApi() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- getCoder() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
- getCoder() - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.IterableAssert
-
Gets the coder, which may yet be absent.
- getCoder() - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.SingletonAssert
-
Gets the coder, which may yet be absent.
- getCoder() - Static method in class com.google.cloud.dataflow.sdk.transforms.windowing.IntervalWindow
-
Returns a Coder suitable for encoding IntervalWindows.
- getCoder() - Method in class com.google.cloud.dataflow.sdk.values.CodedTupleTag
-
Returns the Coder used for values associated with this tag.
- getCoder() - Method in class com.google.cloud.dataflow.sdk.values.PCollection
-
Returns the Coder used by this PCollection to encode and decode
the values stored in it.
- getCoder() - Method in class com.google.cloud.dataflow.sdk.values.TypedPValue
-
Returns the Coder used by this TypedPValue to encode and decode
the values stored in it.
- getCoderArguments() - Method in class com.google.cloud.dataflow.sdk.coders.AtomicCoder
-
- getCoderArguments() - Method in class com.google.cloud.dataflow.sdk.coders.AvroCoder
-
- getCoderArguments() - Method in interface com.google.cloud.dataflow.sdk.coders.Coder
-
If this is a Coder for a parameterized type, returns the
list of Coders being used for each of the parameters, or
returns null if this cannot be done or this is not a
parameterized type.
- getCoderArguments() - Method in class com.google.cloud.dataflow.sdk.coders.IterableLikeCoder
-
- getCoderArguments() - Method in class com.google.cloud.dataflow.sdk.coders.KvCoder
-
- getCoderArguments() - Method in class com.google.cloud.dataflow.sdk.coders.MapCoder
-
- getCoderArguments() - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult.CoGbkResultCoder
-
- getCoderArguments() - Method in class com.google.cloud.dataflow.sdk.values.TimestampedValue.TimestampedValueCoder
-
- getCoderRegistry() - Method in class com.google.cloud.dataflow.sdk.Pipeline
-
- getCoderRegistry() - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Deprecated.
use pipeline.getCoderRegistry()
- getCoGbkResultSchema() - Method in class com.google.cloud.dataflow.sdk.transforms.join.KeyedPCollectionTuple
-
Returns the CoGbkResultSchema associated with this
KeyedPCollections.
- getCollection() - Method in class com.google.cloud.dataflow.sdk.transforms.join.KeyedPCollectionTuple.TaggedKeyedPCollection
-
Returns the underlying PCollection of this TaggedKeyedPCollection.
- getComponents() - Method in class com.google.cloud.dataflow.sdk.coders.StandardCoder
-
Returns the list of Coders that are components of this
Coder.
- getComponents() - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult.CoGbkResultCoder
-
- getCompressionType() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Read.Bound
-
- getCreateDisposition() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
Returns the create disposition.
- getCredentialDir() - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
Directory for storing dataflow credentials.
- getCredentialFactoryClass() - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
- getCredentialId() - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
- getCurrent() - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.DatastoreReader
-
- getCurrent() - Method in interface com.google.cloud.dataflow.sdk.io.Source.Reader
-
- getCurrent() - Method in class com.google.cloud.dataflow.sdk.runners.TransformHierarchy
-
Returns the last TransformTreeNode on the stack.
- getDataflowClient() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineOptions
-
Alternative Dataflow client.
- getDataflowClient() - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineJob
-
- getDataflowEndpoint() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
Dataflow endpoint to use.
- getDataflowJobFile() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
The path to write the translated Dataflow specification out to
at job submission time.
- getDefaultCoder(TypeToken<T>) - Method in class com.google.cloud.dataflow.sdk.coders.CoderRegistry
-
Returns the Coder to use by default for values of the given type,
or null if there is no default Coder.
- getDefaultCoder(TypeToken<O>, TypeToken<I>, Coder<I>) - Method in class com.google.cloud.dataflow.sdk.coders.CoderRegistry
-
Returns the Coder to use by default for values of the given type,
where the given context type uses the given context coder,
or null if there is no default Coder.
- getDefaultCoder(Class<? extends T>, Class<T>, Coder<?>...) - Method in class com.google.cloud.dataflow.sdk.coders.CoderRegistry
-
Returns the Coder to use for the last type parameter specialization
of the subclass given Coders to use for all other type parameters
specializations (if any).
- getDefaultCoder(Class<? extends T>, Class<T>, Map<String, ? extends Coder<?>>, String) - Method in class com.google.cloud.dataflow.sdk.coders.CoderRegistry
-
Returns the Coder to use for the specified type parameter specialization
of the subclass, given Coders to use for all other type parameters
(if any).
- getDefaultCoder(T) - Method in class com.google.cloud.dataflow.sdk.coders.CoderRegistry
-
Returns the Coder to use for the provided example value, if it can
be determined, otherwise returns null.
- getDefaultCoders(Class<? extends T>, Class<T>, Map<String, ? extends Coder<?>>) - Method in class com.google.cloud.dataflow.sdk.coders.CoderRegistry
-
Returns a Map from each of baseClass's type parameters to the Coder to
use by default for it, in the context of subClass's specialization of
baseClass.
- getDefaultCoders(Class<? extends T>, Class<T>, Coder<?>[]) - Method in class com.google.cloud.dataflow.sdk.coders.CoderRegistry
-
Returns an array listing, for each of baseClass's type parameters, the
Coder to use by default for it, in the context of subClass's specialization
of baseClass.
- getDefaultName() - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Bound
-
- getDefaultName() - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.BoundMulti
-
- getDefaultName() - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Returns the name to use by default for this PTransform
(not including the names of any enclosing PTransforms).
- getDefaultOutputCoder(SerializableFunction<I, O>, Coder<I>) - Method in class com.google.cloud.dataflow.sdk.coders.CoderRegistry
-
Returns the Coder to use on elements produced by this function, given
the coder used for its input elements.
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read.Bound
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Read.Bound
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Sink
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Source
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Write.Bound
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.io.ReadSource.Bound
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.io.Source
-
Returns the default Coder to use for the data read from this source.
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Read.Bound
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
- getDefaultOutputCoder(CoderRegistry, Coder<VI>) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.CombineFn
-
Returns the Coder to use by default for output
VO values, or null if it is not able to be inferred.
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.GroupedValues
-
- getDefaultOutputCoder(CoderRegistry, Coder<K>, Coder<VI>) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.KeyedCombineFn
-
Returns the Coder to use by default for output
VO values, or null if it is not able to be inferred.
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.Create
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.Flatten.FlattenPCollectionList
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.GroupByKey.GroupByKeyOnly
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Bound
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.BoundMulti
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Returns the default Coder to use for the output of this
single-output PTransform, or null if
none can be inferred.
- getDefaultOutputCoder(TypedPValue<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Returns the default Coder to use for the given output of
this single-output PTransform, or null
if none can be inferred.
- getDefaultOutputCoder(CoderRegistry, Coder<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.Sample.FixedSizedSampleFn
-
- getDefaultOutputCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.Window.Bound
-
- getDefaultWorkerLogLevel() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowWorkerLoggingOptions
-
This option controls the default log level of all loggers without a
log level override.
- getDiskSizeGb() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
Remote worker disk size, in gigabytes, or 0 to use the default size.
- getDiskSourceImage() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
Disk source image to use by VMs for jobs.
- getDropLateData() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
- getDropLateDataExplicit() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
- getElemCoder() - Method in class com.google.cloud.dataflow.sdk.coders.IterableLikeCoder
-
- getElements() - Method in class com.google.cloud.dataflow.sdk.transforms.Create
-
- getEnclosingNode() - Method in class com.google.cloud.dataflow.sdk.runners.TransformTreeNode
-
Returns the enclosing composite transform node, or null if there is none.
- getEncodedElementByteSize(Integer, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.BigEndianIntegerCoder
-
- getEncodedElementByteSize(Long, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.BigEndianLongCoder
-
- getEncodedElementByteSize(byte[], Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.ByteArrayCoder
-
- getEncodedElementByteSize(Double, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.DoubleCoder
-
- getEncodedElementByteSize(DatastoreV1.Entity, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.EntityCoder
-
- getEncodedElementByteSize(T, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.StandardCoder
-
Returns the size in bytes of the encoded value using this
coder.
- getEncodedElementByteSize(String, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.StringUtf8Coder
-
- getEncodedElementByteSize(TableRow, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.TableRowJsonCoder
-
- getEncodedElementByteSize(Integer, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.TextualIntegerCoder
-
- getEncodedElementByteSize(Integer, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.VarIntCoder
-
- getEncodedElementByteSize(Long, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.VarLongCoder
-
- getEncodedElementByteSize(Void, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.VoidCoder
-
- getEstimatedSizeBytes(PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Source
-
- getEstimatedSizeBytes(PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.io.Source
-
An estimate of the total size (in bytes) of the data that would be read from this source.
- getExecutorService() - Method in interface com.google.cloud.dataflow.sdk.options.GcsOptions
-
- getExpandedOutputs() - Method in class com.google.cloud.dataflow.sdk.runners.TransformTreeNode
-
Returns the transform outputs, in expanded form.
- getExperiments() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
The list of backend experiments to enable.
- getFilenamePrefix() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
- getFilenamePrefix() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
- getFilenameSuffix() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
- getFilenameSuffix() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
- getFilepattern() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read.Bound
-
- getFilepattern() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Read.Bound
-
- getFilesToStage() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
List of local files to make available to workers.
- getFixedSizeCoder(Duration) - Static method in class com.google.cloud.dataflow.sdk.transforms.windowing.IntervalWindow
-
Returns a Coder for encoding interval windows of fixed size (which
is more efficient than
IntervalWindow.getCoder() as it only needs to encode
one endpoint).
- getFn() - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.GroupedValues
-
Returns the KeyedCombineFn used by this Combine operation.
- getFn() - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.PerKey
-
Returns the KeyedCombineFn used by this Combine operation.
- getFn() - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Bound
-
- getFn() - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.BoundMulti
-
- getFullName(PTransform<?, ?>) - Method in class com.google.cloud.dataflow.sdk.Pipeline
-
Returns the fully qualified name of a transform.
- getFullName() - Method in class com.google.cloud.dataflow.sdk.runners.TransformTreeNode
-
- getGCloudPath() - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
- getGcpCredential() - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
Alternative Google Cloud Platform Credential.
- getGcsUtil() - Method in interface com.google.cloud.dataflow.sdk.options.GcsOptions
-
Alternative GcsUtil instance.
- getGoogleApiTrace() - Method in interface com.google.cloud.dataflow.sdk.options.GoogleApiDebugOptions
-
This option enables tracing of API calls to Google services used within the Dataflow SDK.
- getId() - Method in class com.google.cloud.dataflow.sdk.values.TupleTag
-
Returns the id of this TupleTag.
- getIdLabel() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
- getIdLabel() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Write.Bound
-
- getIndex(TupleTag<?>) - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResultSchema
-
Returns the index for the given tuple tag, if the tag is present in this
schema, -1 if it isn't.
- getInput(PTransform<?, ?>) - Method in class com.google.cloud.dataflow.sdk.Pipeline
-
Returns the input associated with a transform.
- getInput() - Method in class com.google.cloud.dataflow.sdk.runners.TransformTreeNode
-
Returns the transform input, in unexpanded form.
- getInput() - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Returns the input of this transform.
- getInputs() - Method in class com.google.cloud.dataflow.sdk.runners.TransformTreeNode
-
Returns a mapping of inputs to the producing nodes for all inputs to
the transform.
- getInstanceComponents(T) - Static method in class com.google.cloud.dataflow.sdk.coders.AtomicCoder
-
Returns a list of values contained in the provided example
value, one per type parameter.
- getInstanceComponents(Object) - Method in class com.google.cloud.dataflow.sdk.coders.CoderRegistry.CoderFactory
-
Returns a list of objects contained in value, one per
type argument, or null if none can be determined.
- getInstanceComponents(Object) - Method in class com.google.cloud.dataflow.sdk.coders.CoderRegistry.ConstantCoderFactory
-
- getInstanceComponents(Collection<T>) - Static method in class com.google.cloud.dataflow.sdk.coders.CollectionCoder
-
Returns the first element in this collection if it is non-empty,
otherwise returns null.
- getInstanceComponents(Iterable<T>) - Static method in class com.google.cloud.dataflow.sdk.coders.IterableCoder
-
Returns the first element in this iterable if it is non-empty,
otherwise returns null.
- getInstanceComponents(KV<K, V>) - Static method in class com.google.cloud.dataflow.sdk.coders.KvCoder
-
- getInstanceComponents(List<T>) - Static method in class com.google.cloud.dataflow.sdk.coders.ListCoder
-
Returns the first element in this list if it is non-empty,
otherwise returns null.
- getInstanceComponents(Map<K, V>) - Static method in class com.google.cloud.dataflow.sdk.coders.MapCoder
-
Returns the key and value for an arbitrary element of this map,
if it is non-empty, otherwise returns null.
- getInstanceComponents(Set<T>) - Static method in class com.google.cloud.dataflow.sdk.coders.SetCoder
-
Returns the first element in this set if it is non-empty,
otherwise returns null.
- getInstanceComponents(TimestampedValue<T>) - Static method in class com.google.cloud.dataflow.sdk.values.TimestampedValue.TimestampedValueCoder
-
- getInstanceComponentsHelper(IT) - Static method in class com.google.cloud.dataflow.sdk.coders.IterableLikeCoder
-
Returns the first element in this iterable-like if it is non-empty,
otherwise returns null.
- getJobId() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowWorkerHarnessOptions
-
ID of the job this pipeline represents.
- getJobId() - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineJob
-
- getJobMessageOutput() - Method in interface com.google.cloud.dataflow.sdk.options.BlockingDataflowPipelineOptions
-
Output stream for job status messages.
- getJobName() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineOptions
-
The job name is used as an idempotence key within the Dataflow service.
- getJobState() - Method in class com.google.cloud.dataflow.sdk.runners.BlockingDataflowPipelineRunner.PipelineJobState
-
- getKey() - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.ValueWithMetadata
-
Returns the key associated with this element.
- getKey() - Method in class com.google.cloud.dataflow.sdk.values.KV
-
Returns the key of this KV.
- getKeyCoder() - Method in class com.google.cloud.dataflow.sdk.coders.KvCoder
-
- getKeyCoder() - Method in class com.google.cloud.dataflow.sdk.coders.MapCoder
-
- getKeyCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.join.KeyedPCollectionTuple
-
Returns the KeyCoder for all PCollections in this KeyedPCollections.
- getKeyedCollections() - Method in class com.google.cloud.dataflow.sdk.transforms.join.KeyedPCollectionTuple
-
Returns a list of TaggedKeyedPCollections for the PCollections contained in
this KeyedPCollections<K>.
- getKindString() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read.Bound
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Read.Bound
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Sink
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Write.Bound
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Read.Bound
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.Globally
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.PerKey
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.PerKeyWithHotKeys
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.transforms.Count.PerElement
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Bound
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.BoundMulti
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Returns a string describing what kind of PTransform this is.
- getKindString() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.Window.Bound
-
- getKindString() - Method in class com.google.cloud.dataflow.sdk.values.PValueBase
-
Returns a String capturing the kind of this
PValueBase.
- getLevel() - Method in class com.google.cloud.dataflow.sdk.options.DataflowWorkerLoggingOptions.WorkerLogLevelOverride
-
- getMainOutputTag() - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.BoundMulti
-
- getMaxNumWorkers() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
Max number of workers to use when using workerpool autoscaling.
- getMessage() - Method in class com.google.cloud.dataflow.sdk.coders.Coder.NonDeterministicException
-
- getName() - Method in class com.google.cloud.dataflow.sdk.options.DataflowWorkerLoggingOptions.WorkerLogLevelOverride
-
- getName() - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Returns the transform name.
- getName() - Method in class com.google.cloud.dataflow.sdk.values.PCollection
-
Returns the name of this PCollection.
- getName() - Method in interface com.google.cloud.dataflow.sdk.values.PValue
-
- getName() - Method in class com.google.cloud.dataflow.sdk.values.PValueBase
-
Returns the name of this PValueBase.
- getNode(PTransform<?, ?>) - Method in class com.google.cloud.dataflow.sdk.runners.TransformHierarchy
-
Returns the TransformTreeNode associated with a given transform.
- getNumShards() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
- getNumShards() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
- getNumWorkers() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
Number of workers to use in remote execution.
- getOnly(TupleTag<V>) - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult
-
If there is a singleton value for the given tag, returns it.
- getOnly(TupleTag<V>, V) - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult
-
If there is a singleton value for the given tag, returns it.
- getOptions() - Method in class com.google.cloud.dataflow.sdk.Pipeline
-
Returns the configured pipeline options.
- getOriginalWindowFn() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.InvalidWindows
-
Returns the original windowFn that this InvalidWindows replaced.
- getOutName(int) - Method in class com.google.cloud.dataflow.sdk.values.TupleTag
-
If this TupleTag is tagging output outputIndex of
a PTransform, returns the name that should be used by
default for the output.
- getOutput(PTransform<?, ?>) - Method in class com.google.cloud.dataflow.sdk.Pipeline
-
Returns the output associated with a transform.
- getOutput() - Method in class com.google.cloud.dataflow.sdk.runners.TransformTreeNode
-
Returns the transform output, in unexpanded form.
- getOutput() - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Returns the output of this transform.
- getPathValidator() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
The validator class used to validate path names.
- getPathValidatorClass() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
The name of the validator class used to validate path names.
- getPCollection(PCollection<T>) - Method in interface com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.EvaluationResults
-
Retrieves the value of the given PCollection.
- getPCollectionList(PCollectionList<T>) - Method in interface com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.EvaluationResults
-
Retrieves the values of each PCollection in the given
PCollectionList.
- getPCollectionValuesWithMetadata(PCollection<T>) - Method in interface com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.EvaluationContext
-
Retrieves the value of the given PCollection, along with element metadata
such as timestamps and windows.
- getPCollectionView(PCollectionView<T, WT>) - Method in interface com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.EvaluationResults
-
- getPCollectionWindowedValues(PCollection<T>) - Method in interface com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.EvaluationResults
-
Retrieves the windowed value of the given PCollection.
- getPipeline() - Method in class com.google.cloud.dataflow.sdk.transforms.join.KeyedPCollectionTuple
-
- getPipeline() - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Deprecated.
- getPipeline() - Method in class com.google.cloud.dataflow.sdk.values.PBegin
-
- getPipeline() - Method in class com.google.cloud.dataflow.sdk.values.PCollectionList
-
- getPipeline() - Method in class com.google.cloud.dataflow.sdk.values.PCollectionTuple
-
- getPipeline() - Method in interface com.google.cloud.dataflow.sdk.values.PInput
-
Returns the owning Pipeline of this PInput.
- getPipeline() - Method in class com.google.cloud.dataflow.sdk.values.PValueBase
-
Returns the owning Pipeline of this PValueBase.
- getPipelineOptions() - Method in interface com.google.cloud.dataflow.sdk.options.PipelineOptionsRegistrar
-
- getPipelineOptions() - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineRegistrar.Options
-
- getPipelineOptions() - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TranslationContext
-
Returns the configured pipeline options.
- getPipelineOptions() - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRegistrar.Options
-
- getPipelineOptions() - Method in interface com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.EvaluationContext
-
Returns the configured pipeline options.
- getPipelineOptions() - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner
-
- getPipelineOptions() - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn.Context
-
Returns the
PipelineOptions specified with the
PipelineRunner
invoking this
DoFn.
- getPipelineRunners() - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineRegistrar.Runner
-
- getPipelineRunners() - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRegistrar.Runner
-
- getPipelineRunners() - Method in interface com.google.cloud.dataflow.sdk.runners.PipelineRunnerRegistrar
-
- getProducingTransformInternal() - Method in class com.google.cloud.dataflow.sdk.values.POutputValueBase
-
Returns the PTransform that this POutputValueBase
is an output of.
- getProducingTransformInternal() - Method in interface com.google.cloud.dataflow.sdk.values.PValue
-
Returns the PTransform that this PValue is an output of.
- getProject() - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
Project id to use when launching jobs.
- getProjectId() - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineJob
-
- getReasons() - Method in class com.google.cloud.dataflow.sdk.coders.Coder.NonDeterministicException
-
- getRecordType() - Method in class com.google.cloud.dataflow.sdk.coders.SerializableCoder
-
- getRegisteredOptions() - Static method in class com.google.cloud.dataflow.sdk.options.PipelineOptionsFactory
-
- getRunner() - Method in interface com.google.cloud.dataflow.sdk.options.PipelineOptions
-
- getRunner() - Method in class com.google.cloud.dataflow.sdk.Pipeline
-
Returns the configured pipeline runner.
- getRunner() - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipeline
-
- getRunner() - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipeline
-
- getSchema() - Method in class com.google.cloud.dataflow.sdk.coders.AvroCoder
-
Returns the schema used by this coder.
- getSchema() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read.Bound
-
- getSchema() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
- getSchema() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
Returns the table schema.
- getSchema() - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult
-
Returns the schema used by this CoGbkResult.
- getSecretsFile() - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
This option controls which file to use when attempting to create the credentials using the
OAuth 2 webflow.
- getServiceAccountKeyfile() - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
This option controls which file to use when attempting to create the credentials using the
service account method.
- getServiceAccountName() - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
This option controls which service account to use when attempting to create the credentials
using the service account method.
- getShardNameTemplate() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
Returns the current shard name template string.
- getShardNameTemplate() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
Returns the current shard name template string.
- getShardTemplate() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
- getShardTemplate() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
- getSideInputs() - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Bound
-
- getSideInputs() - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.BoundMulti
-
- getSource() - Method in class com.google.cloud.dataflow.sdk.io.ReadSource.Bound
-
Returns the Source used to create this ReadSource PTransform.
- getStager() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
The stager instance used to stage files.
- getStagerClass() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
The class used to stage files.
- getStagingLocation() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineOptions
-
GCS path for staging local files.
- getStepName(PTransform<?, ?>) - Method in interface com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.EvaluationContext
-
Gets the step name for this transform.
- getSubscription() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
- getTable() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Read.Bound
-
Returns the table to write.
- getTable() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
Returns the table reference.
- getTag(int) - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResultSchema
-
Returns the JoinTupleTag at the given index.
- getTagInternal() - Method in interface com.google.cloud.dataflow.sdk.values.PCollectionView
-
A unique identifier, for internal use.
- getTeardownPolicy() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
Teardown policy for the VMs.
- getTeardownPolicyName() - Method in enum com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions.TeardownPolicy
-
- getTempDatasetId() - Method in interface com.google.cloud.dataflow.sdk.options.BigQueryOptions
-
- getTempLocation() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineOptions
-
GCS path for temporary files.
- getTimestamp() - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.ValueWithMetadata
-
Returns the timestamp associated with this element.
- getTimestamp() - Method in class com.google.cloud.dataflow.sdk.values.TimestampedValue
-
- getTimestampLabel() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
- getTimestampLabel() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Write.Bound
-
- getTopic() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
- getTopic() - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Write.Bound
-
- getTransform() - Method in class com.google.cloud.dataflow.sdk.runners.TransformTreeNode
-
Returns the transform associated with this transform node.
- getTransformEvaluator(Class<PT>) - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner
-
Returns the TransformEvaluator to use for instances of the
specified PTransform class, or null if none registered.
- getTransformTranslator(Class<PT>) - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator
-
Returns the TransformTranslator to use for instances of the
specified PTransform class, or null if none registered.
- getTranslator() - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineRunner
-
Returns the DataflowPipelineTranslator associated with this object.
- getTupleTag() - Method in class com.google.cloud.dataflow.sdk.transforms.join.KeyedPCollectionTuple.TaggedKeyedPCollection
-
Returns the TupleTag of this TaggedKeyedPCollection.
- getTupleTagList() - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResultSchema
-
Returns the TupleTagList tuple associated with this schema.
- getType() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
- getTypeToken() - Method in class com.google.cloud.dataflow.sdk.values.TupleTag
-
Returns a TypeToken capturing what is known statically
about the type of this TupleTag instance's most-derived
class.
- getTypeToken() - Method in class com.google.cloud.dataflow.sdk.values.TypedPValue
-
Returns a TypeToken<T> with some reflective information
about T, if possible.
- getUnionTag() - Method in class com.google.cloud.dataflow.sdk.transforms.join.RawUnionValue
-
- getValidate() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Read.Bound
-
Returns true if table validation is enabled.
- getValidate() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
Returns true if table validation is enabled.
- getValue() - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.ValueWithMetadata
-
Returns the value associated with this element.
- getValue() - Method in class com.google.cloud.dataflow.sdk.transforms.join.RawUnionValue
-
- getValue() - Method in class com.google.cloud.dataflow.sdk.values.KV
-
Returns the value of this KV.
- getValue() - Method in class com.google.cloud.dataflow.sdk.values.TimestampedValue
-
- getValueCoder() - Method in class com.google.cloud.dataflow.sdk.coders.KvCoder
-
- getValueCoder() - Method in class com.google.cloud.dataflow.sdk.coders.MapCoder
-
- getVersion() - Method in class com.google.cloud.dataflow.sdk.options.CloudDebuggerOptions.DebuggerConfig
-
- getWindowedValue() - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.ValueWithMetadata
-
Returns the WindowedValue associated with this element.
- getWindowFn() - Method in class com.google.cloud.dataflow.sdk.values.PCollection
-
Returns the
WindowFn of this
PCollection.
- getWindows() - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.ValueWithMetadata
-
Returns the collection of windows this element has been placed into.
- getWorkerId() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowWorkerHarnessOptions
-
ID of the worker running this pipeline.
- getWorkerLogLevelOverrides() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowWorkerLoggingOptions
-
This option controls the log levels for specifically named loggers.
- getWorkerMachineType() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
Machine type to create worker VMs as.
- getWriteDisposition() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
Returns the write disposition.
- getZone() - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- globally(int, C) - Static method in class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles
-
Returns a PTransform that takes a PCollection<T>
and returns a PCollection<List<T>> whose single value is a
List of the approximate N-tiles of the elements
of the input PCollection.
- globally(int) - Static method in class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles
-
- globally(int) - Static method in class com.google.cloud.dataflow.sdk.transforms.ApproximateUnique
-
Returns a PTransform that takes a PCollection<T>
and returns a PCollection<Long> containing a single value
that is an estimate of the number of distinct elements in the
input PCollection.
- globally(double) - Static method in class com.google.cloud.dataflow.sdk.transforms.ApproximateUnique
-
- globally(SerializableFunction<Iterable<V>, V>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Combine
-
Returns a
Combine.Globally PTransform
that uses the given
SerializableFunction to combine all
the elements of the input
PCollection into a singleton
PCollection value.
- globally(Combine.CombineFn<? super VI, ?, VO>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Combine
-
Returns a
Combine.Globally PTransform
that uses the given
CombineFn to combine all the elements
of the input
PCollection into a singleton
PCollection
value.
- globally() - Static method in class com.google.cloud.dataflow.sdk.transforms.Count
-
- globally() - Static method in class com.google.cloud.dataflow.sdk.transforms.Mean
-
Returns a PTransform that takes an input
PCollection<N> and returns a
PCollection<Double> whose contents is the mean of the
input PCollection's elements, or
0 if there are no elements.
- GlobalWindow - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
The default window into which all data is placed (via
GlobalWindows).
- GlobalWindow.Coder - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
Coder for encoding and decoding
Windows.
- GlobalWindows - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
Default
WindowFn where all data is in the same bucket.
- GlobalWindows() - Constructor for class com.google.cloud.dataflow.sdk.transforms.windowing.GlobalWindows
-
- GoogleApiDebugOptions - Interface in com.google.cloud.dataflow.sdk.options
-
These options configure debug settings for Google API clients generated by the Dataflow SDK.
- GoogleApiDebugOptions.GoogleApiTracer - Class in com.google.cloud.dataflow.sdk.options
-
A GoogleClientRequestInitializer which adds the 'trace' token to Google API calls.
- greaterThan(T) - Static method in class com.google.cloud.dataflow.sdk.transforms.Filter
-
Returns a PTransform that takes an input
PCollection<T> and returns a PCollection<T> with
elements that are greater than a given value, based on the
elements' natural ordering.
- greaterThanEq(T) - Static method in class com.google.cloud.dataflow.sdk.transforms.Filter
-
Returns a PTransform that takes an input
PCollection<T> and returns a PCollection<T> with
elements that are greater than or equal to a given value, based on
the elements' natural ordering.
- GroupByKey<K,V> - Class in com.google.cloud.dataflow.sdk.transforms
-
GroupByKey<K, V> takes a PCollection<KV<K, V>>,
groups the values by key and windows, and returns a
PCollection<KV<K, Iterable<V>>> representing a map from
each distinct key and window of the input PCollection to an
Iterable over all the values associated with that key in
the input.
- GroupByKey() - Constructor for class com.google.cloud.dataflow.sdk.transforms.GroupByKey
-
- GroupByKey.GroupAlsoByWindow<K,V> - Class in com.google.cloud.dataflow.sdk.transforms
-
Helper transform that takes a collection of timestamp-ordered
values associated with each key, groups the values by window,
combines windows as needed, and for each window in each key,
outputs a collection of key/value-list pairs implicitly assigned
to the window and with the timestamp derived from that window.
- GroupByKey.GroupAlsoByWindow(WindowFn<?, ?>) - Constructor for class com.google.cloud.dataflow.sdk.transforms.GroupByKey.GroupAlsoByWindow
-
- GroupByKey.GroupByKeyOnly<K,V> - Class in com.google.cloud.dataflow.sdk.transforms
-
Primitive helper transform that groups by key only, ignoring any
window assignments.
- GroupByKey.GroupByKeyOnly() - Constructor for class com.google.cloud.dataflow.sdk.transforms.GroupByKey.GroupByKeyOnly
-
- GroupByKey.ReifyTimestampsAndWindows<K,V> - Class in com.google.cloud.dataflow.sdk.transforms
-
Helper transform that makes timestamps and window assignments
explicit in the value part of each key/value pair.
- GroupByKey.ReifyTimestampsAndWindows() - Constructor for class com.google.cloud.dataflow.sdk.transforms.GroupByKey.ReifyTimestampsAndWindows
-
- GroupByKey.SortValuesByTimestamp<K,V> - Class in com.google.cloud.dataflow.sdk.transforms
-
Helper transform that sorts the values associated with each key
by timestamp.
- GroupByKey.SortValuesByTimestamp() - Constructor for class com.google.cloud.dataflow.sdk.transforms.GroupByKey.SortValuesByTimestamp
-
- groupedValues(SerializableFunction<Iterable<V>, V>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Combine
-
Returns a
Combine.GroupedValues
PTransform that takes a
PCollection of
KVs where a key maps to an
Iterable of values, e.g.,
the result of a
GroupByKey, then uses the given
SerializableFunction to combine all the values associated
with a key, ignoring the key.
- groupedValues(Combine.CombineFn<? super VI, ?, VO>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Combine
-
Returns a
Combine.GroupedValues
PTransform that takes a
PCollection of
KVs where a key maps to an
Iterable of values, e.g.,
the result of a
GroupByKey, then uses the given
CombineFn to combine all the values associated with a
key, ignoring the key.
- groupedValues(Combine.KeyedCombineFn<? super K, ? super VI, ?, VO>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Combine
-
Returns a
Combine.GroupedValues
PTransform that takes a
PCollection of
KVs where a key maps to an
Iterable of values, e.g.,
the result of a
GroupByKey, then uses the given
KeyedCombineFn to combine all the values associated with
each key.
- idLabel(String) - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
Returns a new TextIO.Read PTransform that's like this one but reading unique ids
from the given PubSub label.
- idLabel(String) - Static method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read
-
Creates and returns a PubSubIO.Read PTransform where unique record identifiers are
expected to be provided using the PubSub labeling API.
- idLabel(String) - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Write.Bound
-
Returns a new TextIO.Write PTransform that's like this one but publishing record ids
to the given PubSub label.
- idLabel(String) - Static method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Write
-
If specified, Dataflow will add a Pubsub label to each output record containing a unique
identifier for that record.
- in(Pipeline) - Static method in class com.google.cloud.dataflow.sdk.values.PBegin
-
Returns a PBegin in the given Pipeline.
- INDEX_OF_MAX - Static variable in class com.google.cloud.dataflow.sdk.io.ShardNameTemplate
-
Shard name containing the index and max.
- initialize(AbstractGoogleClientRequest<?>) - Method in class com.google.cloud.dataflow.sdk.options.GoogleApiDebugOptions.GoogleApiTracer
-
- INSTANCE - Static variable in class com.google.cloud.dataflow.sdk.transforms.windowing.GlobalWindow.Coder
-
- INSTANCE - Static variable in class com.google.cloud.dataflow.sdk.transforms.windowing.GlobalWindow
-
- InstantCoder - Class in com.google.cloud.dataflow.sdk.coders
-
A InstantCoder encodes joda Instant.
- integersGlobally() - Static method in class com.google.cloud.dataflow.sdk.transforms.Max
-
Returns a PTransform that takes an input
PCollection<Integer> and returns a
PCollection<Integer> whose contents is the maximum of the
input PCollection's elements, or
Integer.MIN_VALUE if there are no elements.
- integersGlobally() - Static method in class com.google.cloud.dataflow.sdk.transforms.Min
-
Returns a PTransform that takes an input
PCollection<Integer> and returns a
PCollection<Integer> whose contents is a single value that is
the minimum of the input PCollection's elements, or
Integer.MAX_VALUE if there are no elements.
- integersGlobally() - Static method in class com.google.cloud.dataflow.sdk.transforms.Sum
-
Returns a PTransform that takes an input
PCollection<Integer> and returns a
PCollection<Integer> whose contents is the sum of the
input PCollection's elements, or
0 if there are no elements.
- integersPerKey() - Static method in class com.google.cloud.dataflow.sdk.transforms.Max
-
Returns a PTransform that takes an input
PCollection<KV<K, Integer>> and returns a
PCollection<KV<K, Integer>> that contains an output
element mapping each distinct key in the input
PCollection to the maximum of the values associated with
that key in the input PCollection.
- integersPerKey() - Static method in class com.google.cloud.dataflow.sdk.transforms.Min
-
Returns a PTransform that takes an input
PCollection<KV<K, Integer>> and returns a
PCollection<KV<K, Integer>> that contains an output
element mapping each distinct key in the input
PCollection to the minimum of the values associated with
that key in the input PCollection.
- integersPerKey() - Static method in class com.google.cloud.dataflow.sdk.transforms.Sum
-
Returns a PTransform that takes an input
PCollection<KV<K, Integer>> and returns a
PCollection<KV<K, Integer>> that contains an output
element mapping each distinct key in the input
PCollection to the sum of the values associated with
that key in the input PCollection.
- intersects(IntervalWindow) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.IntervalWindow
-
Returns whether this window intersects the given window.
- IntervalWindow - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
- IntervalWindow(Instant, Instant) - Constructor for class com.google.cloud.dataflow.sdk.transforms.windowing.IntervalWindow
-
Creates a new IntervalWindow that represents the half-open time
interval [start, end).
- IntervalWindow(Instant, Duration) - Constructor for class com.google.cloud.dataflow.sdk.transforms.windowing.IntervalWindow
-
- into(WindowFn<? super T, ?>) - Static method in class com.google.cloud.dataflow.sdk.transforms.windowing.Window
-
Creates a
Window PTransform that uses the given
WindowFn to window the data.
- into(WindowFn<? super T, ?>) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.Window.Unbound
-
Returns a new
Window PTransform that's like this
transform but which will use the given
WindowFn, and which has
its input and output types bound.
- InvalidWindows<W extends BoundedWindow> - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
A
WindowFn that represents an invalid pipeline state.
- InvalidWindows(String, WindowFn<?, W>) - Constructor for class com.google.cloud.dataflow.sdk.transforms.windowing.InvalidWindows
-
- is(T) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.SingletonAssert
-
Deprecated.
- isCompatible(WindowFn) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.DaysWindows
-
- isCompatible(WindowFn) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.MonthsWindows
-
- isCompatible(WindowFn) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.YearsWindows
-
- isCompatible(WindowFn<?, ?>) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.FixedWindows
-
- isCompatible(WindowFn) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.GlobalWindows
-
- isCompatible(WindowFn<?, ?>) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.InvalidWindows
-
InvalidWindows objects with the same originalWindowFn are compatible.
- isCompatible(WindowFn<?, ?>) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.Sessions
-
- isCompatible(WindowFn<?, ?>) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.SlidingWindows
-
- isCompatible(WindowFn<?, ?>) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.WindowFn
-
Returns whether this performs the same merging as the given
WindowFn.
- isCompositeNode() - Method in class com.google.cloud.dataflow.sdk.runners.TransformTreeNode
-
Returns true if this node represents a composite transform.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.AvroCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.BigEndianIntegerCoder
-
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.BigEndianLongCoder
-
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.ByteArrayCoder
-
- isDeterministic() - Method in interface com.google.cloud.dataflow.sdk.coders.Coder
-
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.CustomCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.DelegateCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.DoubleCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.EntityCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.InstantCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.IterableLikeCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.KvCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.ListCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.MapCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.SerializableCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.SetCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.StringUtf8Coder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.TableRowJsonCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.TextualIntegerCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.VarIntCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.VarLongCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.coders.VoidCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult.CoGbkResultCoder
-
Deprecated.
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.GlobalWindow.Coder
-
- isDeterministic() - Method in class com.google.cloud.dataflow.sdk.values.TimestampedValue.TimestampedValueCoder
-
Deprecated.
- isDisjoint(IntervalWindow) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.IntervalWindow
-
Returns whether this window is disjoint from the given window.
- isEmpty() - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult
-
- isEmpty() - Method in class com.google.cloud.dataflow.sdk.transforms.join.KeyedPCollectionTuple
-
- isEqualTo(T) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.SingletonAssert
-
Checks that the value of this SingletonAssert's view is equal
to the expected value.
- isFinishedSpecifyingInternal() - Method in class com.google.cloud.dataflow.sdk.values.PValueBase
-
Returns whether this PValueBase has been finalized, and
its core properties, e.g., name, can no longer be changed.
- isOrdered() - Method in class com.google.cloud.dataflow.sdk.values.PCollection
-
Returns whether or not the elements of this PCollection have a
well-defined and fixed order, such that subsequent reading of the
PCollection is guaranteed to process the elements in order.
- isRegisterByteSizeObserverCheap(Integer, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.BigEndianIntegerCoder
-
Returns true since registerByteSizeObserver() runs in constant time.
- isRegisterByteSizeObserverCheap(Long, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.BigEndianLongCoder
-
Returns true since registerByteSizeObserver() runs in constant time.
- isRegisterByteSizeObserverCheap(byte[], Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.ByteArrayCoder
-
Returns true since registerByteSizeObserver() runs in constant time.
- isRegisterByteSizeObserverCheap(T, Coder.Context) - Method in interface com.google.cloud.dataflow.sdk.coders.Coder
-
- isRegisterByteSizeObserverCheap(Double, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.DoubleCoder
-
Returns true since registerByteSizeObserver() runs in constant time.
- isRegisterByteSizeObserverCheap(IT, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.IterableLikeCoder
-
Returns whether iterable can use lazy counting, since that
requires minimal extra computation.
- isRegisterByteSizeObserverCheap(KV<K, V>, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.KvCoder
-
Returns whether both keyCoder and valueCoder are considered not expensive.
- isRegisterByteSizeObserverCheap(T, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.StandardCoder
-
StandardCoder requires elements to be fully encoded and copied
into a byte stream to determine the byte size of the element,
which is considered expensive.
- isRegisterByteSizeObserverCheap(Integer, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.VarIntCoder
-
Returns true since registerByteSizeObserver() runs in constant time.
- isRegisterByteSizeObserverCheap(Long, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.VarLongCoder
-
Returns true since registerByteSizeObserver() runs in constant time.
- isRegisterByteSizeObserverCheap(Void, Coder.Context) - Method in class com.google.cloud.dataflow.sdk.coders.VoidCoder
-
Returns true since registerByteSizeObserver() runs in constant time.
- isStreaming() - Method in interface com.google.cloud.dataflow.sdk.options.StreamingOptions
-
- isWholeStream - Variable in class com.google.cloud.dataflow.sdk.coders.Coder.Context
-
Whether the encoded or decoded value fills the remainder of the
output or input (resp.) record/stream contents.
- IterableCoder<T> - Class in com.google.cloud.dataflow.sdk.coders
-
An IterableCoder encodes Iterables.
- IterableCoder(Coder<T>) - Constructor for class com.google.cloud.dataflow.sdk.coders.IterableCoder
-
- IterableLikeCoder<T,IT extends java.lang.Iterable<T>> - Class in com.google.cloud.dataflow.sdk.coders
-
The base class of Coders for Iterable subclasses.
- IterableLikeCoder(Coder<T>) - Constructor for class com.google.cloud.dataflow.sdk.coders.IterableLikeCoder
-
- iterables() - Static method in class com.google.cloud.dataflow.sdk.transforms.Flatten
-
Returns a PTransform that takes a PCollection<Iterable<T>>
and returns a PCollection<T> containing all the elements from
all the Iterables.
- of(Class<T>) - Static method in class com.google.cloud.dataflow.sdk.coders.AvroCoder
-
Returns an AvroCoder instance for the provided element type.
- of(Schema) - Static method in class com.google.cloud.dataflow.sdk.coders.AvroCoder
-
Returns an AvroCoder instance for the Avro schema.
- of(Class<T>, Schema) - Static method in class com.google.cloud.dataflow.sdk.coders.AvroCoder
-
Returns an AvroCoder instance for the provided element type
using the provided Avro schema.
- of(String, String) - Static method in class com.google.cloud.dataflow.sdk.coders.AvroCoder
-
- of() - Static method in class com.google.cloud.dataflow.sdk.coders.BigEndianIntegerCoder
-
- of() - Static method in class com.google.cloud.dataflow.sdk.coders.BigEndianLongCoder
-
- of() - Static method in class com.google.cloud.dataflow.sdk.coders.ByteArrayCoder
-
- of(Coder<T>) - Static method in class com.google.cloud.dataflow.sdk.coders.CollectionCoder
-
- of(List<Object>) - Static method in class com.google.cloud.dataflow.sdk.coders.CollectionCoder
-
- of(String, String, String) - Static method in class com.google.cloud.dataflow.sdk.coders.CustomCoder
-
- of(Coder<DT>, DelegateCoder.CodingFunction<T, DT>, DelegateCoder.CodingFunction<DT, T>) - Static method in class com.google.cloud.dataflow.sdk.coders.DelegateCoder
-
- of() - Static method in class com.google.cloud.dataflow.sdk.coders.DoubleCoder
-
- of() - Static method in class com.google.cloud.dataflow.sdk.coders.EntityCoder
-
- of() - Static method in class com.google.cloud.dataflow.sdk.coders.InstantCoder
-
- of(Coder<T>) - Static method in class com.google.cloud.dataflow.sdk.coders.IterableCoder
-
- of(List<Coder<?>>) - Static method in class com.google.cloud.dataflow.sdk.coders.IterableCoder
-
- of(Coder<K>, Coder<V>) - Static method in class com.google.cloud.dataflow.sdk.coders.KvCoder
-
- of(List<Coder<?>>) - Static method in class com.google.cloud.dataflow.sdk.coders.KvCoder
-
- of(String, boolean, List<Coder<?>>) - Static method in class com.google.cloud.dataflow.sdk.coders.KvCoderBase
-
- of(Coder<T>) - Static method in class com.google.cloud.dataflow.sdk.coders.ListCoder
-
- of(List<Coder<?>>) - Static method in class com.google.cloud.dataflow.sdk.coders.ListCoder
-
- of(Coder<K>, Coder<V>) - Static method in class com.google.cloud.dataflow.sdk.coders.MapCoder
-
Produces a MapCoder with the given keyCoder and valueCoder.
- of(List<Coder<?>>) - Static method in class com.google.cloud.dataflow.sdk.coders.MapCoder
-
- of(String, List<Coder<?>>) - Static method in class com.google.cloud.dataflow.sdk.coders.MapCoderBase
-
- of(Class<T>) - Static method in class com.google.cloud.dataflow.sdk.coders.SerializableCoder
-
Returns a SerializableCoder instance for the provided element type.
- of(String) - Static method in class com.google.cloud.dataflow.sdk.coders.SerializableCoder
-
- of(Coder<T>) - Static method in class com.google.cloud.dataflow.sdk.coders.SetCoder
-
Produces a SetCoder with the given elementCoder.
- of(List<Object>) - Static method in class com.google.cloud.dataflow.sdk.coders.SetCoder
-
Dynamically typed constructor for JSON deserialization.
- of(Class<T>) - Static method in class com.google.cloud.dataflow.sdk.coders.StringDelegateCoder
-
- of() - Static method in class com.google.cloud.dataflow.sdk.coders.StringUtf8Coder
-
- of() - Static method in class com.google.cloud.dataflow.sdk.coders.TableRowJsonCoder
-
- of() - Static method in class com.google.cloud.dataflow.sdk.coders.TextualIntegerCoder
-
- of() - Static method in class com.google.cloud.dataflow.sdk.coders.VarIntCoder
-
- of() - Static method in class com.google.cloud.dataflow.sdk.coders.VarLongCoder
-
- of() - Static method in class com.google.cloud.dataflow.sdk.coders.VoidCoder
-
- of(WindowedValue<V>) - Static method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.ValueWithMetadata
-
Returns a new ValueWithMetadata with the WindowedValue.
- of(Iterable<T>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Create
-
Returns a new
Create root transform that produces a
PCollection containing the specified elements.
- of(T...) - Static method in class com.google.cloud.dataflow.sdk.transforms.Create
-
Returns a new
Create root transform that produces a
PCollection containing the specified elements.
- of(Map<K, V>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Create
-
Returns a new
Create root transform that produces a
PCollection of
KVs corresponding to the keys and
values of the specified
Map.
- of(DoFn<I, O>) - Static method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
Returns a
DoFnTester supporting unit-testing of the given
DoFn.
- of(long) - Static method in class com.google.cloud.dataflow.sdk.transforms.First
-
Returns a First<T> PTransform.
- of(CoGbkResultSchema, UnionCoder) - Static method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult.CoGbkResultCoder
-
Returns a CoGbkResultCoder for the given schema and unionCoder.
- of(List<Coder<?>>, CoGbkResultSchema) - Static method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult.CoGbkResultCoder
-
- of(TupleTag<V>, List<V>) - Static method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult
-
Returns a new CoGbkResult that contains just the given tag the given data.
- of(List<TupleTag<?>>) - Static method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResultSchema
-
- of(TupleTag<VI>, PCollection<KV<K, VI>>) - Static method in class com.google.cloud.dataflow.sdk.transforms.join.KeyedPCollectionTuple
-
Returns a new KeyedPCollections<K> with the given tag and initial
PCollection.
- of(DoFn<I, O>) - Static method in class com.google.cloud.dataflow.sdk.transforms.ParDo
-
Creates a
ParDo PTransform that will invoke the
given
DoFn function.
- of(DoFn<I, O>) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Unbound
-
Returns a new
ParDo PTransform that's like this
transform but which will invoke the given
DoFn
function, and which has its input and output types bound.
- of(DoFn<I, O>) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.UnboundMulti
-
Returns a new multi-output
ParDo PTransform
that's like this transform but which will invoke the given
DoFn function, and which has its input type bound.
- of(int, Partition.PartitionFn<? super T>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Partition
-
Returns a new Partition PTransform that divides
its input PCollection into the given number of partitions,
using the given partitioning function.
- of(int, C) - Static method in class com.google.cloud.dataflow.sdk.transforms.Top
-
Returns a PTransform that takes an input
PCollection<T> and returns a PCollection<List<T>> with a
single element containing the largest count elements of the input
PCollection<T>, in decreasing order, sorted using the
given Comparator<T>.
- of(Duration) - Static method in class com.google.cloud.dataflow.sdk.transforms.windowing.FixedWindows
-
Partitions the timestamp space into half-open intervals of the form
[N * size, (N + 1) * size), where 0 is the epoch.
- of(Duration) - Static method in class com.google.cloud.dataflow.sdk.transforms.windowing.SlidingWindows
-
Assigns timestamps into half-open intervals of the form
[N * period, N * period + size), where 0 is the epoch.
- of(SerializableFunction<V, K>) - Static method in class com.google.cloud.dataflow.sdk.transforms.WithKeys
-
Returns a PTransform that takes a PCollection<V>
and returns a PCollection<KV<K, V>>, where each of the
values in the input PCollection has been paired with a
key computed from the value by invoking the given
SerializableFunction.
- of(K) - Static method in class com.google.cloud.dataflow.sdk.transforms.WithKeys
-
Returns a PTransform that takes a PCollection<V>
and returns a PCollection<KV<K, V>>, where each of the
values in the input PCollection has been paired with the
given key.
- of(String, Coder<T>) - Static method in class com.google.cloud.dataflow.sdk.values.CodedTupleTag
-
Returns a CodedTupleTag with the given id which uses the
given Coder whenever a value associated with the tag
needs to be serialized.
- of(Map<CodedTupleTag<?>, Object>) - Static method in class com.google.cloud.dataflow.sdk.values.CodedTupleTagMap
-
Returns a CodedTupleTagMap containing the given mappings.
- of(K, V) - Static method in class com.google.cloud.dataflow.sdk.values.KV
-
Returns a KV with the given key and value.
- of(PCollection<T>) - Static method in class com.google.cloud.dataflow.sdk.values.PCollectionList
-
Returns a singleton PCollectionList containing the given PCollection.
- of(Iterable<PCollection<T>>) - Static method in class com.google.cloud.dataflow.sdk.values.PCollectionList
-
Returns a PCollectionList containing the given PCollections, in order.
- of(TupleTag<T>, PCollection<T>) - Static method in class com.google.cloud.dataflow.sdk.values.PCollectionTuple
-
Returns a singleton PCollectionTuple containing the given
PCollection keyed by the given TupleTag.
- of(V, Instant) - Static method in class com.google.cloud.dataflow.sdk.values.TimestampedValue
-
Returns a new TimestampedValue with the given value and timestamp.
- of(Coder<T>) - Static method in class com.google.cloud.dataflow.sdk.values.TimestampedValue.TimestampedValueCoder
-
- of(List<Object>) - Static method in class com.google.cloud.dataflow.sdk.values.TimestampedValue.TimestampedValueCoder
-
- of(TupleTag<?>) - Static method in class com.google.cloud.dataflow.sdk.values.TupleTagList
-
Returns a singleton TupleTagList containing the given TupleTag.
- of(List<TupleTag<?>>) - Static method in class com.google.cloud.dataflow.sdk.values.TupleTagList
-
Returns a TupleTagList containing the given TupleTags, in order.
- ofPrimitiveOutputsInternal(TupleTagList, WindowFn<?, ?>) - Static method in class com.google.cloud.dataflow.sdk.values.PCollectionTuple
-
Returns a PCollectionTuple with each of the given tags mapping to a new
output PCollection.
- OUTER - Static variable in class com.google.cloud.dataflow.sdk.coders.Coder.Context
-
The outer context.
- output(O) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn.Context
-
Adds the given element to the main output PCollection.
- outputWithTimestamp(O, Instant) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn.Context
-
Adds the given element to the main output PCollection,
with the given timestamp.
- ParDo - Class in com.google.cloud.dataflow.sdk.transforms
-
ParDo is the core element-wise transform in Google Cloud
Dataflow, invoking a user-specified function (from I to
O) on each of the elements of the input
PCollection<I> to produce zero or more output elements, all
of which are collected into the output PCollection<O>.
- ParDo() - Constructor for class com.google.cloud.dataflow.sdk.transforms.ParDo
-
- ParDo.Bound<I,O> - Class in com.google.cloud.dataflow.sdk.transforms
-
A PTransform that, when applied to a PCollection<I>,
invokes a user-specified DoFn<I, O> on all its elements,
with all its outputs collected into an output
PCollection<O>.
- ParDo.BoundMulti<I,O> - Class in com.google.cloud.dataflow.sdk.transforms
-
A PTransform that, when applied to a
PCollection<I>, invokes a user-specified
DoFn<I, O> on all its elements, which can emit elements
to any of the PTransform's main and side output
PCollections, which are bundled into a result
PCollectionTuple.
- ParDo.Unbound - Class in com.google.cloud.dataflow.sdk.transforms
-
An incomplete ParDo transform, with unbound input/output types.
- ParDo.UnboundMulti<O> - Class in com.google.cloud.dataflow.sdk.transforms
-
An incomplete multi-output ParDo transform, with unbound
input type.
- parseTableSpec(String) - Static method in class com.google.cloud.dataflow.sdk.io.BigQueryIO
-
Parse a table specification in the form
"[project_id]:[dataset_id].[table_id]" or "[dataset_id].[table_id]".
- Partition<T> - Class in com.google.cloud.dataflow.sdk.transforms
-
Partition takes a PCollection<T> and a
PartitionFn, uses the PartitionFn to split the
elements of the input PCollection into N partitions, and
returns a PCollectionList<T> that bundles N
PCollection<T>s containing the split elements.
- Partition.PartitionFn<T> - Interface in com.google.cloud.dataflow.sdk.transforms
-
A function object that chooses an output partition for an element.
- partitionFor(T, int) - Method in interface com.google.cloud.dataflow.sdk.transforms.Partition.PartitionFn
-
Chooses the partition into which to put the given element.
- PartitioningWindowFn<T,W extends BoundedWindow> - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
A
WindowFn that places each value into exactly one window
based on its timestamp and never merges windows.
- PartitioningWindowFn() - Constructor for class com.google.cloud.dataflow.sdk.transforms.windowing.PartitioningWindowFn
-
- PBegin - Class in com.google.cloud.dataflow.sdk.values
-
- PBegin(Pipeline) - Constructor for class com.google.cloud.dataflow.sdk.values.PBegin
-
Constructs a PBegin in the given Pipeline.
- PCollection<T> - Class in com.google.cloud.dataflow.sdk.values
-
A PCollection<T> is an immutable collection of values of type
T.
- PCollectionList<T> - Class in com.google.cloud.dataflow.sdk.values
-
A PCollectionList<T> is an immutable list of homogeneously
typed PCollection<T>s.
- pCollections() - Static method in class com.google.cloud.dataflow.sdk.transforms.Flatten
-
Returns a
PTransform that flattens a
CollectionList
into a
PCollection containing all the elements of all
the
PCollections in its input.
- PCollectionTuple - Class in com.google.cloud.dataflow.sdk.values
-
A
PCollectionTuple is an immutable tuple of
heterogeneously-typed
PCollections, "keyed" by
TupleTags.
- PCollectionView<T,WT> - Interface in com.google.cloud.dataflow.sdk.values
-
A
PCollectionView<T, WT> is an immutable view of a
PCollection that can be accessed e.g.
- PDone - Class in com.google.cloud.dataflow.sdk.values
-
PDone is the output of a
PTransform that
doesn't have a non-trival result, e.g., a Write.
- PDone() - Constructor for class com.google.cloud.dataflow.sdk.values.PDone
-
- peekOutputElements() - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
Returns the elements output so far to the main output.
- peekSideOutputElements(TupleTag<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
Returns the elements output so far to the side output with the
given tag.
- perElement() - Static method in class com.google.cloud.dataflow.sdk.transforms.Count
-
- perKey(int, C) - Static method in class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles
-
Returns a PTransform that takes a
PCollection<KV<K, V>> and returns a
PCollection<KV<K, List<V>>> that contains an output
element mapping each distinct key in the input
PCollection to a List of the approximate
N-tiles of the values associated with that key in the
input PCollection.
- perKey(int) - Static method in class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles
-
- perKey(int) - Static method in class com.google.cloud.dataflow.sdk.transforms.ApproximateUnique
-
Returns a PTransform that takes a
PCollection<KV<K, V>> and returns a
PCollection<KV<K, Long>> that contains an output element
mapping each distinct key in the input PCollection to an
estimate of the number of distinct values associated with that
key in the input PCollection.
- perKey(double) - Static method in class com.google.cloud.dataflow.sdk.transforms.ApproximateUnique
-
- perKey(SerializableFunction<Iterable<V>, V>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Combine
-
Returns a
Combine.PerKey PTransform that
first groups its input
PCollection of
KVs by keys and
windows, then invokes the given function on each of the values lists to
produce a combined value, and then returns a
PCollection
of
KVs mapping each distinct key to its combined value for each
window.
- perKey(Combine.CombineFn<? super VI, ?, VO>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Combine
-
Returns a
Combine.PerKey PTransform that
first groups its input
PCollection of
KVs by keys and
windows, then invokes the given function on each of the values lists to
produce a combined value, and then returns a
PCollection
of
KVs mapping each distinct key to its combined value for each
window.
- perKey(Combine.KeyedCombineFn<? super K, ? super VI, ?, VO>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Combine
-
Returns a
Combine.PerKey PTransform that
first groups its input
PCollection of
KVs by keys and
windows, then invokes the given function on each of the key/values-lists
pairs to produce a combined value, and then returns a
PCollection of
KVs mapping each distinct key to
its combined value for each window.
- perKey() - Static method in class com.google.cloud.dataflow.sdk.transforms.Mean
-
Returns a PTransform that takes an input
PCollection<KV<K, N>> and returns a
PCollection<KV<K, Double>> that contains an output
element mapping each distinct key in the input
PCollection to the mean of the values associated with
that key in the input PCollection.
- perKey(int, C) - Static method in class com.google.cloud.dataflow.sdk.transforms.Top
-
Returns a PTransform that takes an input
PCollection<KV<K, V>> and returns a
PCollection<KV<K, List<V>>> that contains an output
element mapping each distinct key in the input
PCollection to the largest count values
associated with that key in the input
PCollection<KV<K, V>>, in decreasing order, sorted using
the given Comparator<V>.
- perWorker(DoFn<I, O>) - Static method in class com.google.cloud.dataflow.sdk.transforms.RateLimiting
-
Creates a new per-worker rate-limiting transform for the given
DoFn.
- PInput - Interface in com.google.cloud.dataflow.sdk.values
-
The abstract interface of things that might be input to a
PTransform.
- Pipeline - Class in com.google.cloud.dataflow.sdk
-
A Pipeline manages a DAG of PTransforms, and the PCollections
that the PTransforms consume and produce.
- Pipeline(PipelineRunner<?>) - Constructor for class com.google.cloud.dataflow.sdk.Pipeline
-
Deprecated.
- Pipeline(PipelineRunner<?>, PipelineOptions) - Constructor for class com.google.cloud.dataflow.sdk.Pipeline
-
- Pipeline.PipelineVisitor - Interface in com.google.cloud.dataflow.sdk
-
- PipelineOptions - Interface in com.google.cloud.dataflow.sdk.options
-
Dataflow SDK pipeline configuration options.
- PipelineOptionsFactory - Class in com.google.cloud.dataflow.sdk.options
-
- PipelineOptionsFactory() - Constructor for class com.google.cloud.dataflow.sdk.options.PipelineOptionsFactory
-
- PipelineOptionsFactory.Builder - Class in com.google.cloud.dataflow.sdk.options
-
A fluent PipelineOptions builder.
- PipelineOptionsRegistrar - Interface in com.google.cloud.dataflow.sdk.options
-
PipelineOptions creators have the ability to automatically have their
PipelineOptions registered with this SDK by creating a
ServiceLoader entry
and a concrete implementation of this interface.
- PipelineOptionsValidator - Class in com.google.cloud.dataflow.sdk.options
-
- PipelineOptionsValidator() - Constructor for class com.google.cloud.dataflow.sdk.options.PipelineOptionsValidator
-
- PipelineResult - Interface in com.google.cloud.dataflow.sdk
-
- PipelineRunner<Results extends PipelineResult> - Class in com.google.cloud.dataflow.sdk.runners
-
A PipelineRunner can execute, translate, or otherwise process a
Pipeline.
- PipelineRunner() - Constructor for class com.google.cloud.dataflow.sdk.runners.PipelineRunner
-
- PipelineRunnerRegistrar - Interface in com.google.cloud.dataflow.sdk.runners
-
PipelineRunner creators have the ability to automatically have their
PipelineRunner registered with this SDK by creating a
ServiceLoader entry
and a concrete implementation of this interface.
- popNode() - Method in class com.google.cloud.dataflow.sdk.runners.TransformHierarchy
-
Removes the last TransformTreeNode from the stack.
- POutput - Interface in com.google.cloud.dataflow.sdk.values
-
The abstract interface of things that might be output from a
PTransform.
- POutputValueBase - Class in com.google.cloud.dataflow.sdk.values
-
A POutputValueBase is the abstract base class of
PTransform outputs.
- POutputValueBase() - Constructor for class com.google.cloud.dataflow.sdk.values.POutputValueBase
-
- processBatch(I...) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
- processElement(DoFn<KV<K, Iterable<VI>>, KV<K, VO>>.ProcessContext) - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.TestCombineDoFn
-
- processElement(DoFn<I, O>.ProcessContext) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn
-
Processes an input element.
- processElement(I) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
- processElement(DoFn<I, O>.ProcessContext) - Method in class com.google.cloud.dataflow.sdk.transforms.RateLimiting.RateLimitingDoFn
-
- producesSortedKeys(PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Source
-
- producesSortedKeys(PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.io.Source
-
Whether this source is known to produce key/value pairs with the (encoded) keys in
lexicographically sorted order.
- PTransform<Input extends PInput,Output extends POutput> - Class in com.google.cloud.dataflow.sdk.transforms
-
A
PTransform<Input, Output> is an operation that takes an
Input (some subtype of
PInput) and produces an
Output (some subtype of
POutput).
- PTransform() - Constructor for class com.google.cloud.dataflow.sdk.transforms.PTransform
-
- PTransform(String) - Constructor for class com.google.cloud.dataflow.sdk.transforms.PTransform
-
- PubsubIO - Class in com.google.cloud.dataflow.sdk.io
-
Read and Write transforms for Pub/Sub streams.
- PubsubIO() - Constructor for class com.google.cloud.dataflow.sdk.io.PubsubIO
-
- PubsubIO.Read - Class in com.google.cloud.dataflow.sdk.io
-
A PTransform that continuously reads from a Pubsub stream and
returns a PCollection<String> containing the items from
the stream.
- PubsubIO.Read() - Constructor for class com.google.cloud.dataflow.sdk.io.PubsubIO.Read
-
- PubsubIO.Read.Bound - Class in com.google.cloud.dataflow.sdk.io
-
A PTransform that reads from a PubSub source and returns
a unbounded PCollection containing the items from the stream.
- PubsubIO.Validator - Class in com.google.cloud.dataflow.sdk.io
-
Utility class to validate topic and subscription names.
- PubsubIO.Validator() - Constructor for class com.google.cloud.dataflow.sdk.io.PubsubIO.Validator
-
- PubsubIO.Write - Class in com.google.cloud.dataflow.sdk.io
-
A PTransform that continuously writes a
PCollection<String> to a Pubsub stream.
- PubsubIO.Write() - Constructor for class com.google.cloud.dataflow.sdk.io.PubsubIO.Write
-
- PubsubIO.Write.Bound - Class in com.google.cloud.dataflow.sdk.io
-
A PTransfrom that writes a unbounded PCollection<String>
to a PubSub stream.
- pushNode(TransformTreeNode) - Method in class com.google.cloud.dataflow.sdk.runners.TransformHierarchy
-
Add a TransformTreeNode to the stack.
- PValue - Interface in com.google.cloud.dataflow.sdk.values
-
A
PValue is the interface to values that can be
input and output from
PTransforms.
- PValueBase - Class in com.google.cloud.dataflow.sdk.values
-
A
PValueBase is an abstract base class that provides
sensible default implementations for methods of
PValue.
- PValueBase() - Constructor for class com.google.cloud.dataflow.sdk.values.PValueBase
-
- Sample - Class in com.google.cloud.dataflow.sdk.transforms
-
PTransforms for taking samples of the elements in a
PCollection, or samples of the values associated with each
key in a PCollection of KVs.
- Sample() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Sample
-
- Sample.FixedSizedSampleFn<T> - Class in com.google.cloud.dataflow.sdk.transforms
-
CombineFn that computes a fixed-size sample of a
collection of values.
- satisfies(SerializableFunction<Iterable<T>, Void>) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.IterableAssert
-
- satisfies(DataflowAssert.AssertRelation<Iterable<T>, Iterable<T>>, Iterable<T>) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.IterableAssert
-
- satisfies(SerializableFunction<T, Void>) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.SingletonAssert
-
- satisfies(DataflowAssert.AssertRelation<T, T>, T) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.SingletonAssert
-
- SerializableCoder<T extends java.io.Serializable> - Class in com.google.cloud.dataflow.sdk.coders
-
An encoder of Serializable objects.
- SerializableCoder(Class<T>) - Constructor for class com.google.cloud.dataflow.sdk.coders.SerializableCoder
-
- SerializableComparator<T> - Interface in com.google.cloud.dataflow.sdk.transforms
-
A Serializable Comparator.
- SerializableFunction<I,O> - Interface in com.google.cloud.dataflow.sdk.transforms
-
A function that computes an output value based on an input value,
and is Serializable.
- Sessions - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
A WindowFn windowing values into sessions separated by
Sessions.gapDuration-long
periods with no elements.
- set(long...) - Static method in class com.google.cloud.dataflow.sdk.testing.WindowFnTestUtils
-
- setApiRootUrl(String) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
- setAppName(String) - Method in interface com.google.cloud.dataflow.sdk.options.ApplicationNameOptions
-
- setAutoscalingAlgorithm(DataflowPipelineWorkerPoolOptions.AutoscalingAlgorithmType) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- setCdbgVersion(String) - Method in interface com.google.cloud.dataflow.sdk.options.CloudDebuggerOptions
-
- setClusterManagerApi(DataflowPipelineWorkerPoolOptions.ClusterManagerApiType) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- SetCoder<T> - Class in com.google.cloud.dataflow.sdk.coders
-
A SetCoder encodes Sets.
- SetCoder(Coder<T>) - Constructor for class com.google.cloud.dataflow.sdk.coders.SetCoder
-
- setCoder(Coder<T>) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.IterableAssert
-
Sets the coder to use for elements of type T, as needed
for internal purposes.
- setCoder(Optional<Coder<T>>) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.IterableAssert
-
Sets the coder to use for elements of type T, as needed
for internal purposes.
- setCoder(Coder<T>) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.SingletonAssert
-
Sets the coder to use for elements of type T, as needed
for internal purposes.
- setCoder(Optional<Coder<T>>) - Method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert.SingletonAssert
-
Sets the coder to use for elements of type T, as needed
for internal purposes.
- setCoder(Coder<T>) - Method in class com.google.cloud.dataflow.sdk.values.PCollection
-
Sets the Coder used by this PCollection to encode and decode the
values stored in it.
- setCoder(Coder<T>) - Method in class com.google.cloud.dataflow.sdk.values.TypedPValue
-
Sets the Coder used by this TypedPValue to encode and decode the
values stored in it.
- setCoderRegistry(CoderRegistry) - Method in class com.google.cloud.dataflow.sdk.Pipeline
-
- setCredentialDir(String) - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
- setCredentialFactoryClass(Class<? extends CredentialFactory>) - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
- setCredentialId(String) - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
- setDataflowClient(Dataflow) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineOptions
-
- setDataflowEndpoint(String) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
- setDataflowJobFile(String) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
- setDefaultWorkerLogLevel(DataflowWorkerLoggingOptions.Level) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowWorkerLoggingOptions
-
- setDiskSizeGb(int) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- setDiskSourceImage(String) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- setExecutorService(ExecutorService) - Method in interface com.google.cloud.dataflow.sdk.options.GcsOptions
-
- setExperiments(List<String>) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
- setFilesToStage(List<String>) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- setGCloudPath(String) - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
- setGcpCredential(Credential) - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
- setGcsUtil(GcsUtil) - Method in interface com.google.cloud.dataflow.sdk.options.GcsOptions
-
- setGoogleApiTrace(GoogleApiDebugOptions.GoogleApiTracer[]) - Method in interface com.google.cloud.dataflow.sdk.options.GoogleApiDebugOptions
-
- setHooks(DataflowPipelineRunnerHooks) - Method in class com.google.cloud.dataflow.sdk.runners.BlockingDataflowPipelineRunner
-
Sets callbacks to invoke during execution see DataflowPipelineRunnerHooks.
- setHooks(DataflowPipelineRunnerHooks) - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineRunner
-
Sets callbacks to invoke during execution see DataflowPipelineRunnerHooks.
- setJobId(String) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowWorkerHarnessOptions
-
- setJobMessageOutput(PrintStream) - Method in interface com.google.cloud.dataflow.sdk.options.BlockingDataflowPipelineOptions
-
- setJobName(String) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineOptions
-
- setMaxNumWorkers(int) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- setName(String) - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Sets the base name of this PTransform.
- setName(String) - Method in class com.google.cloud.dataflow.sdk.values.PCollection
-
Sets the name of this PCollection.
- setName(String) - Method in class com.google.cloud.dataflow.sdk.values.PValueBase
-
Sets the name of this PValueBase.
- setNumWorkers(int) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- setOrdered(boolean) - Method in class com.google.cloud.dataflow.sdk.values.PCollection
-
Sets whether or not this PCollection should preserve the order in
which elements are put in it, such that subsequent parallel
reading of the PCollection is guaranteed to process the elements
in order.
- setOutput(TransformTreeNode, POutput) - Method in class com.google.cloud.dataflow.sdk.runners.TransformHierarchy
-
Sets the output of a transform node.
- setOutput(POutput) - Method in class com.google.cloud.dataflow.sdk.runners.TransformTreeNode
-
Adds an output to the transform node.
- setPathValidator(PathValidator) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
- setPathValidatorClass(Class<? extends PathValidator>) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
- setPCollection(PCollection<T>, List<T>) - Method in interface com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.EvaluationContext
-
Shorthand for setting the value of a PCollection where the elements do not have
timestamps or windows.
- setPCollectionValuesWithMetadata(PCollection<T>, List<DirectPipelineRunner.ValueWithMetadata<T>>) - Method in interface com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.EvaluationContext
-
Sets the value of the given PCollection, where each element also has a timestamp
and collection of windows.
- setPCollectionView(PCollectionView<T, WT>, Iterable<WindowedValue<R>>) - Method in interface com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.EvaluationContext
-
- setPipeline(Pipeline) - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Deprecated.
- setPipelineInternal(Pipeline) - Method in class com.google.cloud.dataflow.sdk.values.PCollection
-
- setPipelineInternal(Pipeline) - Method in interface com.google.cloud.dataflow.sdk.values.PValue
-
- setPipelineInternal(Pipeline) - Method in class com.google.cloud.dataflow.sdk.values.PValueBase
-
Sets the owning Pipeline of this PValueBase.
- setPipelineInternal(Pipeline) - Method in class com.google.cloud.dataflow.sdk.values.TypedPValue
-
- setProject(String) - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
- setRunner(Class<? extends PipelineRunner<?>>) - Method in interface com.google.cloud.dataflow.sdk.options.PipelineOptions
-
- setSecretsFile(String) - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
- setServiceAccountKeyfile(String) - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
- setServiceAccountName(String) - Method in interface com.google.cloud.dataflow.sdk.options.GcpOptions
-
- setSideInput(PCollectionView<?, ?>, Iterable<WindowedValue<?>>) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
- setSideInputInGlobalWindow(PCollectionView<?, ?>, Iterable<?>) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
- setSideInputs(Map<PCollectionView<?, ?>, Iterable<WindowedValue<?>>>) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
Registers the tuple of values of the side input
PCollectionViews to
pass to the
DoFn under test.
- setSideOutputTags(TupleTagList) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
Registers the list of TupleTags that can be used by the
DoFn under test to output to side output
PCollections.
- setStager(Stager) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
- setStagerClass(Class<? extends Stager>) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineDebugOptions
-
- setStagingLocation(String) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineOptions
-
- setStreaming(boolean) - Method in interface com.google.cloud.dataflow.sdk.options.StreamingOptions
-
- setTeardownPolicy(DataflowPipelineWorkerPoolOptions.TeardownPolicy) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- setTempDatasetId(String) - Method in interface com.google.cloud.dataflow.sdk.options.BigQueryOptions
-
- setTempLocation(String) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineOptions
-
- setTypeTokenInternal(TypeToken<T>) - Method in class com.google.cloud.dataflow.sdk.values.PCollection
-
Sets the TypeToken<T> for this PCollection<T>, so that
the enclosing PCollectionTuple, PCollectionList<T>,
or PTransform<?, PCollection<T>>, etc., can provide
more detailed reflective information.
- setTypeTokenInternal(TypeToken<T>) - Method in class com.google.cloud.dataflow.sdk.values.TypedPValue
-
Sets the TypeToken<T> associated with this class.
- setVersion(String) - Method in class com.google.cloud.dataflow.sdk.options.CloudDebuggerOptions.DebuggerConfig
-
- setWindowFnInternal(WindowFn<?, ?>) - Method in class com.google.cloud.dataflow.sdk.values.PCollection
-
- setWorkerId(String) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowWorkerHarnessOptions
-
- setWorkerLogLevelOverrides(DataflowWorkerLoggingOptions.WorkerLogLevelOverride[]) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowWorkerLoggingOptions
-
- setWorkerMachineType(String) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- setZone(String) - Method in interface com.google.cloud.dataflow.sdk.options.DataflowPipelineWorkerPoolOptions
-
- ShardNameTemplate - Class in com.google.cloud.dataflow.sdk.io
-
Standard shard naming templates.
- ShardNameTemplate() - Constructor for class com.google.cloud.dataflow.sdk.io.ShardNameTemplate
-
- sideInput(PCollectionView<T, ?>) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn.Context
-
Returns the value of the side input.
- sideOutput(TupleTag<T>, T) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn.Context
-
Adds the given element to the side output PCollection with the
given tag.
- sideOutputWithTimestamp(TupleTag<T>, T, Instant) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn.Context
-
Adds the given element to the specified side output PCollection,
with the given timestamp.
- size() - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResultSchema
-
Returns the number of columms for this schema.
- size() - Method in class com.google.cloud.dataflow.sdk.values.PCollectionList
-
Returns the number of PCollections in this PCollectionList.
- size() - Method in class com.google.cloud.dataflow.sdk.values.TupleTagList
-
Returns the number of TupleTags in this TupleTagList.
- SlidingWindows - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
A WindowFn that windows values into possibly overlapping fixed-size
timestamp-based windows.
- smallest(int) - Static method in class com.google.cloud.dataflow.sdk.transforms.Top
-
Returns a PTransform that takes an input
PCollection<T> and returns a PCollection<List<T>> with a
single element containing the smallest count elements of the input
PCollection<T>, in increasing order, sorted according to
their natural order.
- smallestPerKey(int) - Static method in class com.google.cloud.dataflow.sdk.transforms.Top
-
Returns a PTransform that takes an input
PCollection<KV<K, V>> and returns a
PCollection<KV<K, List<V>>> that contains an output
element mapping each distinct key in the input
PCollection to the smallest count values
associated with that key in the input
PCollection<KV<K, V>>, in increasing order, sorted
according to their natural order.
- Source<T> - Class in com.google.cloud.dataflow.sdk.io
-
Base class for defining input formats, with custom logic for splitting the input
into shards (parts of the input, each of which may be processed on a different worker)
and creating a Source for reading the input.
- Source() - Constructor for class com.google.cloud.dataflow.sdk.io.Source
-
- Source.Reader<T> - Interface in com.google.cloud.dataflow.sdk.io
-
The interface which readers of custom input sources must implement.
- span(IntervalWindow) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.IntervalWindow
-
Returns the minimal window that includes both this window and
the given window.
- splitIntoShards(long, PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Source
-
- splitIntoShards(long, PipelineOptions) - Method in class com.google.cloud.dataflow.sdk.io.Source
-
Splits the source into shards.
- StandardCoder<T> - Class in com.google.cloud.dataflow.sdk.coders
-
- StandardCoder() - Constructor for class com.google.cloud.dataflow.sdk.coders.StandardCoder
-
- start() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.IntervalWindow
-
Returns the start of this window, inclusive.
- startBundle(DoFn<I, O>.Context) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn
-
Prepares this DoFn instance for processing a batch of elements.
- startBundle() - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
- startBundle(DoFn<I, O>.Context) - Method in class com.google.cloud.dataflow.sdk.transforms.RateLimiting.RateLimitingDoFn
-
- store(CodedTupleTag<T>, T) - Method in interface com.google.cloud.dataflow.sdk.transforms.DoFn.KeyedState
-
Updates this KeyedState in place so that the given tag
maps to the given value.
- StreamingOptions - Interface in com.google.cloud.dataflow.sdk.options
-
Options used to configure the streaming backend.
- StringDelegateCoder<T> - Class in com.google.cloud.dataflow.sdk.coders
-
A StringDelegateCoder<T> wraps a Coder
and encodes/decodes values of type T via string representations.
- StringDelegateCoder(Class<T>) - Constructor for class com.google.cloud.dataflow.sdk.coders.StringDelegateCoder
-
- StringUtf8Coder - Class in com.google.cloud.dataflow.sdk.coders
-
A StringUtf8Coder encodes Java Strings in UTF-8 encoding.
- subscription(String) - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
Returns a new TextIO.Read PTransform that's like this one but reading from the
given subscription.
- subscription(String) - Static method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read
-
Creates and returns a PubsubIO.Read PTransform for reading from
a specific Pubsub subscription.
- Sum - Class in com.google.cloud.dataflow.sdk.transforms
-
PTransforms for computing the sum of the elements in a
PCollection, or the sum of the values associated with
each key in a PCollection of KVs.
- Sum() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Sum
-
- Sum.SumDoubleFn - Class in com.google.cloud.dataflow.sdk.transforms
-
- Sum.SumDoubleFn() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Sum.SumDoubleFn
-
- Sum.SumIntegerFn - Class in com.google.cloud.dataflow.sdk.transforms
-
- Sum.SumIntegerFn() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Sum.SumIntegerFn
-
- Sum.SumLongFn - Class in com.google.cloud.dataflow.sdk.transforms
-
- Sum.SumLongFn() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Sum.SumLongFn
-
- TableRowJsonCoder - Class in com.google.cloud.dataflow.sdk.coders
-
A TableRowJsonCoder encodes BigQuery TableRow objects.
- takeOutputElements() - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
Returns the elements output so far to the main output.
- takeSideOutputElements(TupleTag<T>) - Method in class com.google.cloud.dataflow.sdk.transforms.DoFnTester
-
Returns the elements output so far to the side output with the given tag.
- TestDataflowPipelineOptions - Interface in com.google.cloud.dataflow.sdk.testing
-
- TestDataflowPipelineRunner - Class in com.google.cloud.dataflow.sdk.testing
-
TestDataflowPipelineRunner is a pipeline runner that wraps a
DataflowPipelineRunner when running tests against the
TestPipeline.
- TestPipeline - Class in com.google.cloud.dataflow.sdk.testing
-
A creator of test pipelines which can be used inside of tests that can be
configured to run locally or against the live service.
- TextIO - Class in com.google.cloud.dataflow.sdk.io
-
Transforms for reading and writing text files.
- TextIO() - Constructor for class com.google.cloud.dataflow.sdk.io.TextIO
-
- TextIO.CompressionType - Enum in com.google.cloud.dataflow.sdk.io
-
Possible text file compression types.
- TextIO.Read - Class in com.google.cloud.dataflow.sdk.io
-
A root PTransform that reads from a text file (or multiple text
files matching a pattern) and returns a PCollection containing
the decoding of each of the lines of the text file(s).
- TextIO.Read() - Constructor for class com.google.cloud.dataflow.sdk.io.TextIO.Read
-
- TextIO.Read.Bound<T> - Class in com.google.cloud.dataflow.sdk.io
-
A root PTransform that reads from a text file (or multiple text files
matching a pattern) and returns a bounded PCollection containing the
decoding of each of the lines of the text file(s).
- TextIO.Write - Class in com.google.cloud.dataflow.sdk.io
-
A PTransform that writes a PCollection to a text file (or
multiple text files matching a sharding pattern), with each
PCollection element being encoded into its own line.
- TextIO.Write() - Constructor for class com.google.cloud.dataflow.sdk.io.TextIO.Write
-
- TextIO.Write.Bound<T> - Class in com.google.cloud.dataflow.sdk.io
-
A PTransform that writes a bounded PCollection to a text file (or
multiple text files matching a sharding pattern), with each
PCollection element being encoded into its own line.
- TextualIntegerCoder - Class in com.google.cloud.dataflow.sdk.coders
-
A TextualIntegerCoder encodes Integers as text.
- that(PCollection<T>) - Static method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert
-
- thatIterable(PCollectionView<Iterable<T>, ?>) - Static method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert
-
- thatSingleton(PCollection<T>) - Static method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert
-
- thatSingletonIterable(PCollection<Iterable<T>>) - Static method in class com.google.cloud.dataflow.sdk.testing.DataflowAssert
-
- timestamp() - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn.ProcessContext
-
Returns the timestamp of the input element.
- timestamp() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.WindowFn.AssignContext
-
Returns the timestamp of the current element.
- timestamped(Iterable<TimestampedValue<T>>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Create
-
Returns a new root transform that produces a
PCollection containing
the specified elements with the specified timestamps.
- timestamped(TimestampedValue<T>...) - Static method in class com.google.cloud.dataflow.sdk.transforms.Create
-
Returns a new root transform that produces a
PCollection containing
the specified elements with the specified timestamps.
- timestamped(Iterable<T>, Iterable<Long>) - Static method in class com.google.cloud.dataflow.sdk.transforms.Create
-
Returns a new root transform that produces a
PCollection containing
the specified elements with the specified timestamps.
- TimestampedValue<V> - Class in com.google.cloud.dataflow.sdk.values
-
An immutable (value, timestamp) pair.
- TimestampedValue(V, Instant) - Constructor for class com.google.cloud.dataflow.sdk.values.TimestampedValue
-
- TimestampedValue.TimestampedValueCoder<T> - Class in com.google.cloud.dataflow.sdk.values
-
Coder for TimestampedValue.
- timestampLabel(String) - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
Returns a new TextIO.Read PTransform that's like this one but reading timestamps
from the given PubSub label.
- timestampLabel(String) - Static method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read
-
Creates and returns a PubsubIO.Read PTransform where record timestamps are expected
to be provided using the PubSub labeling API.
- timestampLabel(String) - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Write.Bound
-
Returns a new TextIO.Write PTransform that's like this one but publishing timestamps
to the given PubSub label.
- timestampLabel(String) - Static method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Write
-
If specified, Dataflow will add a Pubsub label to each output record specifying the logical
timestamp of the record.
- to(String) - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
Returns a new AvroIO.Write PTransform that's like this one but
that writes to the file(s) with the given filename prefix.
- to(String) - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write
-
Returns an AvroIO.Write PTransform that writes to the file(s)
with the given prefix.
- to(String) - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
Specifies the table specification.
- to(TableReference) - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
Specifies the table to be written to.
- to(String) - Static method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write
-
Creates a write transformation for the given table specification.
- to(TableReference) - Static method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write
-
Creates a write transformation for the given table.
- to(String) - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Sink
-
Returns a DatastoreIO.Write PTransform with given datasetId.
- to(String) - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
Returns a new TextIO.Write PTransform that's like this one but
that writes to the file(s) with the given filename prefix.
- to(String) - Static method in class com.google.cloud.dataflow.sdk.io.TextIO.Write
-
Returns a TextIO.Write PTransform that writes to the file(s)
with the given prefix.
- Top - Class in com.google.cloud.dataflow.sdk.transforms
-
PTransforms for finding the largest (or smallest) set
of elements in a PCollection, or the largest (or smallest)
set of values associated with each key in a PCollection of
KVs.
- Top() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Top
-
- Top.Largest<T extends java.lang.Comparable<T>> - Class in com.google.cloud.dataflow.sdk.transforms
-
Serializable Comparator that that uses the
compared elements' natural ordering.
- Top.Largest() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Top.Largest
-
- Top.Smallest<T extends java.lang.Comparable<T>> - Class in com.google.cloud.dataflow.sdk.transforms
-
Serializable Comparator that that uses the
reverse of the compared elements' natural ordering.
- Top.Smallest() - Constructor for class com.google.cloud.dataflow.sdk.transforms.Top.Smallest
-
- Top.TopCombineFn<T> - Class in com.google.cloud.dataflow.sdk.transforms
-
CombineFn for Top transforms that combines a
bunch of Ts into a single count-long
List<T>, using compareFn to choose the largest
Ts.
- Top.TopCombineFn(int, C) - Constructor for class com.google.cloud.dataflow.sdk.transforms.Top.TopCombineFn
-
- topic(String) - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read.Bound
-
Returns a new TextIO.Read PTransform that's like this one but reading from the
give topic.
- topic(String) - Static method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Read
-
Creates and returns a PubsubIO.Read PTransform for reading from
a Pubsub topic with the specified publisher topic.
- topic(String) - Method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Write.Bound
-
Returns a new TextIO.Write PTransform that's like this one but writing to the given
topic.
- topic(String) - Static method in class com.google.cloud.dataflow.sdk.io.PubsubIO.Write
-
The topic to publish to.
- toString() - Method in class com.google.cloud.dataflow.sdk.coders.DelegateCoder
-
- toString() - Method in class com.google.cloud.dataflow.sdk.coders.StandardCoder
-
- toString() - Method in class com.google.cloud.dataflow.sdk.coders.StringDelegateCoder
-
- toString() - Method in class com.google.cloud.dataflow.sdk.options.DataflowWorkerLoggingOptions.WorkerLogLevelOverride
-
- toString() - Method in class com.google.cloud.dataflow.sdk.options.GoogleApiDebugOptions.GoogleApiTracer
-
- toString() - Method in class com.google.cloud.dataflow.sdk.Pipeline
-
- toString() - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineRunner
-
- toString() - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator
-
- toString() - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner
-
- toString() - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResult
-
- toString() - Method in class com.google.cloud.dataflow.sdk.transforms.join.CoGbkResultSchema
-
- toString() - Method in class com.google.cloud.dataflow.sdk.transforms.join.RawUnionValue
-
- toString() - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
- toString() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.IntervalWindow
-
- toString() - Method in class com.google.cloud.dataflow.sdk.values.CodedTupleTag
-
- toString() - Method in class com.google.cloud.dataflow.sdk.values.KV
-
- toString() - Method in class com.google.cloud.dataflow.sdk.values.PValueBase
-
- toString() - Method in class com.google.cloud.dataflow.sdk.values.TupleTag
-
- toTableSpec(TableReference) - Static method in class com.google.cloud.dataflow.sdk.io.BigQueryIO
-
Returns a canonical string representation of the TableReference.
- TransformHierarchy - Class in com.google.cloud.dataflow.sdk.runners
-
Captures information about a collection of transformations and their
associated PValues.
- TransformHierarchy() - Constructor for class com.google.cloud.dataflow.sdk.runners.TransformHierarchy
-
- transforms - Variable in class com.google.cloud.dataflow.sdk.runners.RecordingPipelineVisitor
-
- TransformTreeNode - Class in com.google.cloud.dataflow.sdk.runners
-
Provides internal tracking of transform relationships with helper methods
for initialization and ordered visitation.
- TransformTreeNode(TransformTreeNode, PTransform<?, ?>, String, PInput) - Constructor for class com.google.cloud.dataflow.sdk.runners.TransformTreeNode
-
Creates a new TransformTreeNode with the given parent and transform.
- translate(PT, DataflowPipelineTranslator.TranslationContext) - Method in interface com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator.TransformTranslator
-
- translate(Pipeline, List<DataflowPackage>) - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineTranslator
-
- traverseTopologically(Pipeline.PipelineVisitor) - Method in class com.google.cloud.dataflow.sdk.Pipeline
-
- TupleTag<V> - Class in com.google.cloud.dataflow.sdk.values
-
A
TupleTag is a typed tag to use as the key of a
heterogeneously typed tuple, like
PCollectionTuple or
Its generic type parameter allows tracking
the static type of things stored in tuples.
- TupleTag() - Constructor for class com.google.cloud.dataflow.sdk.values.TupleTag
-
Constructs a new TupleTag, with a fresh unique id.
- TupleTag(String) - Constructor for class com.google.cloud.dataflow.sdk.values.TupleTag
-
Constructs a new TupleTag with the given id.
- TupleTagList - Class in com.google.cloud.dataflow.sdk.values
-
A
TupleTagList is an immutable list of heterogeneously
typed
TupleTags.
- TypedPValue<T> - Class in com.google.cloud.dataflow.sdk.values
-
A TypedPValue<T> is the abstract base class of things that
store some number of values of type T.
- TypedPValue() - Constructor for class com.google.cloud.dataflow.sdk.values.TypedPValue
-
- waitToFinish(long, TimeUnit, MonitoringUtil.JobMessagesHandler) - Method in class com.google.cloud.dataflow.sdk.runners.DataflowPipelineJob
-
Wait for the job to finish and return the final status.
- weeks(int, int) - Static method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows
-
Returns a
WindowFn that windows elements into periods measured by weeks.
- Window - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
Window logically divides up or groups the elements of a
PCollection into finite windows according to a
WindowFn.
- Window() - Constructor for class com.google.cloud.dataflow.sdk.transforms.windowing.Window
-
- Window.Bound<T> - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
A PTransform that windows the elements of a PCollection<T>,
into finite windows according to a user-specified WindowFn<T, B>.
- Window.Remerge<T> - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
PTransform that does not change assigned windows, but will cause
windows to be merged again as part of the next
GroupByKey.
- Window.Remerge() - Constructor for class com.google.cloud.dataflow.sdk.transforms.windowing.Window.Remerge
-
- Window.Unbound - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
An incomplete Window transform, with unbound input/output type.
- windowCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.DaysWindows
-
- windowCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.MonthsWindows
-
- windowCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.YearsWindows
-
- windowCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.FixedWindows
-
- windowCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.GlobalWindows
-
- windowCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.InvalidWindows
-
- windowCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.Sessions
-
- windowCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.SlidingWindows
-
- windowCoder() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.WindowFn
-
Returns the
Coder used for serializing the windows used
by this windowFn.
- WindowFn<T,W extends BoundedWindow> - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
The argument to the
Window transform used to assign elements into
windows and to determine how windows are merged.
- WindowFn() - Constructor for class com.google.cloud.dataflow.sdk.transforms.windowing.WindowFn
-
- WindowFn.AssignContext - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
- WindowFn.AssignContext() - Constructor for class com.google.cloud.dataflow.sdk.transforms.windowing.WindowFn.AssignContext
-
- WindowFn.MergeContext - Class in com.google.cloud.dataflow.sdk.transforms.windowing
-
- WindowFn.MergeContext() - Constructor for class com.google.cloud.dataflow.sdk.transforms.windowing.WindowFn.MergeContext
-
- WindowFnTestUtils - Class in com.google.cloud.dataflow.sdk.testing
-
- WindowFnTestUtils() - Constructor for class com.google.cloud.dataflow.sdk.testing.WindowFnTestUtils
-
- windows() - Method in class com.google.cloud.dataflow.sdk.transforms.DoFn.ProcessContext
-
Returns the set of windows to which the input element has been assigned.
- windows() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.WindowFn.AssignContext
-
Returns the windows the current element was in, prior to this
AssignFn being called.
- windows() - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.WindowFn.MergeContext
-
Returns the current set of windows.
- withCoder(Coder<T1>) - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Read.Bound
-
Returns a new TextIO.Read PTransform that's like this one but
that uses the given Coder<T1> to decode each of the
lines of the file into a value of type T1.
- withCoder(Coder<T>) - Static method in class com.google.cloud.dataflow.sdk.io.TextIO.Read
-
Returns a TextIO.Read PTransform that uses the given
Coder<T> to decode each of the lines of the file into a
value of type T.
- withCoder(Coder<T1>) - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
Returns a new TextIO.Write PTransform that's like this one
but that uses the given Coder<T1> to encode each of
the elements of the input PCollection<T1> into an
output text line.
- withCoder(Coder<T>) - Static method in class com.google.cloud.dataflow.sdk.io.TextIO.Write
-
Returns a TextIO.Write PTransform that uses the given
Coder<T> to encode each of the elements of the input
PCollection<T> into an output text line.
- withCompressionType(TextIO.CompressionType) - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Read.Bound
-
Returns a new TextIO.Read PTransform that's like this one but
reads from input sources using the specified compression type.
- withCompressionType(TextIO.CompressionType) - Static method in class com.google.cloud.dataflow.sdk.io.TextIO.Read
-
Returns a TextIO.Read PTransform that reads from a file with the
specified compression type.
- withCreateDisposition(BigQueryIO.Write.CreateDisposition) - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
Specifies options for creating the table.
- withCreateDisposition(BigQueryIO.Write.CreateDisposition) - Static method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write
-
Specifies options for creating the table.
- withDataset(String) - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Source
-
- withEncodabilityTesting(boolean) - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner
-
Enable runtime testing to verify that all values can be encoded.
- withEpsilon(double) - Method in class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles.ApproximateQuantilesCombineFn
-
Returns an ApproximateQuantilesCombineFn that's like
this one except that it uses the specified epsilon
value.
- withGapDuration(Duration) - Static method in class com.google.cloud.dataflow.sdk.transforms.windowing.Sessions
-
Creates a
Sessions WindowFn with the specified gap duration.
- withHost(String) - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Sink
-
Returns a new DatastoreIO.Write PTransform with specified host.
- withHost(String) - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Source
-
- withHotKeys(SerializableFunction<? super K, Integer>) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.PerKey
-
If a single key has disproportionately many values, it may become a
bottleneck, especially in streaming mode.
- withHotKeys(int) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.PerKey
-
- withKey(Object) - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.ValueWithMetadata
-
Returns a new ValueWithMetadata with the implicit key associated
with this value set.
- WithKeys<K,V> - Class in com.google.cloud.dataflow.sdk.transforms
-
WithKeys<K, V> takes a PCollection<V>, and either a
constant key of type K or a function from V to
K, and returns a PCollection<KV<K, V>>, where each
of the values in the input PCollection has been paired with
either the constant key or a key computed from the value.
- withMaxInputSize(long) - Method in class com.google.cloud.dataflow.sdk.transforms.ApproximateQuantiles.ApproximateQuantilesCombineFn
-
Returns an ApproximateQuantilesCombineFn that's like
this one except that it uses the specified maxNumElements
value.
- withMaxParallelism(int) - Method in class com.google.cloud.dataflow.sdk.transforms.RateLimiting.RateLimitingTransform
-
Modifies this RateLimitingTransform, specifying a maximum
per-worker parallelism.
- withMockEstimateSizeBytes(Supplier<Long>) - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Source
-
For testing only.
- withName(String) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.PerKey
-
- withName(String) - Method in class com.google.cloud.dataflow.sdk.transforms.Combine.PerKeyWithHotKeys
-
- withName(String) - Method in class com.google.cloud.dataflow.sdk.transforms.PTransform
-
Sets the base name of this PTransform and returns itself.
- withNumShards(int) - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
Returns a new AvroIO.Write PTransform that's like this one but
that uses the provided shard count.
- withNumShards(int) - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write
-
Returns an AvroIO.Write PTransform that uses the provided shard count.
- withNumShards(int) - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
Returns a new TextIO.Write PTransform that's like this one but
that uses the provided shard count.
- withNumShards(int) - Static method in class com.google.cloud.dataflow.sdk.io.TextIO.Write
-
Returns a TextIO.Write PTransform that uses the provided shard count.
- withOffset(Duration) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.FixedWindows
-
Partitions the timestamp space into half-open intervals of the form
[N * size + offset, (N + 1) * size + offset),
where 0 is the epoch.
- withOffset(Duration) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.SlidingWindows
-
Assigns timestamps into half-open intervals of the form
[N * period + offset, N * period + offset + size).
- withOutputTags(TupleTag<O>, TupleTagList) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Bound
-
Returns a new multi-output ParDo PTransform
that's like this PTransform but with the specified main
and side output tags.
- withOutputTags(TupleTag<O>, TupleTagList) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Unbound
-
Returns a new multi-output ParDo transform that's like
this transform but with the specified main and side output
tags.
- withOutputTags(TupleTag<O>, TupleTagList) - Static method in class com.google.cloud.dataflow.sdk.transforms.ParDo
-
Creates a multi-output
ParDo PTransform whose
output
PCollections will be referenced using the given main
output and side output tags.
- withoutSharding() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
Returns a new AvroIO.Write PTransform that's like this one but
that forces a single file as output.
- withoutSharding() - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write
-
Returns an AvroIO.Write PTransform that forces a single file as
output.
- withoutSharding() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
Returns a new TextIO.Write PTransform that's like this one but
that forces a single file as output.
- withoutSharding() - Static method in class com.google.cloud.dataflow.sdk.io.TextIO.Write
-
Returns a TextIO.Write PTransform that forces a single file as
output.
- withoutStrictParsing() - Method in class com.google.cloud.dataflow.sdk.options.PipelineOptionsFactory.Builder
-
During parsing of the arguments, we will skip over improperly formatted and unknown
arguments.
- withoutValidation() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read.Bound
-
Returns a new TextIO.Read PTransform that's like this one but
that has GCS input path validation on pipeline creation disabled.
- withoutValidation() - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read
-
Returns a AvroIO.Read PTransform that has GCS path validation on
pipeline creation disabled.
- withoutValidation() - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
Returns a new TextIO.Write PTransform that's like this one but
that has GCS output path validation on pipeline creation disabled.
- withoutValidation() - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write
-
Returns a AvroIO.Write PTransform that has GCS path validation on
pipeline creation disabled.
- withoutValidation() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Read.Bound
-
Disable table validation.
- withoutValidation() - Static method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Read
-
Disables BigQuery table validation which is enabled by default.
- withoutValidation() - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
Disable table validation.
- withoutValidation() - Static method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write
-
Disables BigQuery table validation which is enabled by default.
- withoutValidation() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Read.Bound
-
Returns a new TextIO.Read PTransform that's like this one but
that has GCS path validation on pipeline creation disabled.
- withoutValidation() - Static method in class com.google.cloud.dataflow.sdk.io.TextIO.Read
-
Returns a TextIO.Read PTransform that has GCS path validation on
pipeline creation disabled.
- withoutValidation() - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
Returns a new TextIO.Write PTransform that's like this one but
that has GCS output path validation on pipeline creation disabled.
- withoutValidation() - Static method in class com.google.cloud.dataflow.sdk.io.TextIO.Write
-
Returns a TextIO.Write PTransform that has GCS path validation on
pipeline creation disabled.
- withQuery(DatastoreV1.Query) - Method in class com.google.cloud.dataflow.sdk.io.DatastoreIO.Source
-
- withRateLimit(double) - Method in class com.google.cloud.dataflow.sdk.transforms.RateLimiting.RateLimitingTransform
-
Modifies this RateLimitingTransform, specifying a maximum
per-worker element processing rate.
- withSchema(Class<T1>) - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read.Bound
-
Returns a new AvroIO.Read PTransform that's like this one but
that reads Avro file(s) containing records whose type is the
specified Avro-generated class.
- withSchema(Schema) - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read.Bound
-
Returns a new AvroIO.Read PTransform that's like this one but
that reads Avro file(s) containing records of the specified schema.
- withSchema(String) - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read.Bound
-
Returns a new AvroIO.Read PTransform that's like this one but
that reads Avro file(s) containing records of the specified schema
in a JSON-encoded string form.
- withSchema(Class<T>) - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read
-
Returns an AvroIO.Read PTransform that reads Avro file(s)
containing records whose type is the specified Avro-generated class.
- withSchema(Schema) - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read
-
Returns an AvroIO.Read PTransform that reads Avro file(s)
containing records of the specified schema.
- withSchema(String) - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Read
-
Returns an AvroIO.Read PTransform that reads Avro file(s)
containing records of the specified schema in a JSON-encoded
string form.
- withSchema(Class<T1>) - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
Returns a new AvroIO.Write PTransform that's like this one but
that writes to Avro file(s) containing records whose type is the
specified Avro-generated class.
- withSchema(Schema) - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
Returns a new AvroIO.Write PTransform that's like this one but
that writes to Avro file(s) containing records of the specified
schema.
- withSchema(String) - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
Returns a new AvroIO.Write PTransform that's like this one but
that writes to Avro file(s) containing records of the specified
schema in a JSON-encoded string form.
- withSchema(Class<T>) - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write
-
Returns an AvroIO.Write PTransform that writes Avro file(s)
containing records whose type is the specified Avro-generated class.
- withSchema(Schema) - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write
-
Returns an AvroIO.Write PTransform that writes Avro file(s)
containing records of the specified schema.
- withSchema(String) - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write
-
Returns an AvroIO.Write PTransform that writes Avro file(s)
containing records of the specified schema in a JSON-encoded
string form.
- withSchema(TableSchema) - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
Specifies the table schema, used if the table is created.
- withSchema(TableSchema) - Static method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write
-
Specifies a table schema to use in table creation.
- withSerializabilityTesting(boolean) - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner
-
Enable runtime testing to verify that all functions and
Coder
instances can be serialized.
- withShardNameTemplate(String) - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
Returns a new AvroIO.Write PTransform that's like this one but
that uses the given shard name template.
- withShardNameTemplate(String) - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write
-
Returns an AvroIO.Write PTransform that uses the given shard name
template.
- withShardNameTemplate(String) - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
Returns a new TextIO.Write PTransform that's like this one but
that uses the given shard name template.
- withShardNameTemplate(String) - Static method in class com.google.cloud.dataflow.sdk.io.TextIO.Write
-
Returns a TextIO.Write PTransform that uses the given shard name
template.
- withSideInputs(PCollectionView<?, ?>...) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Bound
-
Returns a new ParDo PTransform that's like this
PTransform but with the specified side inputs.
- withSideInputs(Iterable<? extends PCollectionView<?, ?>>) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Bound
-
Returns a new ParDo PTransform that's like this
PTransform but with the specified side inputs.
- withSideInputs(PCollectionView<?, ?>...) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.BoundMulti
-
Returns a new multi-output ParDo PTransform
that's like this PTransform but with the specified side
inputs.
- withSideInputs(Iterable<? extends PCollectionView<?, ?>>) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.BoundMulti
-
Returns a new multi-output ParDo PTransform
that's like this PTransform but with the specified side
inputs.
- withSideInputs(PCollectionView<?, ?>...) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Unbound
-
Returns a new ParDo transform that's like this
transform but with the specified side inputs.
- withSideInputs(Iterable<? extends PCollectionView<?, ?>>) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.Unbound
-
Returns a new ParDo transform that's like this
transform but with the specified side inputs.
- withSideInputs(PCollectionView<?, ?>...) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.UnboundMulti
-
Returns a new multi-output ParDo transform that's like
this transform but with the specified side inputs.
- withSideInputs(Iterable<? extends PCollectionView<?, ?>>) - Method in class com.google.cloud.dataflow.sdk.transforms.ParDo.UnboundMulti
-
Returns a new multi-output ParDo transform that's like
this transform but with the specified side inputs.
- withSideInputs(PCollectionView<?, ?>...) - Static method in class com.google.cloud.dataflow.sdk.transforms.ParDo
-
Creates a ParDo PTransform with the given
side inputs.
- withSideInputs(Iterable<? extends PCollectionView<?, ?>>) - Static method in class com.google.cloud.dataflow.sdk.transforms.ParDo
-
Creates a ParDo with the given side inputs.
- withStartingDay(int, int, int) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.DaysWindows
-
- withStartingMonth(int, int) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.MonthsWindows
-
- withStartingYear(int) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.YearsWindows
-
- withSuffix(String) - Method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write.Bound
-
Returns a new AvroIO.Write PTransform that's like this one but
that writes to the file(s) with the given filename suffix.
- withSuffix(String) - Static method in class com.google.cloud.dataflow.sdk.io.AvroIO.Write
-
Returns an AvroIO.Write PTransform that writes to the file(s) with the
given filename suffix.
- withSuffix(String) - Method in class com.google.cloud.dataflow.sdk.io.TextIO.Write.Bound
-
Returns a new TextIO.Write PTransform that's like this one but
that writes to the file(s) with the given filename suffix.
- withSuffix(String) - Static method in class com.google.cloud.dataflow.sdk.io.TextIO.Write
-
Returns a TextIO.Write PTransform that writes to the file(s) with the
given filename suffix.
- withTimeZone(DateTimeZone) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.DaysWindows
-
- withTimeZone(DateTimeZone) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.MonthsWindows
-
- withTimeZone(DateTimeZone) - Method in class com.google.cloud.dataflow.sdk.transforms.windowing.CalendarWindows.YearsWindows
-
- withUnorderednessTesting(boolean) - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner
-
Enable runtime testing to verify that functions do not depend on order
of the elements.
- withValidation() - Method in class com.google.cloud.dataflow.sdk.options.PipelineOptionsFactory.Builder
-
After creation we will validate that
PipelineOptions conforms to all the
validation criteria from
<T>.
- withValidation() - Method in class com.google.cloud.dataflow.sdk.options.PipelineOptionsFactory
-
After creation we will validate that
PipelineOptions conforms to all the
validation criteria from
<T>.
- withValue(T) - Method in class com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner.ValueWithMetadata
-
Returns a new ValueWithMetadata that is a copy of this one, but with
a different value.
- withWriteDisposition(BigQueryIO.Write.WriteDisposition) - Method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write.Bound
-
Specifies options for writing the table.
- withWriteDisposition(BigQueryIO.Write.WriteDisposition) - Static method in class com.google.cloud.dataflow.sdk.io.BigQueryIO.Write
-
Specifies options for writing to the table.
- write() - Static method in class com.google.cloud.dataflow.sdk.io.DatastoreIO
-
- writeTo(String) - Static method in class com.google.cloud.dataflow.sdk.io.DatastoreIO
-
Returns a new
DatastoreIO.Sink builder using the default host and given dataset.