Class Regex.Split
- java.lang.Object
  - org.apache.beam.sdk.transforms.PTransform<PCollection<java.lang.String>,PCollection<java.lang.String>>
    - org.apache.beam.sdk.transforms.Regex.Split
- All Implemented Interfaces: java.io.Serializable, HasDisplayData
- Enclosing class: Regex
public static class Regex.Split extends PTransform<PCollection<java.lang.String>,PCollection<java.lang.String>>
Regex.Split takes a PCollection<String> and returns a PCollection<String> in which each input string has been split into individual items. Each item is output as a separate string.
This transform splits the entire input line on a Regex. The split returns an array of items, and each item is output as a separate element of the PCollection<String>.
Depending on the Regex, an item produced by the split can be an empty ("") string. The outputEmpty parameter controls whether empty strings are output.
Example of use:
PCollection<String> words = ...;
PCollection<String> values = words.apply(Regex.split("\\W*"));
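The handling of empty strings described above can be illustrated without a pipeline. The following is a minimal sketch in plain Java, assuming the transform behaves like java.util.regex.Pattern.split followed by an optional filter on empty items. SplitSketch and splitLine are hypothetical names used only for illustration and are not part of the Beam API.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

public class SplitSketch {
    // Hypothetical helper (not part of Beam): splits one input line on the
    // pattern and keeps or drops empty items depending on outputEmpty.
    static List<String> splitLine(Pattern pattern, String line, boolean outputEmpty) {
        List<String> out = new ArrayList<>();
        for (String item : pattern.split(line)) {
            if (outputEmpty || !item.isEmpty()) {
                out.add(item);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Pattern commas = Pattern.compile(",");
        // Two adjacent separators produce an empty item between them.
        System.out.println(splitLine(commas, "a,,b", true));  // prints [a, , b]
        System.out.println(splitLine(commas, "a,,b", false)); // prints [a, b]
    }
}
```

Note that Pattern.split drops trailing empty strings on its own, so under this assumption the flag only affects empty items that appear before the last non-empty item.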
- See Also:
- Serialized Form
Field Summary
Fields inherited from class org.apache.beam.sdk.transforms.PTransform
name, resourceHints
Constructor Summary
Constructors
Split(java.util.regex.Pattern pattern, boolean outputEmpty)
Method Summary
Modifier and Type: PCollection<java.lang.String>
Method: expand(PCollection<java.lang.String> in)
Description: Override this method to specify how this PTransform should be expanded on the given InputT.
Methods inherited from class org.apache.beam.sdk.transforms.PTransform
compose, compose, getAdditionalInputs, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, getResourceHints, populateDisplayData, setResourceHints, toString, validate, validate
Method Detail
expand
public PCollection<java.lang.String> expand(PCollection<java.lang.String> in)
Description copied from class: PTransform
Override this method to specify how this PTransform should be expanded on the given InputT.
NOTE: This method should not be called directly. Instead, the PTransform should be applied to the InputT using the apply method.
Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).
- Specified by: expand in class PTransform<PCollection<java.lang.String>,PCollection<java.lang.String>>