Class Regex.Find

  • All Implemented Interfaces:
    java.io.Serializable, HasDisplayData
    Enclosing class:
    Regex

    public static class Regex.Find
    extends PTransform<PCollection<java.lang.String>,PCollection<java.lang.String>>
    Regex.Find<String> takes a PCollection<String> and returns a PCollection<String> containing the value extracted from the given Regex group for each input element that matches.

    This transform runs a Regex against each input line and looks for a match anywhere in the line. If no portion of the line matches the Regex, the line will not be output. If a portion of the line does match, the output will be the requested group of the Regex.

    Example of use:

    
     PCollection<String> words = ...;
     PCollection<String> values =
         words.apply(Regex.find("myregex (mygroup)", 1));
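
    The matching behavior described above can be sketched with plain java.util.regex, outside of any pipeline. This is only an illustration, not the transform's actual implementation: the class FindSketch and the helper findGroup are hypothetical names, the pattern and input lines are made up, and the sketch assumes that only the first match in each line is emitted.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class FindSketch {
    // Hypothetical helper (not part of Beam): mimics the per-element
    // behavior described above on an in-memory list of lines.
    public static List<String> findGroup(List<String> lines, Pattern pattern, int group) {
        List<String> out = new ArrayList<>();
        for (String line : lines) {
            Matcher m = pattern.matcher(line);
            // find() succeeds if any portion of the line matches,
            // unlike matches(), which requires the whole line to match.
            if (m.find()) {
                // Assumption: only the first match per line is used.
                out.add(m.group(group));
            }
            // Lines with no match are dropped from the output.
        }
        return out;
    }

    public static void main(String[] args) {
        Pattern p = Pattern.compile("myregex (\\w+)");
        List<String> lines = Arrays.asList(
            "prefix myregex alpha suffix",
            "no match here",
            "myregex beta");
        System.out.println(findGroup(lines, p, 1)); // prints [alpha, beta]
    }
}
```

    Group 0 refers to the whole matched portion, so passing 0 instead of 1 in this sketch would yield "myregex alpha" and "myregex beta".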
     
    See Also:
    Serialized Form
    • Constructor Detail

      • Find

        public Find(java.util.regex.Pattern pattern,
                    int group)
    • Method Detail

      • expand

        public PCollection<java.lang.String> expand(PCollection<java.lang.String> in)
        Description copied from class: PTransform
        Override this method to specify how this PTransform should be expanded on the given InputT.

        NOTE: This method should not be called directly. Instead, the PTransform should be applied to the InputT using the apply method.

        Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).

        Specified by:
        expand in class PTransform<PCollection<java.lang.String>,PCollection<java.lang.String>>