Class Group.ByFields<InputT>

  • All Implemented Interfaces:
    java.io.Serializable, HasDisplayData
    Enclosing class:
    Group

    public abstract static class Group.ByFields<InputT>
    extends Group.AggregateCombiner<InputT>
    a PTransform that groups schema elements based on the given fields.

    The output of this transform will have a key field of type Row containing the specified extracted fields. It will also have a value field of type Row containing the specified extracted fields.

    See Also:
    Serialized Form
    • Constructor Detail

      • ByFields

        public ByFields()
    • Method Detail

      • getToKvs

        public org.apache.beam.sdk.schemas.transforms.Group.ByFields.ToKv getToKvs()
      • aggregateField

        public <CombineInputT,​AccumT,​CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateField​(java.lang.String inputFieldName,
                                                                                                                    Combine.CombineFn<CombineInputT,​AccumT,​CombineOutputT> fn,
                                                                                                                    java.lang.String outputFieldName)
        Build up an aggregation function over the input elements.

        This method specifies an aggregation over single field of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

        Field types in the output schema will be inferred from the provided combine function. Sometimes the field type cannot be inferred due to Java's type erasure. In that case, use the overload that allows setting the output field type explicitly.

      • aggregateFieldBaseValue

        public <CombineInputT,​AccumT,​CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFieldBaseValue​(java.lang.String inputFieldName,
                                                                                                                             Combine.CombineFn<CombineInputT,​AccumT,​CombineOutputT> fn,
                                                                                                                             java.lang.String outputFieldName)
      • aggregateFieldBaseValue

        public <CombineInputT,​AccumT,​CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFieldBaseValue​(int inputFieldId,
                                                                                                                             Combine.CombineFn<CombineInputT,​AccumT,​CombineOutputT> fn,
                                                                                                                             java.lang.String outputFieldName)
      • aggregateField

        public <CombineInputT,​AccumT,​CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateField​(java.lang.String inputFieldName,
                                                                                                                    Combine.CombineFn<CombineInputT,​AccumT,​CombineOutputT> fn,
                                                                                                                    Schema.Field outputField)
        Build up an aggregation function over the input elements.

        This method specifies an aggregation over single field of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

        Specified by:
        aggregateField in class Group.AggregateCombiner<InputT>
      • aggregateFields

        public <CombineInputT,​AccumT,​CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFields​(java.util.List<java.lang.String> inputFieldNames,
                                                                                                                     Combine.CombineFn<CombineInputT,​AccumT,​CombineOutputT> fn,
                                                                                                                     java.lang.String outputFieldName)
        Build up an aggregation function over the input elements.

        This method specifies an aggregation over multiple fields of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

        Field types in the output schema will be inferred from the provided combine function. Sometimes the field type cannot be inferred due to Java's type erasure. In that case, use the overload that allows setting the output field type explicitly.

      • aggregateFieldsById

        public <CombineInputT,​AccumT,​CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFieldsById​(java.util.List<java.lang.Integer> inputFieldIds,
                                                                                                                         Combine.CombineFn<CombineInputT,​AccumT,​CombineOutputT> fn,
                                                                                                                         java.lang.String outputFieldName)
      • aggregateFields

        public <CombineInputT,​AccumT,​CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFields​(FieldAccessDescriptor fieldsToAggregate,
                                                                                                                     Combine.CombineFn<CombineInputT,​AccumT,​CombineOutputT> fn,
                                                                                                                     java.lang.String outputFieldName)
        Build up an aggregation function over the input elements.

        This method specifies an aggregation over multiple fields of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

        Field types in the output schema will be inferred from the provided combine function. Sometimes the field type cannot be inferred due to Java's type erasure. In that case, use the overload that allows setting the output field type explicitly.

      • aggregateFields

        public <CombineInputT,​AccumT,​CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFields​(java.util.List<java.lang.String> inputFieldNames,
                                                                                                                     Combine.CombineFn<CombineInputT,​AccumT,​CombineOutputT> fn,
                                                                                                                     Schema.Field outputField)
        Build up an aggregation function over the input elements.

        This method specifies an aggregation over multiple fields of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

      • aggregateFieldsById

        public <CombineInputT,​AccumT,​CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFieldsById​(java.util.List<java.lang.Integer> inputFieldIds,
                                                                                                                         Combine.CombineFn<CombineInputT,​AccumT,​CombineOutputT> fn,
                                                                                                                         Schema.Field outputField)
        Description copied from class: Group.AggregateCombiner
        Build up an aggregation function over the input elements by field id.

        This method specifies an aggregation over multiple fields of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

        Field types in the output schema will be inferred from the provided combine function. Sometimes the field type cannot be inferred due to Java's type erasure. In that case, use the overload that allows setting the output field type explicitly.

        Specified by:
        aggregateFieldsById in class Group.AggregateCombiner<InputT>
      • aggregateFields

        public <CombineInputT,​AccumT,​CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFields​(FieldAccessDescriptor fieldsToAggregate,
                                                                                                                     Combine.CombineFn<CombineInputT,​AccumT,​CombineOutputT> fn,
                                                                                                                     Schema.Field outputField)
        Build up an aggregation function over the input elements.

        This method specifies an aggregation over multiple fields of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

      • expand

        public PCollection<Row> expand​(PCollection<InputT> input)
        Description copied from class: PTransform
        Override this method to specify how this PTransform should be expanded on the given InputT.

        NOTE: This method should not be called directly. Instead apply the PTransform should be applied to the InputT using the apply method.

        Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).

        Specified by:
        expand in class PTransform<PCollection<InputT>,​PCollection<Row>>