io.smartdatalake.workflow.action.sparktransformer
Function to be implemented to define the transformation between an input and output DataFrame (1:1)
Optional function to implement validations in prepare phase.
Optional function to define the transformation of input to output partition values.
Optional function to define the transformation of input to output partition values. For example this enables to implement aggregations where multiple input partitions are combined into one output partition. Note that the default value is input = output partition values, which should be correct for most use cases.
id of the action which executes this transformation. This is mainly used to prefix error messages.
partition values to transform
Map of input to output partition values. This allows to map partition values forward and backward, which is needed in execution modes. Return None if mapping is 1:1.
Interface to implement Spark-DataFrame transformers working with one input and one output (1:1)