Object

za.co.absa.atum.utils

ExecutionPlanUtils

Related Doc: package utils

Permalink

object ExecutionPlanUtils

This object contains utils for traversing execution plan DAG to infer control measurement input/output paths

Linear Supertypes
AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. ExecutionPlanUtils
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  5. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  6. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  7. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  8. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  9. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  10. def getHadoopFullPath(path: Path, hadoopConfiguration: Configuration): Path

    Permalink
  11. def getSourceFileNames(plan: LogicalPlan): List[Path]

    Permalink

    The method returns source file names of a DataSet execution plan by traversing the DAG.

    The method returns source file names of a DataSet execution plan by traversing the DAG. Thanks za.co.absa.spline.core

    plan

    A logical plan of execution

    returns

    The list of input files paths

  12. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  13. def inferInputInfoFilePath(dataset: Dataset[Row], infoFileName: String = Constants.DefaultInfoFileName)(implicit inputFs: FileSystem): Path

    Permalink

    The method returns input control measurements info file name inferred from the source dataset Ensure one and only one input pathname has input control measurements

    The method returns input control measurements info file name inferred from the source dataset Ensure one and only one input pathname has input control measurements

    dataset

    A dataset where input path name(s) will be searched

    infoFileName

    A file name of an info file, e.g. "_INFO"

    returns

    The inferred input control measurements file path of the source dataset

  14. def inferOutputFileName(qe: QueryExecution, hadoopConfiguration: Configuration): Option[Path]

    Permalink

    The method returns output file name inferred from the source dataset

    The method returns output file name inferred from the source dataset

    qe

    A query execution object where output path name will be searched

    returns

    The inferred output control measurements file path of the source dataset

  15. def inferOutputInfoFileDir(qe: QueryExecution): Option[String]

    Permalink

    Based on the qe supplied, output _INFO file path is inference is attempted

    Based on the qe supplied, output _INFO file path is inference is attempted

    qe

    QueryExecution - path inference basis

    returns

    optional inferred _INFO file path

  16. def inferOutputInfoFileName(qe: QueryExecution, infoFileName: String = Constants.DefaultInfoFileName): Option[Path]

    Permalink

    The method returns output control measurements info file name inferred from the source dataset on HDFS

    The method returns output control measurements info file name inferred from the source dataset on HDFS

    qe

    A query execution object where output path name will be searched

    infoFileName

    A file name of an info file, e.g. "_INFO"

    returns

    The inferred output control measurements file path of the source dataset

  17. def inferOutputInfoFileNameOnS3(qe: QueryExecution, infoFileName: String = Constants.DefaultInfoFileName): Option[String]

    Permalink

    The method returns output control measurements info file name inferred from the source dataset on S3

    The method returns output control measurements info file name inferred from the source dataset on S3

    qe

    A query execution object where output path name will be searched

    infoFileName

    A file name of an info file, e.g. "_INFO"

    returns

    The inferred output control measurements file path of the source dataset

  18. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  19. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  20. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  21. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  22. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  23. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  24. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  25. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  26. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from AnyRef

Inherited from Any

Ungrouped