org.apache.spark.sql.execution.datasources

CatalogFileIndex

class CatalogFileIndex extends FileIndex

A FileIndex for a metastore catalog table.

Linear Supertypes
FileIndex, AnyRef, Any

Instance Constructors

  1. new CatalogFileIndex(sparkSession: SparkSession, table: CatalogTable, sizeInBytes: Long)

    sparkSession

    a SparkSession

    table

    the metadata of the table

    sizeInBytes

    the table's data size in bytes
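The constructor arguments above can be obtained from a running session. The following is a minimal sketch, not the way Spark itself wires this class up: `catalogIndexFor` is a hypothetical helper, and it assumes a Spark version in which `spark.sessionState` is publicly accessible and the size is taken from catalog statistics (falling back to 0 when none are recorded).

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.catalyst.TableIdentifier
import org.apache.spark.sql.execution.datasources.CatalogFileIndex

// Hypothetical helper (not part of Spark): builds an index for a catalog table.
def catalogIndexFor(spark: SparkSession, tableName: String): CatalogFileIndex = {
  // Look up the table's metadata in the session catalog.
  val table = spark.sessionState.catalog.getTableMetadata(TableIdentifier(tableName))
  // Assumption: use the catalog's size statistic, or 0 if the table has none.
  val sizeInBytes = table.stats.map(_.sizeInBytes.toLong).getOrElse(0L)
  new CatalogFileIndex(spark, table, sizeInBytes)
}
```

For example, `catalogIndexFor(spark, "events")` would return an index for a table named `events`, where the table name is illustrative only.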

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  7. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  8. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  9. def equals(o: Any): Boolean

    Definition Classes
    CatalogFileIndex → AnyRef → Any
  10. def filterPartitions(filters: Seq[Expression]): InMemoryFileIndex

    Returns an InMemoryFileIndex for this table restricted to the subset of partitions specified by the given partition-pruning filters.

    filters

    partition-pruning filters
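As an illustration of partition pruning with `filterPartitions`, the sketch below builds a single equality filter from Catalyst expression classes and lists the files that remain. `filesForDay` is a hypothetical helper, and the string partition column `dt` is an assumption about the table, not something this class requires.

```scala
import org.apache.spark.sql.catalyst.expressions.{AttributeReference, EqualTo, Expression, Literal}
import org.apache.spark.sql.execution.datasources.CatalogFileIndex
import org.apache.spark.sql.types.StringType

// Hypothetical helper: files in the partitions where dt equals the given day.
def filesForDay(index: CatalogFileIndex, day: String): Seq[String] = {
  // Assumption: the table has a string partition column named "dt".
  val dt = AttributeReference("dt", StringType)()
  val filters: Seq[Expression] = Seq(EqualTo(dt, Literal(day)))
  // filterPartitions returns an InMemoryFileIndex limited to matching partitions.
  val pruned = index.filterPartitions(filters)
  pruned.inputFiles.toSeq
}
```

The filter refers only to a partition column, which is what the `filters` parameter expects.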

  11. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  12. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  13. val hadoopConf: Configuration

    Attributes
    protected
  14. def hashCode(): Int

    Definition Classes
    CatalogFileIndex → AnyRef → Any
  15. def inputFiles: Array[String]

    Returns the list of files that will be read when scanning this relation. This call may be very expensive for large tables.

    Definition Classes
    CatalogFileIndex → FileIndex
  16. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  17. def listFiles(partitionFilters: Seq[Expression], dataFilters: Seq[Expression]): Seq[PartitionDirectory]

    Returns all valid files grouped into partitions when the data is partitioned. If the data is unpartitioned, this will return a single partition with no partition values.

    partitionFilters

    The filters used to prune which partitions are returned. These filters must only refer to partition columns and this method will only return files where these predicates are guaranteed to evaluate to true. Thus, these filters will not need to be evaluated again on the returned data.

    dataFilters

    Filters that can be applied on non-partitioned columns. The implementation does not need to guarantee these filters are applied, i.e. the execution engine will ensure these filters are still applied on the returned files.

    Definition Classes
    CatalogFileIndex → FileIndex
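To show how the result of `listFiles` is shaped, here is a small sketch that summarizes each returned PartitionDirectory as a pair of partition values and file count. `fileCountsByPartition` is a hypothetical helper. It passes no data filters, since the description above says the execution engine still applies those on the returned files.

```scala
import org.apache.spark.sql.catalyst.expressions.Expression
import org.apache.spark.sql.execution.datasources.FileIndex

// Hypothetical helper: (partition values, number of files) per returned partition.
def fileCountsByPartition(
    index: FileIndex,
    partitionFilters: Seq[Expression]): Seq[(String, Int)] =
  index.listFiles(partitionFilters, Nil).map { dir =>
    // Decode the partition values using the index's partition schema.
    // For unpartitioned data the row is empty, so the label is "".
    val values = dir.values.toSeq(index.partitionSchema).mkString("/")
    (values, dir.files.size)
  }
```

Calling it with an empty filter list, `fileCountsByPartition(index, Nil)`, summarizes every partition the index knows about.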
  18. def metadataOpsTimeNs: Option[Long]

    Returns an optional metadata operation time, in nanoseconds, for listing files.

    File listing happens during query optimization (to obtain proper statistics), but its cost should be accounted for in physical execution (as metrics). To support that, some implementations save the file listing time, and physical execution calls this method to update the metrics.

    Definition Classes
    FileIndex
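A caller that wants to report the listing time might read the value as sketched below. `listingTimeMillis` is a hypothetical helper. Because only some implementations save the listing time, the result is kept as an `Option` rather than defaulting to zero.

```scala
import java.util.concurrent.TimeUnit
import org.apache.spark.sql.execution.datasources.FileIndex

// Hypothetical helper: listing time in milliseconds, if the index recorded one.
def listingTimeMillis(index: FileIndex): Option[Long] =
  index.metadataOpsTimeNs.map(ns => TimeUnit.NANOSECONDS.toMillis(ns))
```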
  19. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  20. final def notify(): Unit

    Definition Classes
    AnyRef
  21. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  22. def partitionSchema: StructType

    Schema of the partitioning columns, or the empty schema if the table is not partitioned.

    Definition Classes
    CatalogFileIndex → FileIndex
  23. def refresh(): Unit

    Refreshes any cached file listings.

    Definition Classes
    CatalogFileIndex → FileIndex
  24. def rootPaths: Seq[Path]

    Returns the list of root input paths from which the catalog will get files. There may be a single root path from which partitions are discovered, or individual partitions may be specified by each path.

    Definition Classes
    CatalogFileIndex → FileIndex
  25. val sizeInBytes: Long

    the table's data size in bytes

    Definition Classes
    CatalogFileIndex → FileIndex
  26. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  27. val table: CatalogTable

    the metadata of the table

  28. def toString(): String

    Definition Classes
    AnyRef → Any
  29. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  30. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  31. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
