Class

io.eels.component.hive

HiveFilePart

Related Doc: package hive


class HiveFilePart extends Part

Linear Supertypes
Part, AnyRef, Any

Instance Constructors

  1. new HiveFilePart(dialect: HiveDialect, file: LocatedFileStatus, metastoreSchema: StructType, projectionSchema: StructType, predicate: Option[Predicate], partitions: List[PartitionPart])(implicit fs: FileSystem, conf: Configuration)


    metastoreSchema

    The schema as recorded in the metastore, used to match up with the raw data in dialects where the file itself carries no schema. For example, with a CSV format in Hive, the metastoreSchema is required in order to know what each column represents. The projection schema cannot be used for this because its fields might be in a different order.

    projectionSchema

    The schema actually required. It is optional, in which case the metastoreSchema is used. The projectionSchema is pushed down to the dialects, rather than being applied afterwards, because some file formats (e.g. Parquet) can read data more efficiently when they know which fields can be omitted.

    predicate

    A predicate for filtering rows. If the predicate can operate on the files, it is pushed down to the Parquet reader for efficiency.

    partitions

    A list of partition key-values for this file. These are required to repopulate the partition values when creating the final Row.
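
The interplay of these four parameters can be illustrated with a minimal, self-contained sketch. It does not use the eel-sdk API: `Field`, `Schema`, `PartitionValue`, `HiveFilePartSketch` and `readRow` are hypothetical stand-ins invented for this example, and it assumes that the metastore schema lists only the columns stored in the file, in file order. It shows a schemaless CSV line being named via the metastore schema, filtered by an optional predicate, reordered to the projection schema, and completed with partition values.

```scala
// Hypothetical stand-ins for illustration only; these are NOT the eel-sdk types.
final case class Field(name: String)
final case class Schema(fields: List[Field])
final case class PartitionValue(key: String, value: String)

object HiveFilePartSketch {

  /** Builds one output row from a raw CSV line, or None if the predicate rejects it.
    *
    * Assumption: metastoreSchema describes the columns stored in the file, in file order,
    * and partition keys are not stored in the file.
    */
  def readRow(
      line: String,
      metastoreSchema: Schema,
      projectionSchema: Schema,
      partitions: List[PartitionValue],
      predicate: Option[Map[String, String] => Boolean]
  ): Option[List[String]] = {
    // CSV carries no schema, so column names come from the metastore schema by position.
    val raw = line.split(",", -1).toList
    val fromFile = metastoreSchema.fields.map(_.name).zip(raw).toMap

    // Partition values live in the directory path, not the file, so they are added back here.
    val named = fromFile ++ partitions.map(p => p.key -> p.value)

    // Apply the optional predicate, then emit values in projection order.
    // named(...) throws if the projection asks for a column that is unknown.
    if (predicate.forall(p => p(named))) Some(projectionSchema.fields.map(f => named(f.name)))
    else None
  }
}
```

With a metastore schema of `id, name, age`, a projection of `age, id, country` and a partition `country=uk`, the line `1,sam,30` yields `Some(List("30", "1", "uk"))`. Using the projection schema to name the raw columns would have mislabelled `1` as `age`, which is why the metastore schema is needed for positional formats.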

Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  5. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  6. val dialect: HiveDialect

  7. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  8. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  9. val file: LocatedFileStatus

  10. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  11. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  12. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  13. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  14. def iterator(): CloseableIterator[Seq[Row]]

    Definition Classes
    HiveFilePart → Part
  15. val metastoreSchema: StructType


    The schema as recorded in the metastore, used to match up with the raw data in dialects where the file itself carries no schema. For example, with a CSV format in Hive, the metastoreSchema is required in order to know what each column represents. The projection schema cannot be used for this because its fields might be in a different order.

  16. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  17. final def notify(): Unit

    Definition Classes
    AnyRef
  18. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  19. val partitions: List[PartitionPart]


    A list of partition key-values for this file. These are required to repopulate the partition values when creating the final Row.

  20. val predicate: Option[Predicate]


    A predicate for filtering rows. If the predicate can operate on the files, it is pushed down to the Parquet reader for efficiency.

  21. val projectionSchema: StructType


    The schema actually required. It is optional, in which case the metastoreSchema is used. The projectionSchema is pushed down to the dialects, rather than being applied afterwards, because some file formats (e.g. Parquet) can read data more efficiently when they know which fields can be omitted.

  22. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  23. def toString(): String

    Definition Classes
    AnyRef → Any
  24. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  25. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  26. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
