hive

Type Members

trait AlignmentStrategy extends AnyRef
trait CommitCallback extends AnyRef
trait EvolutionStrategy extends AnyRef

A strategy that determines how a hive metastore schema is evolved for a given target schema.
For example, a strategy may alter the hive table to add any missing columns, abort the write by throwing an exception, or leave the schema as is and drop those columns from the input rows.
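The three behaviours described above can be sketched as follows. This is a minimal illustration and not eel's implementation: the object `EvolutionSketch`, its type aliases and its method names are all hypothetical, and a schema is modelled as a plain list of column names.

```scala
// Hypothetical, simplified model: a schema is an ordered list of column names
// and a row is a map from column name to value. None of this is eel's API.
object EvolutionSketch {
  type Schema = Vector[String]
  type Row = Map[String, Any]

  // Columns present in the target schema but absent from the metastore schema.
  def missingColumns(metastore: Schema, target: Schema): Vector[String] =
    target.filterNot(c => metastore.contains(c))

  // Behaviour 1: evolve the metastore schema by appending the missing columns.
  def addMissing(metastore: Schema, target: Schema): Schema =
    metastore ++ missingColumns(metastore, target)

  // Behaviour 2: abort the write by throwing if any column is missing.
  def abortOnMismatch(metastore: Schema, target: Schema): Schema = {
    val missing = missingColumns(metastore, target)
    if (missing.nonEmpty)
      throw new IllegalStateException(s"Columns not in metastore: ${missing.mkString(", ")}")
    metastore
  }

  // Behaviour 3: leave the schema as is and drop unknown columns from a row.
  def dropUnknown(metastore: Schema, row: Row): Row =
    row.filter { case (name, _) => metastore.contains(name) }
}
```

Modelling each behaviour as a function of the metastore schema and the target schema keeps the choice of policy separate from the write itself, which is presumably why the sink takes the strategy as a parameter.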
trait FileListener extends AnyRef
trait FilenameStrategy extends AnyRef

Strategy responsible for the filenames created by eel when writing out data.
class HiveContext extends AnyRef
case class HiveDatabase(dbName: String)(implicit fs: FileSystem, client: IMetaStoreClient) extends Product with Serializable
case class HiveDatasetUri(db: String, table: String) extends Product with Serializable
trait HiveDialect extends Logging
class HiveFilePublisher extends Publisher[Seq[Row]] with Using
trait HiveFormat extends AnyRef
class HiveOps extends Logging
trait HiveOutputStream extends AnyRef
class HivePartitionPublisher extends Publisher[Seq[Row]] with Logging

A Hive Part that can read values from the metastore, rather than reading values from files. This can be used only when the requested fields are all partition keys.
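The condition above (all requested fields are partition keys) can be illustrated with a small check. This is a sketch under assumptions: `PartitionOnlySketch` and both of its methods are hypothetical names, partitions are assumed to follow the usual Hive `key=value/key=value` layout, and an empty projection is treated as "all fields" and therefore not servable from the metastore alone.

```scala
// Hypothetical helper, not part of eel: decides whether a read can be answered
// purely from partition metadata, and extracts the values if so.
object PartitionOnlySketch {

  // True when every requested field is a partition key. An empty projection is
  // treated as "all fields" (an assumption of this sketch), so it returns false.
  def canServeFromMetastore(requested: Seq[String], partitionKeys: Set[String]): Boolean =
    requested.nonEmpty && requested.forall(partitionKeys.contains)

  // Turns a partition spec such as "year=2017/month=03" into field values.
  def partitionValues(spec: String): Map[String, String] =
    spec.split('/').iterator.map { entry =>
      val Array(key, value) = entry.split("=", 2)
      key -> value
    }.toMap
}
```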
class HivePartitionScanner extends Logging
case class HiveSink(dbName: String, tableName: String, permission: Option[FsPermission] = None, inheritPermissions: Option[Boolean] = None, principal: Option[String] = None, format: Option[HiveFormat] = None, partitionFields: Seq[String] = Nil, partitionStrategy: PartitionStrategy = new DynamicPartitionStrategy, filenameStrategy: FilenameStrategy = DefaultFilenameStrategy, stagingStrategy: StagingStrategy = DefaultStagingStrategy, evolutionStrategy: EvolutionStrategy = AdditionEvolutionStrategy, alignStrategy: AlignmentStrategy = RowPaddingAlignmentStrategy, outputSchemaStrategy: OutputSchemaStrategy = SkipPartitionsOutputSchemaStrategy, keytabPath: Option[Path] = None, fileListener: FileListener = FileListener.noop, createTable: Boolean = false, callbacks: Seq[CommitCallback] = Nil, roundingMode: RoundingMode = RoundingMode.UNNECESSARY, metadata: Map[String, String] = Map.empty)(implicit fs: FileSystem, client: IMetaStoreClient) extends Sink with Logging with Product with Serializable
class HiveSinkWriter extends SinkWriter with Logging
case class HiveSource(dbName: String, tableName: String, projection: List[String] = Nil, predicate: Option[Predicate] = None, partitionConstraints: Seq[PartitionConstraint] = Nil, principal: Option[String] = None, keytabPath: Option[Path] = None)(implicit fs: FileSystem, client: IMetaStoreClient) extends Source with Logging with Using with Product with Serializable

projection
sets which fields are required by the caller.
predicate
optional predicate which will filter rows at the read level
trait HiveStats extends AnyRef
case class HiveTable(dbName: String, tableName: String)(implicit fs: FileSystem, conf: Configuration, client: IMetaStoreClient) extends Logging with Product with Serializable
trait OutputSchemaStrategy extends AnyRef

Accepts a metastore schema and returns the schema that should actually be persisted to disk. This allows some data to be left out of the written files. For example, in parquet files it is common to skip writing out partition data, since that data is already present in the metastore.
class ParquetHiveStats extends HiveStats with Logging
case class PartitionColumn(name: String, dataType: DataType = StringType) extends Product with Serializable
trait StagingStrategy extends AnyRef
case class TableSpec(tableName: String, tableType: TableType, location: String, cols: Seq[FieldSchema], numBuckets: Int, bucketNames: List[String], params: Map[String, String], inputFormat: String, outputFormat: String, serde: String, retention: Int, createTime: Long, lastAccessTime: Long, owner: String) extends Product with Serializable

Value Members

object AdditionEvolutionStrategy extends EvolutionStrategy with Logging

The AdditionEvolutionStrategy will add any missing fields to the schema in the hive metastore. It will not check that any existing fields are of the same type as in the metastore. The new fields cannot be added as partition fields.
object DefaultFilenameStrategy extends FilenameStrategy
object DefaultStagingStrategy extends StagingStrategy
object FileListener
object HiveDDL
object HiveDatasetUri extends Serializable
object HiveDialect extends Logging
object HiveFileScanner extends Logging
object HiveFormat
object HiveSchemaFns extends Logging
object HiveSink extends Serializable
object HiveTableFilesFn extends Logging

Locates files for a given table.
Connects to the hive metastore to get the list of partitions (or, if the table has no partitions, just the table root) and scans those directories.
Returns a Map of each partition to the files in that partition.
If partition constraints are specified, the partitions are filtered by those constraints.
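The shape of the result can be sketched with an in-memory stand-in for the metastore and filesystem. Everything here is hypothetical (`TableFilesSketch`, `Partition`, `Constraint`, `filesByPartition`), and the sketch assumes that a partition is kept only when it satisfies every constraint, which the description above does not state explicitly.

```scala
// Hypothetical in-memory model, not eel's API: a partition is a map of
// key -> value, and a constraint is a predicate over a partition.
object TableFilesSketch {
  type Partition = Map[String, String]
  type Constraint = Partition => Boolean

  // Keeps the partitions that satisfy all constraints (an assumption of this
  // sketch) and returns each one with its files. No constraints keeps them all.
  def filesByPartition(
      all: Map[Partition, Seq[String]],
      constraints: Seq[Constraint]
  ): Map[Partition, Seq[String]] =
    all.filter { case (partition, _) => constraints.forall(c => c(partition)) }
}
```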
object RowPaddingAlignmentStrategy extends AlignmentStrategy

An AlignmentStrategy that will use default values, or nulls, to pad out rows to match the target schema.
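Row padding can be sketched as below. This is an illustration only: `RowPaddingSketch`, `Field` and `pad` are hypothetical names, and a row is modelled as a map from field name to value rather than eel's own row type.

```scala
// Hypothetical model, not eel's API: pads a row out to a target schema.
object RowPaddingSketch {
  // A target field with an optional default value.
  final case class Field(name: String, default: Option[Any] = None)

  // Returns values in target-schema order. A field missing from the row takes
  // its default if one is defined, otherwise null.
  def pad(row: Map[String, Any], target: Seq[Field]): Vector[Any] =
    target.map(f => row.getOrElse(f.name, f.default.getOrElse(null))).toVector
}
```

For instance, padding `Map("a" -> 1)` against fields `a`, `b` (default `0`) and `c` (no default) yields `Vector(1, 0, null)`.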
object SkipPartitionsOutputSchemaStrategy extends OutputSchemaStrategy

This strategy will drop partition columns from the schema so that they are not written out to the files.
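As a rough illustration of dropping partition columns from the schema that is written to disk, consider the sketch below. `SkipPartitionsSketch`, `Column` and `outputSchema` are hypothetical names, and the metastore schema is modelled as a flat list of columns flagged as partition or data columns.

```scala
// Hypothetical model, not eel's API.
object SkipPartitionsSketch {
  final case class Column(name: String, isPartition: Boolean)

  // The schema to persist is the metastore schema minus partition columns,
  // since their values are already recorded in the metastore.
  def outputSchema(metastore: Seq[Column]): Seq[Column] =
    metastore.filterNot(_.isPartition)
}
```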
package dialect
package partition
