shark.tachyon

TachyonStorageClient

class TachyonStorageClient extends OffHeapStorageClient with LogHelper

Enables caching of Shark tables in Tachyon in their native column-oriented format.

The directory structure for a given table in Tachyon looks like:

Data: warehouse/database.table/_defaultkey/insert_#/col_#/part_#
Metadata: warehouse/database.table/_defaultkey/insert_#/.meta

Note that "_defaultkey" is the name of the Hive partition used for a non-partitioned table. For a partitioned table, it is replaced by each partition's "hivePartitionKey".
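
The layout described above can be sketched as a small path builder. This is a minimal illustration, not Shark's implementation: the object `TachyonLayout`, its method names, and the use of plain integers for the `#` placeholders are assumptions made for the example, as is treating `tableKey` as the `database.table` string. Only the path shape and the `_defaultkey` fallback come from the description above.

```scala
// Hypothetical helper illustrating the directory layout; not part of Shark.
object TachyonLayout {
  // Partition name used when the table is not partitioned.
  val DefaultPartitionKey = "_defaultkey"

  // Directory holding every insert for one Hive partition of a table.
  def partitionDir(warehousePath: String,
                   tableKey: String,
                   hivePartitionKey: Option[String]): String = {
    val partition = hivePartitionKey.getOrElse(DefaultPartitionKey)
    s"$warehousePath/$tableKey/$partition"
  }

  // Each insert is written to its own directory under the partition.
  def insertDir(warehousePath: String,
                tableKey: String,
                hivePartitionKey: Option[String],
                insertId: Int): String =
    s"${partitionDir(warehousePath, tableKey, hivePartitionKey)}/insert_$insertId"

  // Data files are grouped by column, then by part.
  def dataFile(insertDir: String, col: Int, part: Int): String =
    s"$insertDir/col_$col/part_$part"

  // One metadata file per insert directory.
  def metadataFile(insertDir: String): String =
    s"$insertDir/.meta"
}
```

Under these assumptions, passing `None` as the partition key yields a path through `_defaultkey`, while passing `Some(key)` substitutes that key in the same position.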

Linear Supertypes
  1. LogHelper
  2. Logging
  3. OffHeapStorageClient
  4. AnyRef
  5. Any

Instance Constructors

  1. new TachyonStorageClient(master: String, warehousePath: String)

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  7. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  8. def createTablePartition(tableKey: String, hivePartitionKeyOpt: Option[String]): Boolean

  9. def createTablePartitionWriter(tableKey: String, hivePartitionKey: Option[String], numColumns: Int): TachyonOffHeapTableWriter

  10. def dropTable(tableKey: String): Boolean

  11. def dropTablePartition(tableKey: String, hivePartitionKey: Option[String]): Boolean

  12. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  13. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  14. def errStream(): PrintStream

    Definition Classes
    LogHelper
  15. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  16. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  17. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  18. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  19. def isTraceEnabled(): Boolean

    Attributes
    protected
    Definition Classes
    Logging
  20. def log: Logger

    Attributes
    protected
    Definition Classes
    Logging
  21. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  22. def logDebug(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  23. def logError(msg: String, exception: Throwable): Unit

    Definition Classes
    LogHelper
  24. def logError(msg: String, detail: String): Unit

    Definition Classes
    LogHelper
  25. def logError(msg: ⇒ String): Unit

    Definition Classes
    LogHelper → Logging
  26. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  27. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  28. def logInfo(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  29. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  30. def logTrace(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  31. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  32. def logWarning(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  33. val master: String

  34. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  35. final def notify(): Unit

    Definition Classes
    AnyRef
  36. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  37. def outStream(): PrintStream

    Definition Classes
    LogHelper
  38. def readTablePartition(tableKey: String, hivePartitionKey: Option[String], columnsUsed: BitSet, pruningFn: (RDD[TablePartition], Map[Int, TablePartitionStats]) ⇒ RDD[_]): RDD[_]

    Reads a particular Hive partition (or whole table if non-partitioned) into a single RDD. Since each insert is written to its own directory, we need to reconstruct this RDD by reading across all of these insert directories. We thus column- and row-prune our data before producing the union to avoid data transfer.

    columnsUsed

    Indicates which columns are needed, to avoid loading extra data.

    pruningFn

    Used for pruning rows.

    Definition Classes
    TachyonStorageClient → OffHeapStorageClient
  39. def renameTable(oldTableKey: String, newTableKey: String): Boolean

  40. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  41. def tableExists(tableKey: String): Boolean

  42. def tablePartitionExists(tableKey: String, hivePartitionKey: Option[String]): Boolean

  43. val tfs: TachyonFS

  44. def toString(): String

    Definition Classes
    AnyRef → Any
  45. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  46. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  47. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  48. val warehousePath: String
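
The read strategy described for `readTablePartition` (prune each insert directory by column and by row, then take the union) can be illustrated without Spark or Tachyon. The sketch below uses plain Scala collections. `Insert`, `prune`, `readPruned`, and the row predicate are hypothetical stand-ins for the per-insert RDDs, `columnsUsed`, and `pruningFn`; they do not reflect the actual types or implementation, and the `BitSet` used here is Scala's immutable one, which may differ from the one in the real signature.

```scala
import scala.collection.immutable.BitSet

// Illustration only: models "prune before union" with in-memory collections.
object PruneThenUnion {
  // One insert directory; each row is a vector of column values.
  final case class Insert(rows: Vector[Vector[Int]])

  // Drop rows that fail `keepRow`, then keep only the columns set in `columnsUsed`.
  def prune(insert: Insert,
            columnsUsed: BitSet,
            keepRow: Vector[Int] => Boolean): Vector[Vector[Int]] =
    insert.rows.filter(keepRow).map { row =>
      row.zipWithIndex.collect { case (value, idx) if columnsUsed(idx) => value }
    }

  // Prune every insert first, then concatenate, so the union only
  // ever contains the rows and columns that were asked for.
  def readPruned(inserts: Seq[Insert],
                 columnsUsed: BitSet,
                 keepRow: Vector[Int] => Boolean): Vector[Vector[Int]] =
    inserts.toVector.flatMap(insert => prune(insert, columnsUsed, keepRow))
}
```

The point of the ordering is that the union step never handles columns or rows that the query does not need, which is the data-transfer saving the method description refers to.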
