SimpleByteArrayTransformer

Instance Constructors

new SimpleByteArrayTransformer()

Abstract Value Members

abstract def transform(sqlContext: SQLContext, rdd: RDD[Array[Byte]], config: UserTransformConfig, logger: PhaseLogger): DataFrame

Convenience method for transforming org.apache.spark.rdd.RDDs into org.apache.spark.sql.DataFrame> This is called once per batch on the org.apache.spark.rdd.RDD generated by the Extractor and the result is passed to the Loader.
Convenience method for transforming org.apache.spark.rdd.RDDs into org.apache.spark.sql.DataFrame> This is called once per batch on the org.apache.spark.rdd.RDD generated by the Extractor and the result is passed to the Loader.
sqlContext
The SQLContext that is used to run this pipeline. NOTE: If the pipeline is running in MemSQL Streamliner, this is an instance of com.memsql.spark.context.MemSQLContext, which has additional metadata about the MemSQL cluster.
rdd
The org.apache.spark.rdd.RDD for this batch generated by the Extractor.
config
The user defined configuration passed from MemSQL Ops.
logger
A logger instance that is integrated with MemSQL Ops.
returns
A org.apache.spark.sql.DataFrame with the transformed data to be loaded.

Concrete Value Members

final def !=(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def !=(arg0: Any): Boolean

Definition Classes
Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def ==(arg0: Any): Boolean

Definition Classes
Any
final def asInstanceOf[T0]: T0

Definition Classes
Any
final var byteUtils: ByteUtils.type

Definition Classes
ByteArrayTransformer
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( ... )
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
def hashCode(): Int

Definition Classes
AnyRef → Any
def initialize(sqlContext: SQLContext, config: UserTransformConfig, logger: PhaseLogger): Unit

Initialization code for your Transformer.
Initialization code for your Transformer. This is called after instantiation of your Transformer and before Transformer.transform. The default implementation does nothing.
sqlContext
The SQLContext that is used to run this pipeline. NOTE: If the pipeline is running in MemSQL Streamliner, this is an instance of com.memsql.spark.context.MemSQLContext, which has additional metadata about the MemSQL cluster.
config
The user defined configuration passed from MemSQL Ops.
logger
A logger instance that is integrated with MemSQL Ops.
final def initialize(sqlContext: SQLContext, config: PhaseConfig, logger: PhaseLogger): Unit

Initialization code for this Extractor
Initialization code for this Extractor
sqlContext
The SQLContext that is used to run this pipeline. NOTE: If the pipeline is running in MemSQL Streamliner, this is an instance of com.memsql.spark.context.MemSQLContext, which has additional metadata about the MemSQL cluster.
config
The Transformer configuration passed from MemSQL Ops.
logger
A logger instance that is integrated with MemSQL Ops.

Definition Classes
SimpleByteArrayTransformer → Transformer
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def toString(): String

Definition Classes
AnyRef → Any
final def transform(sqlContext: SQLContext, rdd: RDD[Array[Byte]], config: PhaseConfig, logger: PhaseLogger): DataFrame

Transforms the incoming org.apache.spark.rdd.RDD into a org.apache.spark.sql.DataFrame.
Transforms the incoming org.apache.spark.rdd.RDD into a org.apache.spark.sql.DataFrame.
sqlContext
The SQLContext that is used to run this pipeline. NOTE: If the pipeline is running in MemSQL Streamliner, this is an instance of com.memsql.spark.context.MemSQLContext, which has additional metadata about the MemSQL cluster.
rdd
The org.apache.spark.rdd.RDD generated by the Extractor for this batch.
config
The Transformer configuration passed from MemSQL Ops.
logger
A logger instance that is integrated with MemSQL Ops.
returns
A org.apache.spark.sql.DataFrame with the transformed data to be loaded.

Definition Classes
SimpleByteArrayTransformer → Transformer
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )

abstract class SimpleByteArrayTransformer extends ByteArrayTransformer

Instance Constructors

new SimpleByteArrayTransformer()

Abstract Value Members

abstract def transform(sqlContext: SQLContext, rdd: RDD[Array[Byte]], config: UserTransformConfig, logger: PhaseLogger): DataFrame

Concrete Value Members

final def !=(arg0: AnyRef): Boolean

final def !=(arg0: Any): Boolean

final def ##(): Int

final def ==(arg0: AnyRef): Boolean

final def ==(arg0: Any): Boolean

final def asInstanceOf[T0]: T0

final var byteUtils: ByteUtils.type

def clone(): AnyRef

final def eq(arg0: AnyRef): Boolean

def equals(arg0: Any): Boolean

def finalize(): Unit

final def getClass(): Class[_]

def hashCode(): Int

def initialize(sqlContext: SQLContext, config: UserTransformConfig, logger: PhaseLogger): Unit

final def initialize(sqlContext: SQLContext, config: PhaseConfig, logger: PhaseLogger): Unit

final def isInstanceOf[T0]: Boolean

final def ne(arg0: AnyRef): Boolean

final def notify(): Unit

final def notifyAll(): Unit

final def synchronized[T0](arg0: ⇒ T0): T0

def toString(): String

final def transform(sqlContext: SQLContext, rdd: RDD[Array[Byte]], config: PhaseConfig, logger: PhaseLogger): DataFrame

final def wait(): Unit

final def wait(arg0: Long, arg1: Int): Unit

final def wait(arg0: Long): Unit

Inherited from ByteArrayTransformer

Inherited from Transformer[Array[Byte]]

Inherited from Serializable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped