Package

ai.chronon

spark

Permalink

package spark

Visibility
  1. Public
  2. All

Type Members

  1. class Analyzer extends AnyRef

    Permalink
  2. class Args extends ScallopConf

    Permalink
  3. abstract class BaseJoin extends AnyRef

    Permalink
  4. case class BootstrapInfo(joinConf: api.Join, joinParts: Seq[JoinPartMetadata], externalParts: Seq[ExternalPartMetadata], derivations: Array[StructField], hashToSchema: Map[String, Array[StructField]]) extends Product with Serializable

    Permalink
  5. class ChrononKryoRegistrator extends KryoRegistrator

    Permalink
  6. class CpcSketchKryoSerializer extends Serializer[CpcSketch]

    Permalink
  7. sealed trait DataRange extends AnyRef

    Permalink
  8. class DummyExtensions extends (SparkSessionExtensions) ⇒ Unit

    Permalink
  9. case class ExternalPartMetadata(externalPart: ExternalPart, keySchema: Array[StructField], valueSchema: Array[StructField]) extends Product with Serializable

    Permalink
  10. class GroupBy extends Serializable

    Permalink
  11. class GroupByUpload extends Serializable

    Permalink
  12. sealed case class IncompatibleSchemaException(inconsistencies: Seq[(String, DataType, DataType)]) extends Exception with Product with Serializable

    Permalink
  13. class ItemSketchSerializable extends Serializable

    Permalink
  14. class ItemsSketchKryoSerializer extends Serializer[ItemSketchSerializable]

    Permalink
  15. class Join extends BaseJoin

    Permalink
  16. case class JoinPartMetadata(joinPart: JoinPart, keySchema: Array[StructField], valueSchema: Array[StructField]) extends Product with Serializable

    Permalink
  17. case class KeyWithHash(data: Array[Any], hash: Array[Byte], hashInt: Int) extends Serializable with Product

    Permalink
  18. case class KvRdd(data: RDD[(Array[Any], Array[Any])], keySchema: StructType, valueSchema: StructType)(implicit sparkSession: SparkSession) extends Product with Serializable

    Permalink
  19. class LabelJoin extends AnyRef

    Permalink
  20. class LogFlattenerJob extends Serializable

    Permalink

    Purpose of LogFlattenerJob is to unpack serialized Avro data from online requests and flatten each field (both keys and values) into individual columns and save to an offline "flattened" log table.

    Purpose of LogFlattenerJob is to unpack serialized Avro data from online requests and flatten each field (both keys and values) into individual columns and save to an offline "flattened" log table.

    Steps: 1. determine unfilled range and pull raw logs from partitioned log table 2. fetch joinCodecs for all unique schema_hash present in the logs 3. build a merged schema from all schema versions, which will be used as output schema 4. unpack each row and adhere to the output schema 5. save the schema info in the flattened log table properties (cumulatively)

  21. case class LoggingSchema(keyCodec: AvroCodec, valueCodec: AvroCodec) extends Product with Serializable

    Permalink
  22. case class PartitionRange(start: String, end: String) extends DataRange with Ordered[PartitionRange] with Product with Serializable

    Permalink
  23. class StagingQuery extends AnyRef

    Permalink
  24. case class TableUtils(sparkSession: SparkSession) extends Product with Serializable

    Permalink
  25. case class TimeRange(start: Long, end: Long) extends DataRange with Product with Serializable

    Permalink

Value Members

  1. object BootstrapInfo extends Serializable

    Permalink
  2. object Comparison

    Permalink
  3. object Driver

    Permalink
  4. object Extensions

    Permalink
  5. object FastHashing

    Permalink
  6. object GenericRowHandler

    Permalink
  7. object GroupBy extends Serializable

    Permalink
  8. object GroupByUpload extends Serializable

    Permalink
  9. object JoinUtils

    Permalink
  10. object LocalDataLoader

    Permalink
  11. object LogFlattenerJob extends Serializable

    Permalink
  12. object LogUtils

    Permalink
  13. object LoggingSchema extends Serializable

    Permalink
  14. object MetadataExporter

    Permalink
  15. object SparkSessionBuilder

    Permalink
  16. object StagingQuery

    Permalink
  17. package stats

    Permalink
  18. package streaming

    Permalink

Ungrouped