qscript

Type Members

sealed abstract class Binary[T[_[_]], A] extends MapFunc[T, A]
final case class BucketField[T[_[_]], A](src: A, value: FreeMap[T], name: FreeMap[T]) extends ProjectBucket[T, A] with Product with Serializable
final case class BucketIndex[T[_[_]], A](src: A, value: FreeMap[T], index: FreeMap[T]) extends ProjectBucket[T, A] with Product with Serializable
trait Bucketable[F[_]] extends AnyRef
type Bucketing[T[_[_]], A] = Coproduct[[β]QScriptBucket[T, β], [β]ProjectBucket[T, β], A]
sealed abstract class DeadEnd extends AnyRef
trait Diggable[F[_]] extends Serializable
final case class Drop[T[_[_]], A](src: A, from: FreeQS[T], count: FreeQS[T]) extends QScriptCore[T, A] with Product with Serializable
trait ElideBuckets[F[_]] extends Serializable
trait ElideBucketsInstances extends ElideBucketsInstances0
trait ElideBucketsInstances0 extends AnyRef
final case class EquiJoin[T[_[_]], A](src: A, lBranch: FreeQS[T], rBranch: FreeQS[T], lKey: FreeMap[T], rKey: FreeMap[T], f: JoinType, combine: JoinFunc[T]) extends Product with Serializable

This is an optional component of QScript that can be used instead of ThetaJoin. It’s easier to implement, but more restricted: where ThetaJoin has an arbitrary predicate to determine whether a pair of records should be combined, EquiJoin has an expression on each side, and the two results are compared with simple equality.
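The contrast between the two join styles can be sketched on plain Scala collections. This is a simplified illustration, not the QScript implementation: datasets are modeled as List, the names JoinSketch, thetaJoin and equiJoin are hypothetical, and only inner-join behavior is shown (JoinType is ignored).

```scala
// Hypothetical, simplified model: datasets are plain Lists, not QScript nodes.
object JoinSketch {
  // Theta-style join: an arbitrary predicate decides whether a pair is combined.
  def thetaJoin[L, R, O](ls: List[L], rs: List[R])(on: (L, R) => Boolean)(combine: (L, R) => O): List[O] =
    for { l <- ls; r <- rs; if on(l, r) } yield combine(l, r)

  // Equi-style join: one key expression per side, compared with simple equality.
  // Because only equality is involved, the right side can be indexed by key.
  def equiJoin[L, R, K, O](ls: List[L], rs: List[R])(lKey: L => K, rKey: R => K)(combine: (L, R) => O): List[O] = {
    val index: Map[K, List[R]] = rs.groupBy(rKey)
    ls.flatMap(l => index.getOrElse(lKey(l), Nil).map(r => combine(l, r)))
  }
}
```

The equi-join version never evaluates a user-supplied predicate on a pair, which is one way to see why it is the easier form for a backend without true join support.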
type EquiQScript[T[_[_]], A] = Coproduct[[β]EquiJoin[T, β], [β]Coproduct[Read, [β]Coproduct[[β]QScriptCore[T, β], [β]Coproduct[[β]Const[DeadEnd, β], [β]SourcedPathable[T, β], β], β], β], A]

A variant with a simpler join type. A backend can choose to operate on this structure by applying the equiJoinsOnly transformation. Backends without true join support will likely find it easier to work with this than to handle full ThetaJoins.
final case class Filter[T[_[_]], A](src: A, f: FreeMap[T]) extends QScriptCore[T, A] with Product with Serializable

Eliminates some values from a dataset, based on the result of FilterFunc.
type FreeMap[T[_[_]]] = Free[[β]MapFunc[T, β], Unit]
type FreeQS[T[_[_]]] = Free[[β]Coproduct[[β]QScriptBucket[T, β], [β]Coproduct[[β]ProjectBucket[T, β], [β]Coproduct[[β]ThetaJoin[T, β], [β]Coproduct[[β]QScriptCore[T, β], [β]Coproduct[[β]Const[DeadEnd, β], [β]SourcedPathable[T, β], β], β], β], β], β], Unit]
type FreeUnit[F[_]] = Free[F, Unit]
final case class GroupBy[T[_[_]], A](src: A, values: FreeMap[T], bucket: FreeMap[T]) extends QScriptBucket[T, A] with Product with Serializable
trait Helpers[T[_[_]]] extends AnyRef
type JoinFunc[T[_[_]]] = Free[[β]MapFunc[T, β], JoinSide]
sealed trait JoinSide extends AnyRef
sealed abstract class JoinType extends AnyRef
final case class LeftShift[T[_[_]], A](src: A, struct: FreeMap[T], repair: JoinFunc[T]) extends SourcedPathable[T, A] with Product with Serializable

Flattens nested structure, converting each value into a data set, which are then unioned.
struct is an expression that evaluates to an array or object, which is then “exploded” into multiple values. repair is applied across the new set, integrating the exploded values into the original set.
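The roles of struct and repair can be sketched with ordinary functions over a List. This is a minimal illustration under stated assumptions: LeftShiftSketch and leftShift are hypothetical names, struct is modeled as a function returning a List, and repair as a two-argument function over the original row and one exploded element.

```scala
// Hypothetical, simplified model of the LeftShift idea on plain Lists.
object LeftShiftSketch {
  // struct picks the nested collection out of each row; repair rebuilds a row
  // from the original value and one exploded element. The per-row results are
  // concatenated, which plays the role of the union.
  def leftShift[A, B, C](src: List[A])(struct: A => List[B])(repair: (A, B) => C): List[C] =
    src.flatMap(a => struct(a).map(b => repair(a, b)))
}
```

In this sketch a row whose struct is empty contributes nothing to the output; that is a property of the sketch, not a statement about how a given backend treats such rows.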
final case class LeftShiftBucket[T[_[_]], A](src: A, struct: FreeMap[T], repair: JoinFunc[T], bucketShift: FreeMap[T]) extends QScriptBucket[T, A] with Product with Serializable
final case class Map[T[_[_]], A](src: A, f: FreeMap[T]) extends QScriptCore[T, A] with Product with Serializable

A data-level transformation.
sealed abstract class MapFunc[T[_[_]], A] extends AnyRef
trait Mergeable[A] extends Serializable
trait Normalizable[F[_]] extends Serializable
trait NormalizableInstances extends NormalizableInstances0
trait NormalizableInstances0 extends AnyRef
final case class Nullary[T[_[_]], A](ejson: T[EJson]) extends MapFunc[T, A] with Product with Serializable
class Optimize[T[_[_]]] extends Helpers[T]
type Pathable[T[_[_]], A] = Coproduct[[β]Const[DeadEnd, β], [β]SourcedPathable[T, β], A]
sealed abstract class ProjectBucket[T[_[_]], A] extends AnyRef

Projections are technically dimensional (i.e., QScript) operations. However, to a filesystem, they are merely Map operations. So, we use these components while building the QScript plan and they are then used in static path processing, but they are replaced with equivalent MapFuncs before being processed by the filesystem.
type QSState[A] = IndexedStateT[[β]\/[PlannerError, β], NameGen, NameGen, A]
type QScript[T[_[_]], A] = Coproduct[[β]ThetaJoin[T, β], [β]Coproduct[Read, [β]Coproduct[[β]QScriptCore[T, β], [β]Coproduct[[β]Const[DeadEnd, β], [β]SourcedPathable[T, β], β], β], β], A]

This is the primary form seen by a backend. It contains reads of files.
sealed abstract class QScriptBucket[T[_[_]], A] extends AnyRef
type QScriptCommon[T[_[_]], A] = Coproduct[Read, [β]Coproduct[[β]QScriptCore[T, β], [β]Coproduct[[β]Const[DeadEnd, β], [β]SourcedPathable[T, β], β], β], A]

These nodes exist in all QScript structures that a backend sees.
sealed abstract class QScriptCore[T[_[_]], A] extends AnyRef
type QScriptInternal[T[_[_]], A] = Coproduct[[β]QScriptBucket[T, β], [β]Coproduct[[β]ProjectBucket[T, β], [β]Coproduct[[β]ThetaJoin[T, β], [β]Coproduct[[β]QScriptCore[T, β], [β]Coproduct[[β]Const[DeadEnd, β], [β]SourcedPathable[T, β], β], β], β], β], A]
type QScriptPrim[T[_[_]], A] = Coproduct[[β]QScriptCore[T, β], [β]Coproduct[[β]Const[DeadEnd, β], [β]SourcedPathable[T, β], β], A]

These are the operations included in all forms of QScript.
type QScriptProject[T[_[_]], A] = Coproduct[[β]ProjectBucket[T, β], [β]Coproduct[[β]ThetaJoin[T, β], [β]Coproduct[[β]QScriptCore[T, β], [β]Coproduct[[β]Const[DeadEnd, β], [β]SourcedPathable[T, β], β], β], β], A]
type QScriptPure[T[_[_]], A] = Coproduct[[β]ThetaJoin[T, β], [β]Coproduct[[β]QScriptCore[T, β], [β]Coproduct[[β]Const[DeadEnd, β], [β]SourcedPathable[T, β], β], β], A]

This is the target of the core compiler.
This is the target of the core compiler. Normalization is applied to this structure, and it contains no Read or EquiJoin.
final case class Read[A](src: A, path: AbsFile[Sandboxed]) extends Product with Serializable

A backend-resolved Root, which is now a path.
final case class Reduce[T[_[_]], A, N <: Nat](src: A, bucket: FreeMap[T], reducers: Sized[List[ReduceFunc[FreeMap[T]]], Succ[N]], repair: Free[[β]MapFunc[T, β], Fin[Succ[N]]]) extends QScriptCore[T, A] with Product with Serializable

Performs a reduction over a dataset, with the dataset partitioned by the result of the MapFunc. So, rather than many-to-one, this is many-to-fewer.
bucket partitions the values into buckets based on the result of the expression, reducers applies the provided reduction to each expression, and repair finally turns those reduced expressions into a final value.
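The three stages (bucket, reducers, repair) can be sketched as follows. This is an illustration only: ReduceSketch and reduce are hypothetical names, reducers are modeled as functions over the rows of one bucket, and repair receives the reduced values as a List rather than through the Fin-indexed structure used in the real signature. The result is keyed by bucket purely to make it easy to inspect.

```scala
// Hypothetical, simplified model of the Reduce idea on plain Lists.
object ReduceSketch {
  def reduce[A, K, R, O](src: List[A])(bucket: A => K)(reducers: List[List[A] => R])(repair: List[R] => O): Map[K, O] =
    src
      .groupBy(bucket)                      // partition rows by the bucket expression
      .map { case (k, rows) =>
        val reduced = reducers.map(r => r(rows)) // apply every reduction to the bucket
        k -> repair(reduced)                     // combine the reduced values into one result
      }
}
```

With three input rows falling into two buckets, the result has two entries, which is the many-to-fewer behavior described above.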
sealed trait ReduceFunc[A] extends AnyRef
final case class Sort[T[_[_]], A](src: A, bucket: FreeMap[T], order: List[(FreeMap[T], SortDir)]) extends QScriptCore[T, A] with Product with Serializable

Sorts values within a bucket. This could be represented with LeftShift(Map(_.sort, Reduce(_ :: _, ???))), but backends tend to provide sort directly, so this avoids backends having to recognize the pattern. We could provide an algebra (Sort :+: QScript)#λ => QScript so that a backend without a native sort could eliminate this node.
sealed trait SortDir extends AnyRef
sealed abstract class SourcedPathable[T[_[_]], A] extends AnyRef
final case class SquashBucket[T[_[_]], A](src: A) extends QScriptBucket[T, A] with Product with Serializable
final case class SrcMerge[A, B](src: A, left: B, right: B) extends Product with Serializable
final case class Take[T[_[_]], A](src: A, from: FreeQS[T], count: FreeQS[T]) extends QScriptCore[T, A] with Product with Serializable
sealed abstract class Ternary[T[_[_]], A] extends MapFunc[T, A]
final case class ThetaJoin[T[_[_]], A](src: A, lBranch: FreeQS[T], rBranch: FreeQS[T], on: JoinFunc[T], f: JoinType, combine: JoinFunc[T]) extends Product with Serializable

Applies a function across two datasets, in the cases where the JoinFunc evaluates to true. The branches represent the divergent operations applied to some common src. Each branch references the src exactly once. (Since no constructor has more than one recursive component, it’s guaranteed that neither side references the src _more_ than once.)
This case represents a full θJoin, but we could have an algebra that rewrites it as Filter(_, EquiJoin(...)) to simplify behavior for the backend.
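The idea behind such a rewrite can be sketched on plain Lists: when the join predicate is an equality on keys plus some residual condition, the equality part goes into an equi-join and the residual becomes a filter over its output. This is an assumption-laden illustration, not the algebra itself: ThetaAsFilterSketch and its methods are hypothetical names, only inner joins are modeled, and the split of the predicate into key equality and residual is supplied by hand.

```scala
// Hypothetical, simplified model: a theta join expressed as a filter over an equi-join.
object ThetaAsFilterSketch {
  // Reference semantics: pair every left row with every right row, keep matches.
  def thetaJoin[L, R](ls: List[L], rs: List[R])(on: (L, R) => Boolean): List[(L, R)] =
    for { l <- ls; r <- rs; if on(l, r) } yield (l, r)

  // Equality-only join on one key expression per side.
  def equiJoin[L, R, K](ls: List[L], rs: List[R])(lKey: L => K, rKey: R => K): List[(L, R)] =
    for { l <- ls; r <- rs; if lKey(l) == rKey(r) } yield (l, r)

  // The rewrite: the equality part goes to the join, the rest becomes a filter.
  def filterOverEquiJoin[L, R, K](ls: List[L], rs: List[R])(lKey: L => K, rKey: R => K)(residual: (L, R) => Boolean): List[(L, R)] =
    equiJoin(ls, rs)(lKey, rKey).filter { case (l, r) => residual(l, r) }
}
```

A predicate with no equality component has no useful key to split out, so this sketch covers only predicates of the form "keys equal and residual holds".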
class Transform[T[_[_]], F[_]] extends Helpers[T]
sealed abstract class Unary[T[_[_]], A] extends MapFunc[T, A]
final case class Union[T[_[_]], A](src: A, lBranch: FreeQS[T], rBranch: FreeQS[T]) extends SourcedPathable[T, A] with Product with Serializable

Creates a new dataset, |a|+|b|, containing all of the entries from each of the input sets, without any indication of which set they came from.
This could be handled as another join type, the anti-join (T[EJson] \/ T[EJson] => T[EJson], specifically as _.merge), with the condition being κ(true).
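Both readings of Union can be sketched on plain Lists. This is an illustration only: UnionSketch and its methods are hypothetical names, the standard library Either stands in for \/, and element order is an artifact of using List rather than something the description promises.

```scala
// Hypothetical, simplified model of the Union idea on plain Lists.
object UnionSketch {
  // Direct form: every entry from both inputs, with no record of its origin.
  def union[A](l: List[A], r: List[A]): List[A] = l ++ r

  // Tagged form: mark each entry with its side, then discard the tag with merge.
  def unionViaMerge[A](l: List[A], r: List[A]): List[A] = {
    val lefts: List[Either[A, A]] = l.map(Left(_))
    val rights: List[Either[A, A]] = r.map(Right(_))
    (lefts ++ rights).map(_.merge)
  }
}
```

The result always has |a|+|b| entries, including entries that occur in both inputs, since nothing is deduplicated.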

Value Members

object BucketField extends Serializable
object BucketIndex extends Serializable
object Bucketable
val CommonEJson: :<:[Common, EJson]
object DeadEnd
object Diggable extends Serializable
object Drop extends Serializable
object ElideBuckets extends ElideBucketsInstances with Serializable
object Empty extends DeadEnd with Product with Serializable
object EquiJoin extends Serializable
val ExtEJson: :<:[Extension, EJson]
object Filter extends Serializable
object FullOuter extends JoinType with Product with Serializable
object GroupBy extends Serializable
object Inner extends JoinType with Product with Serializable
object JoinSide
object JoinType
object LeftOuter extends JoinType with Product with Serializable
object LeftShift extends Serializable
object LeftShiftBucket extends Serializable
object LeftSide extends JoinSide with Product with Serializable
object Map extends Serializable
object MapFunc
object MapFuncs
object Mergeable extends Serializable
object Normalizable extends NormalizableInstances with Serializable
object ProjectBucket
object QScriptBucket
object QScriptCore
object Read extends Serializable
object Reduce extends Serializable
object ReduceFunc
object ReduceFuncs
object RightOuter extends JoinType with Product with Serializable
object RightSide extends JoinSide with Product with Serializable
object Root extends DeadEnd with Product with Serializable

The top level of a filesystem. During compilation this represents /, but in the structure a backend sees, it represents the mount point.
object Sort extends Serializable
object SortDir
object SourcedPathable
object SquashBucket extends Serializable
object Take extends Serializable
object ThetaJoin extends Serializable
object Union extends Serializable
def UnitF[T[_[_]]]: Free[[β]MapFunc[T, β], Unit]
def rebase[M[_], A](in: M[A], field: M[A])(implicit arg0: Bind[M]): M[A]
