Class/Object

com.audienceproject.crossbow

DataFrame

Related Docs: object DataFrame | package crossbow

Permalink

class DataFrame extends AnyRef

Linear Supertypes
AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. DataFrame
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Type Members

  1. class GroupedView extends AnyRef

    Permalink
  2. class TypedView[T] extends Iterable[T]

    Permalink

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. def addColumn(expr: Expr): DataFrame

    Permalink

    Add a column to the DataFrame, evaluating to 'expr' at each individual row index.

    Add a column to the DataFrame, evaluating to 'expr' at each individual row index. Use the 'as' method on Expr to give the column a name.

    expr

    the Expr to evaluate as the new column

    returns

    new DataFrame

  5. def apply(columnNames: String*): DataFrame

    Permalink

    Select a subset of columns from this DataFrame.

    Select a subset of columns from this DataFrame.

    columnNames

    names of columns to select

    returns

    new DataFrame

  6. def apply(range: Range): DataFrame

    Permalink

    Retrieve a subset of rows from this DataFrame based on range of indices.

    Retrieve a subset of rows from this DataFrame based on range of indices.

    range

    range of row indices to retrieve

    returns

    new DataFrame

  7. def apply(index: Int): Seq[Any]

    Permalink

    Retrieve a single row by index.

    Retrieve a single row by index.

    index

    row index

    returns

    row as a sequence of values

  8. def as[T](implicit arg0: scala.reflect.api.JavaUniverse.TypeTag[T]): TypedView[T]

    Permalink

    Typecast this DataFrame to a TypedView of the type parameter 'T'.

    Typecast this DataFrame to a TypedView of the type parameter 'T'. All columns in this DataFrame will have to be accounted for in the given type. A DataFrame with multiple columns will have its rows represented as tuples of the individual types of these columns.

    T

    the type of a row in this DataFrame

    returns

    TypedView on the contents of this DataFrame

  9. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  10. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  11. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  12. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  13. def explode(expr: Expr): DataFrame

    Permalink

    Explode this DataFrame on the given expression, flattening its contents and repeating all other cells on the row for every element in the sequence.

    Explode this DataFrame on the given expression, flattening its contents and repeating all other cells on the row for every element in the sequence. The given Expr must evaluate to a list type. Use the 'as' method on Expr to name the flattened column.

    expr

    the Expr to explode on

    returns

    new DataFrame

  14. def filter(expr: Expr): DataFrame

    Permalink

    Retrieve a subset of rows from this DataFrame based on the boolean evaluation of the given expression.

    Retrieve a subset of rows from this DataFrame based on the boolean evaluation of the given expression.

    expr

    the Expr to evaluate, if 'true' the given row will appear in the output

    returns

    new DataFrame

  15. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  16. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  17. def groupBy(keyExprs: Expr*): GroupedView

    Permalink

    Partition this DataFrame into groups, defined by the given set of expressions.

    Partition this DataFrame into groups, defined by the given set of expressions. The evaluation of each of the 'keyExprs' will appear as a column in the output.

    keyExprs

    the list of com.audienceproject.crossbow.expr.Expr that will evaluate to the keys of the groups

    returns

    GroupedView on this DataFrame

  18. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  19. def isEmpty: Boolean

    Permalink
  20. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  21. def iterator: Iterator[Seq[Any]]

    Permalink
  22. def join(other: DataFrame, joinExpr: Expr, joinType: JoinType = JoinType.Inner): DataFrame

    Permalink

    Join this DataFrame on another DataFrame, with the key evaluated by 'joinExpr'.

    Join this DataFrame on another DataFrame, with the key evaluated by 'joinExpr'. The resulting DataFrame will contain all the columns of this DataFrame and the other, where the column names of the other will be prepended with "#".

    other

    DataFrame to join with this one

    joinExpr

    Expr to evaluate as join key

    joinType

    JoinType as one of Inner, FullOuter, LeftOuter or RightOuter

    returns

    new DataFrame

    Note

    'joinExpr' must evaluate to a type with a natural ordering

  23. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  24. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  25. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  26. val numColumns: Int

    Permalink
  27. def printSchema(): Unit

    Permalink
  28. def removeColumns(columnNames: String*): DataFrame

    Permalink

    Remove one or more columns from the DataFrame.

    Remove one or more columns from the DataFrame.

    columnNames

    the names of the columns to remove

    returns

    new DataFrame

  29. def renameColumns(toNewName: (String) ⇒ String): DataFrame

    Permalink

    Rename the columns of this DataFrame by applying the given function.

    Rename the columns of this DataFrame by applying the given function.

    toNewName

    function to map over the names of the columns

    returns

    new DataFrame

  30. def renameColumns(newNames: String*): DataFrame

    Permalink

    Rename the columns of this DataFrame.

    Rename the columns of this DataFrame.

    newNames

    list of new names for each column of this DataFrame

    returns

    new DataFrame

  31. val rowCount: Int

    Permalink
  32. val schema: Schema

    Permalink
  33. def select(exprs: Expr*): DataFrame

    Permalink

    Map over this DataFrame, selecting a set of expressions which will become the columns of a new DataFrame.

    Map over this DataFrame, selecting a set of expressions which will become the columns of a new DataFrame. Use the 'as' method on Expr to give names to the new columns. An expression which is only a column accessor will inherit the accessed column's name (unless it is renamed).

    exprs

    the list of Expr to evaluate as a new DataFrame

    returns

    new DataFrame

  34. def sortBy(expr: Expr, givenOrderings: Order*): DataFrame

    Permalink

    Sort this DataFrame by the evaluation of 'expr'.

    Sort this DataFrame by the evaluation of 'expr'. If a natural ordering exists on this value, it will be used. User-defined orderings on other types or for overwriting the natural orderings with an explicit ordering can be supplied through the 'givenOrderings' argument.

    expr

    the Expr to evaluate as a sort key

    givenOrderings

    explicit Order to use on the sort key, or list of Order if the key is a tuple

    returns

    new DataFrame

  35. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  36. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  37. def union(other: DataFrame): DataFrame

    Permalink

    Union this DataFrame with another DataFrame.

    Union this DataFrame with another DataFrame. Columns will be matched by name, and if matched they must have the same type. Columns that are not present in one or the other DataFrame will contain null-values in the output for the rows of the DataFrame in which the column was not present.

    other

    DataFrame to union with this one

    returns

    new DataFrame

  38. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  39. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  40. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from AnyRef

Inherited from Any

Ungrouped