Aggregates (reduces) the array with the provided functions, similar to foldLeft on Scala collections.
type of the transformed values.
zero value.
function to combine the previous result with the element of the array.
the column reference with the applied transformation.
scaladoc link (issue #135)
org.apache.spark.sql.functions.aggregate
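Since the doc compares this to foldLeft on Scala collections, a minimal plain-Scala sketch of the analogous reduction (the sample data is illustrative, not doric API):

```scala
// aggregate(zero)(merge) reduces an array column the way foldLeft
// reduces a Scala collection: start from a zero value and fold each
// element into the running result.
val xs = List(1, 2, 3, 4)

// The (zero, merge) pair here: 0 and integer addition.
val sum = xs.foldLeft(0)((acc, x) => acc + x)
// sum == 10
```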
Aggregates (reduces) the array with the provided functions, similar to foldLeft on Scala collections, with a final transformation.
type of the intermediate values.
type of the final value to return.
zero value.
function to combine the previous result with the element of the array.
the final transformation.
the column reference with the applied transformation.
scaladoc link (issue #135)
org.apache.spark.sql.functions.aggregate
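The three-argument variant adds a final transformation on the reduced value; in plain Scala this is a foldLeft followed by one last function application (illustrative data, not doric API):

```scala
// Average via fold-then-finish: fold into (sum, count), then apply
// the final transformation that divides one by the other.
val ys = List(1.0, 2.0, 3.0, 4.0)

val (total, count) =
  ys.foldLeft((0.0, 0)) { case ((s, n), x) => (s + x, n + 1) }

val avg = total / count // the "final transformation" step
// avg == 2.5
```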
Returns null if the array is null, true if the array contains the value, and false otherwise.
Removes duplicate values from the array.
Returns the element of the array at the given index.
Returns an array of the elements in the first array but not in the second array, without duplicates. The order of the elements in the result is not determined.
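The same "first minus second, without duplicates" semantics can be sketched with plain Scala collections (the sample data is made up):

```scala
// Elements of `a` that do not appear in `b`, deduplicated.
val a = List(1, 2, 2, 3, 4)
val b = List(3, 4, 5)

val except = a.distinct.filterNot(b.toSet)
// except == List(1, 2)
```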
Returns whether a predicate holds for one or more elements in the array.
df.select(colArray("i").exists(_ % 2 === 0))
scaladoc link not available for spark 2.4
org.apache.spark.sql.functions.exists
Creates a new row for each element in the given array column.
Creates a new row for each element in the given array column. Unlike explode, if the array is null or empty then null is produced.
Filters the array elements using the provided condition.
the condition to filter.
the column reference with the filter applied.
scaladoc link (issue #135)
org.apache.spark.sql.functions.filter
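The semantics mirror filter on Scala collections; a plain-Scala sketch with illustrative data:

```scala
// Keep only the elements that satisfy the predicate.
val nums = List(1, 2, 3, 4, 5, 6)

val evens = nums.filter(_ % 2 == 0)
// evens == List(2, 4, 6)
```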
Selects the nth element of the array, returning null if the array length is shorter than n.
the index of the element to retrieve.
the DoricColumn with the selected element.
Returns an array of the elements in the intersection of the given two arrays, without duplicates.
Concatenates the elements of the column using the delimiter. Null values are removed.
scaladoc link (issue #135)
org.apache.spark.sql.functions.array_join
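Dropping nulls and then joining with the delimiter can be sketched with plain Scala collections (illustrative data):

```scala
// Nulls are removed before the elements are joined.
val parts: List[String] = List("a", null, "b", "c")

val joined = parts.filter(_ != null).mkString(",")
// joined == "a,b,c"
```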
Concatenates the elements of the column using the delimiter. Null values are replaced with nullReplacement.
scaladoc link (issue #135)
org.apache.spark.sql.functions.array_join
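The nullReplacement variant substitutes nulls instead of dropping them; in plain Scala (illustrative data, with "?" standing in for nullReplacement):

```scala
// Replace each null with the substitute, then join.
val items: List[String] = List("a", null, "b")

val joinedWithDefault =
  items.map(x => if (x == null) "?" else x).mkString(",")
// joinedWithDefault == "a,?,b"
```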
Transforms the original value to a literal.
a literal with the same type.
Creates a new map column. The array in the first column is used for keys. The array in the second column is used for values.
java.lang.RuntimeException
if the arrays don't have the same length, or if a key is null.
Returns the maximum value in the array.
Returns the minimum value in the array.
Returns true if a1 and a2 have at least one non-null element in common. If not, and both arrays are non-empty and either of them contains a null, it returns null. Otherwise it returns false.
Creates a new row for each element with position in the given array column.
ORIGINAL        SPARK       DORIC
+------------+  +---+---+   +------+
|col         |  |pos|col|   |col   |
+------------+  +---+---+   +------+
|[a, b, c, d]|  |0  |a  |   |{0, a}|
|[e]         |  |1  |b  |   |{1, b}|
|[]          |  |2  |c  |   |{2, c}|
|null        |  |3  |d  |   |{3, d}|
+------------+  |0  |e  |   |{0, e}|
                +---+---+   +------+
WARNING: Unlike Spark, doric returns a struct. It uses the default column name pos for the position and value for the elements of the array.
Creates a new row for each element with position in the given array column. Unlike posexplode, if the array is null or empty then a null row is produced.
ORIGINAL        SPARK         DORIC
+------------+  +----+----+   +------+
|col         |  |pos |col |   |col   |
+------------+  +----+----+   +------+
|[a, b, c, d]|  |0   |a   |   |{0, a}|
|[e]         |  |1   |b   |   |{1, b}|
|[]          |  |2   |c   |   |{2, c}|
|null        |  |3   |d   |   |{3, d}|
+------------+  |0   |e   |   |{0, e}|
                |null|null|   |null  |
                |null|null|   |null  |
                +----+----+   +------+
WARNING: Unlike Spark, doric returns a struct. It uses the default column name pos for the position and col for the elements of the array.
Locates the position of the first occurrence of the value in the given array, as a long. Returns null if either of the arguments is null.
The position is 1-based, not 0-based. Returns 0 if the value could not be found in the array.
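The 1-based / 0-on-miss convention differs from Scala's indexOf (0-based, -1 on miss); shifting by one reproduces it in plain Scala (position is a hypothetical helper, not doric API):

```scala
// 1-based position; 0 when the value is absent.
def position[A](xs: Seq[A], value: A): Long =
  (xs.indexOf(value) + 1).toLong

val found   = position(List("a", "b", "c"), "b") // 2
val missing = position(List("a", "b", "c"), "z") // 0
```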
Removes all elements equal to the given element from the array.
Returns an array with the elements in reverse order.
Returns a random permutation of the given array.
The function is non-deterministic.
Returns the length of the array.
The function returns null for null input if spark.sql.legacy.sizeOfNull is set to false or spark.sql.ansi.enabled is set to true. Otherwise, the function returns -1 for null input. With the default settings, the function returns -1 for null input.
Returns an array containing all the elements in the column from index start (or from the end if start is negative) with the specified length.
scaladoc link (issue #135)
if start == 0 an exception will be thrown
org.apache.spark.sql.functions.slice
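A plain-Scala sketch of the 1-based, negative-aware slicing (sliceCol is a hypothetical helper, not doric API; the data is illustrative):

```scala
// 1-based start; a negative start counts from the end; 0 is invalid.
def sliceCol[A](xs: Seq[A], start: Int, length: Int): Seq[A] = {
  require(start != 0, "start must not be 0")
  val from = if (start > 0) start - 1 else xs.length + start
  xs.slice(from, from + length)
}

val fromStart = sliceCol(List(1, 2, 3, 4, 5), 2, 3)  // List(2, 3, 4)
val fromEnd   = sliceCol(List(1, 2, 3, 4, 5), -2, 2) // List(4, 5)
```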
Sorts the input array for the given column in ascending or descending order, according to the natural ordering of the array elements. Null elements will be placed at the beginning of the returned array in ascending order or at the end of the returned array in descending order.
Sorts the input array for the given column in ascending order, according to the natural ordering of the array elements. Null elements will be placed at the beginning of the returned array.
Sorts the input array in ascending order. The elements of the input array must be orderable. Null elements will be placed at the end of the returned array.
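The natural-ordering behaviour matches sorted on Scala collections (illustrative data; the null-placement rules aside, which a plain Int list cannot show):

```scala
// Ascending sort by the elements' natural ordering.
val sortedAsc = List(3, 1, 2).sorted
// sortedAsc == List(1, 2, 3)

// Descending order via the reversed ordering.
val sortedDesc = List(3, 1, 2).sorted(Ordering[Int].reverse)
// sortedDesc == List(3, 2, 1)
```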
Converts a column containing a StructType into a JSON string with the specified schema.
java.lang.IllegalArgumentException
in the case of an unsupported type.
scaladoc link (issue #135)
org.apache.spark.sql.functions.to_json(e:org\.apache\.spark\.sql\.Column,options:scala\.collection\.immutable\.Map\[java\.lang\.String,java\.lang\.String\]):*
org.apache.spark.sql.functions.to_csv
Creates a new map column. The array in the first column is used for keys. The array in the second column is used for values.
java.lang.RuntimeException
if the arrays don't have the same length, or if a key is null.
Transforms each element with the provided function.
the type of the array elements to return.
lambda with the transformation to apply.
the column reference with the applied transformation.
scaladoc link (issue #135)
org.apache.spark.sql.functions.transform
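The semantics mirror map on Scala collections; a minimal sketch with illustrative data:

```scala
// Apply the lambda to every element.
val doubled = List(1, 2, 3).map(_ * 2)
// doubled == List(2, 4, 6)
```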
Transforms each element of the array using the provided function, which also receives the index of the element in the array.
the type of the elements of the array
the lambda that takes the element of the array and its index and returns a new element.
the column reference with the provided transformation.
scaladoc link (issue #135)
org.apache.spark.sql.functions.transform
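In plain Scala the indexed variant corresponds to zipWithIndex followed by map (illustrative data):

```scala
// Each element is transformed together with its 0-based index.
val labeled =
  List("a", "b", "c").zipWithIndex.map { case (e, i) => s"$i:$e" }
// labeled == List("0:a", "1:b", "2:c")
```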
Returns an array of the elements in the union of the given N arrays, without duplicates.
Returns a merged array of structs in which the N-th struct contains the N-th values of all input arrays.
Merges two given arrays, element-wise, into a single array using a function. If one array is shorter, nulls are appended at the end to match the length of the longer array, before applying the function.
df.select(colArray("val1").zipWith(col("val2"), concat(_, _)))
scaladoc link not available for spark 2.4
org.apache.spark.sql.functions.zip_with
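The pad-with-nulls-then-merge behaviour can be sketched with zipAll on Scala collections (illustrative data; string concatenation stands in for the merge function):

```scala
// The shorter side is padded with nulls before the function is applied.
val left  = List("a", "b", "c")
val right = List("1", "2")

val merged = left.zipAll(right, null, null).map { case (l, r) => s"$l$r" }
// merged == List("a1", "b2", "cnull")
```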
DORIC EXCLUSIVE! Given any array[e] column, this method returns a new array[struct[i, e]] column, where the first element of each struct is the index and the second is the value itself.
Extension methods for arrays