T
- the numeric type e.g. Integer, Double@Deprecated @PublicEvolving public class NumericColumnSummary<T> extends ColumnSummary implements Serializable
Some values are considered "missing" where "missing" is defined as null, NaN, or Infinity. These values are ignored in some calculations like mean, variance, and standardDeviation.
Uses the Kahan summation algorithm to avoid numeric instability when computing variance. The algorithm is described in: "Scalable and Numerically Stable Descriptive Statistics in SystemML", Tian et al, International Conference on Data Engineering 2012.
构造器和说明 |
---|
NumericColumnSummary(long nonMissingCount,
long nullCount,
long nanCount,
long infinityCount,
T min,
T max,
T sum,
Double mean,
Double variance,
Double standardDeviation)
已过时。
|
限定符和类型 | 方法和说明 |
---|---|
long |
getInfinityCount()
已过时。
Number of values that are positive or negative infinity.
|
T |
getMax()
已过时。
|
Double |
getMean()
已过时。
Null, NaN, and Infinite values are ignored in this calculation.
|
T |
getMin()
已过时。
|
long |
getMissingCount()
已过时。
The number of "missing" values where "missing" is defined as null, NaN, or Infinity.
|
long |
getNanCount()
已过时。
Number of values that are NaN.
|
long |
getNonMissingCount()
已过时。
The number of values that are not null, NaN, or Infinity.
|
long |
getNonNullCount()
已过时。
The number of non-null values in this column.
|
long |
getNullCount()
已过时。
The number of null values in this column.
|
Double |
getStandardDeviation()
已过时。
Standard Deviation is a measure of variation in a set of numbers.
|
T |
getSum()
已过时。
|
Double |
getVariance()
已过时。
Variance is a measure of how far a set of numbers are spread out.
|
String |
toString()
已过时。
|
containsNonNull, containsNull, getTotalCount
public long getMissingCount()
These values are ignored in some calculations like mean, variance, and standardDeviation.
public long getNonMissingCount()
public long getNonNullCount()
getNonNullCount
在类中 ColumnSummary
public long getNullCount()
ColumnSummary
getNullCount
在类中 ColumnSummary
public long getNanCount()
(always zero for types like Short, Integer, Long)
public long getInfinityCount()
(always zero for types like Short, Integer, Long)
public T getMin()
public T getMax()
public T getSum()
public Double getMean()
public Double getVariance()
Null, NaN, and Infinite values are ignored in this calculation.
public Double getStandardDeviation()
Null, NaN, and Infinite values are ignored in this calculation.
Copyright © 2014–2024 The Apache Software Foundation. All rights reserved.