org.apache.spark.mllib

stat

package stat

Visibility
  1. Public
  2. All

Type Members

  1. class MultivariateOnlineSummarizer extends MultivariateStatisticalSummary with Serializable

    :: DeveloperApi :: MultivariateOnlineSummarizer implements MultivariateStatisticalSummary to compute the mean, variance, minimum, maximum, counts, and nonzero counts for samples in sparse or dense vector format in a online fashion.

    :: DeveloperApi :: MultivariateOnlineSummarizer implements MultivariateStatisticalSummary to compute the mean, variance, minimum, maximum, counts, and nonzero counts for samples in sparse or dense vector format in a online fashion.

    Two MultivariateOnlineSummarizer can be merged together to have a statistical summary of the corresponding joint dataset.

    A numerically stable algorithm is implemented to compute sample mean and variance: Reference: variance-wiki Zero elements (including explicit zero values) are skipped when calling add(), to have time complexity O(nnz) instead of O(n) for each column.

    Annotations
    @DeveloperApi()
  2. trait MultivariateStatisticalSummary extends AnyRef

    Trait for multivariate statistical summary of a data matrix.

Value Members

  1. object Statistics

    API for statistical functions in MLlib.

    API for statistical functions in MLlib.

    Annotations
    @Experimental()
  2. package test

Ungrouped