Package

com.eharmony.aloha

dataset

Permalink

package dataset

This package name is kept to eHarmony compatibility.

Linear Supertypes
AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. dataset
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Type Members

  1. type CharSeqRowCreator[-A] = RowCreator[A, CharSequence]

    Permalink
  2. trait CompilerFailureMessages extends AnyRef

    Permalink
  3. sealed trait CovariateProducer[Density] extends AnyRef

    Permalink

    Helper to produce functions used to the create the covariate data passed to the dataset generators.

  4. trait DenseCovariateProducer extends CovariateProducer[Double]

    Permalink
  5. final case class DenseFeatureExtractorFunction[-A](features: IndexedSeq[(String, GenAggFunc[A, Double])]) extends FeatureExtractorFunction[A, Double] with Product with Serializable

    Permalink
  6. trait DvProducer extends AnyRef

    Permalink

    Helper to provide a way to construct dependent variables.

  7. trait FeatureExtractorFunction[-A, Density] extends (A) ⇒ (MissingAndErroneousFeatureInfo, IndexedSeq[Density])

    Permalink

    A function that takes a value and returns extracted features and information on missing and erring features.

  8. trait LabelRowCreator[-A, +B] extends RowCreator[A, B]

    Permalink
  9. case class MissingAndErroneousFeatureInfo(missingFeatures: Seq[String], errorFeatures: Seq[String]) extends Product with Serializable

    Permalink
  10. trait RowCreator[-A, +B] extends (A) ⇒ (MissingAndErroneousFeatureInfo, B) with Serializable

    Permalink
  11. final case class RowCreatorBuilder[A, B, Impl <: RowCreator[A, B]](semantics: CompiledSemantics[A], producers: List[RowCreatorProducer[A, B, Impl]]) extends AlohaReadable[Try[Impl]] with Logging with Product with Serializable

    Permalink

    Given a semantics, json specification and an ordered sequence of RowCreatorProducers, find the first producer that applies to creating a Spec from the json specification and use it to instantiate the RowCreator object.

    Given a semantics, json specification and an ordered sequence of RowCreatorProducers, find the first producer that applies to creating a Spec from the json specification and use it to instantiate the RowCreator object.

    A

    the type consumed by the RowCreator produced by this Readable.

    B

    the type produced by the RowCreator produced by this Readable.

    Impl

    the implementation of RowCreator.

    semantics

    a Semantics to be used for creating the RowCreator.

    producers

    an ordered sequence of RowCreatorProducers. These producers form the basis of a chain of responsibility pattern. Therefore, the order is important.

  12. trait RowCreatorProducer[A, +B, +Impl <: RowCreator[A, B]] extends AnyRef

    Permalink

    RowCreatorProducer is used to create different kinds of RowCreator instances.

    RowCreatorProducer is used to create different kinds of RowCreator instances.

    Classes that extend RowCreatorProducer should (try to) have only zero-argument constructors.

    This is because RowCreator instances should ideally only be parametrized by the JSON specification. Otherwise, one JSON specification could produce non-equivalent RowCreator instances in different environments.

    This statelessness is a design goal and should only be broken with good reason.

    One of the reasons this rule will likely be broken is that things like context bounds on a type parameter to a RowCreatorProducer become constructor arguments. So if a RowCreatorProducer is parametrized by a type that requires a type class to decode the JSON representation, this rule would be broken.

    Another example might be in training multi-label models. Whereas in binary classifiers the labels values are known automatically (because they are isomorphic to the set {true, false}), the label set isn't known a priori (because each problem codomain might be different). Therefore, we might ask for the set of labels to expect.

    NOTE: com.eharmony.aloha.dataset.RowCreatorProducerTest will be used to control which RowCreatorProducers can accept parameters.

    A

    type of input passed to the RowCreator.

    B

    type of output returned from the RowCreator.

    Impl

    implementation of the RowCreator that is returned by the getRowCreator function.

  13. trait RowCreatorProducerName extends AnyRef

    Permalink

    A mixin that gives a standardized name to RowCreatorProducer instances.

  14. trait SparseCovariateProducer extends CovariateProducer[Iterable[(String, Double)]]

    Permalink
  15. final case class SparseFeatureExtractorFunction[-A](features: IndexedSeq[(String, GenAggFunc[A, Iterable[(String, Double)]])]) extends FeatureExtractorFunction[A, Iterable[(String, Double)]] with Product with Serializable

    Permalink
  16. trait StatefulRowCreator[-A, +B, S] extends Serializable

    Permalink

    A row creator that requires state.

    A row creator that requires state. This state should be modeled functionally, meaning implementations should be referentially transparent.

    Created by ryan.deak on 11/2/17.

  17. trait StatefulRowCreatorProducer[A, +B, S, +Impl <: StatefulRowCreator[A, B, S]] extends AnyRef

    Permalink

    Created by deaktator on 11/6/17.

  18. final case class StringFeatureExtractorFunction[-A](features: IndexedSeq[(String, GenAggFunc[A, String])]) extends FeatureExtractorFunction[A, String] with Product with Serializable

    Permalink
  19. final case class StringSeqFeatureExtractorFunction[-A](features: IndexedSeq[(String, GenAggFunc[A, Seq[String]])]) extends FeatureExtractorFunction[A, Seq[String]] with Product with Serializable

    Permalink

Value Members

  1. object RowCreatorBuilder extends Serializable

    Permalink
  2. object RowCreatorType extends Enumeration

    Permalink

    This is for backward compatibility, but should someday be removed and SpecBuilder should be updated to remove it too.

    This is for backward compatibility, but should someday be removed and SpecBuilder should be updated to remove it too. Eventually, the spec type should always appear in the JSON used to create the JsonSpec.

  3. package csv

    Permalink
  4. package json

    Permalink
  5. package libsvm

    Permalink
  6. package vw

    Permalink

Inherited from AnyRef

Inherited from Any

Ungrouped