kr.ac.kaist.ir.deep

train

package train

Package for training.

Linear Supertypes
AnyRef, Any

Type Members

  1. class AEType extends ManipulationType[ScalarMatrix, Null]

    Input Operation : Vector as Input & Auto Encoder Training (no output type)

    Example:
    1. var make = new AEType(error = CrossEntropyErr)
       var corruptedIn = make corrupted in
       var out = make onewayTrip (net, corruptedIn)
  2. trait Corruption extends (ScalarMatrix) ⇒ ScalarMatrix with Serializable

    Type of Corruption
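
    Since Corruption is just a (ScalarMatrix) ⇒ ScalarMatrix function, a custom corruption can be written by implementing apply. The sketch below is a minimal illustration that assumes ScalarMatrix supports an element-wise map (e.g. a Breeze matrix); SaltPepperCorruption is a hypothetical name, not part of this package.

    Example:
    1. // Hypothetical corruption: with probability rate, an entry is forced to 0 or 1.
       case class SaltPepperCorruption(rate: Float = 0.05f) extends Corruption {
         override def apply(x: ScalarMatrix): ScalarMatrix =
           x.map { v ⇒
             val r = math.random
             if (r < rate / 2) 0.0f        // "pepper": drop to zero
             else if (r < rate) 1.0f       // "salt": saturate to one
             else v                        // keep the original entry
           }
       }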

  3. case class DistBeliefCriteria(miniBatch: Int = 100, negSamplingRatio: Int = 0, submitInterval: Duration = 1.minute, updateStep: Int = 2, fetchStep: Int = 10, numCores: Int = 1, storageLevel: StorageLevel = StorageLevel.DISK_ONLY_2) extends TrainingCriteria with Product with Serializable

    Criteria: How to train (for DistBeliefTrainStyle)

    This case class defines how to train the network; all training parameters are defined here.

    miniBatch

    size of mini-batch (default 100)

    negSamplingRatio

    ratio of negative samples per positive instance. (default 0)

    submitInterval

    Time interval between batch submission. (default 1.minute)

    updateStep

    number of mini-batches between update (default 2)

    fetchStep

    number of mini-batches between fetching (default 10)

    numCores

    number of v-cores in the Spark cluster. (default 1)

    storageLevel

    StorageLevel that will be used in Spark. (default DISK_ONLY_2)

    Note

    We recommend setting numCores as close as possible to the number of allocated Spark v-cores.
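
    Example (a sketch; the parameter names come from the signature above, the values are illustrative only):
    1. import scala.concurrent.duration._
       import org.apache.spark.storage.StorageLevel

       val criteria = DistBeliefCriteria(
         miniBatch = 128,
         submitInterval = 30.seconds,
         updateStep = 2,
         fetchStep = 10,
         numCores = 4,                                  // match the allocated Spark v-cores
         storageLevel = StorageLevel.MEMORY_AND_DISK)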

  4. class DistBeliefTrainStyle[IN, OUT] extends MultiThreadTrainStyle[IN, OUT]

    Train Style : Semi-DistBelief Style, Spark-based.

    Note

    Unlike DistBelief, updates and fetches are done by the master, not by the workers.

  5. case class DroppingCorruption(presence: Float = 0.95f) extends Corruption with Product with Serializable

    Input Corruption: Drop input as zero.

    If the network uses drop-out training, we recommend not using this corruption.

    presence

    probability that an entry is not dropped. (default 95% = 0.95)

    Example:
    1. var corrupt = DroppingCorruption(presence = 0.99)
       var corrupted = corrupt(vector)
    Note

    If the presence probability is P%, this corruption keeps P% of the matrix entries and drops the rest to zero.

  6. case class GaussianCorruption(mean: Double = 0.0, variance: Double = 0.1) extends Corruption with Product with Serializable

    Input Corruption: Gaussian

    mean

    Mean of noise (default 0.0)

    variance

    Variance of noise (default 0.1)

    Example:
    1. var corrupt = GaussianCorruption(variance = 0.1)
       var corrupted = corrupt(vector)
  7. trait ManipulationType[IN, OUT] extends Serializable

    Trait that describes how to convert an input into a corrupted matrix

    An input operation corrupts the given input and applies network propagation to the matrix representation of the input.

    IN

    the type of input

    OUT

    the type of output
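
    The concrete implementations in this package (AEType, VectorType, RAEType, StandardRAEType, URAEType) share the usage pattern shown in their examples. A generic sketch follows; the method names corrupted and onewayTrip are taken from those examples, and net, in and CrossEntropyErr are assumed to be defined elsewhere:

    Example:
    1. // Corrupt one input and propagate it through the network.
       val make: ManipulationType[ScalarMatrix, Null] = new AEType(error = CrossEntropyErr)
       val corruptedIn = make corrupted in
       val out = make onewayTrip (net, corruptedIn)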

  8. class MultiThreadTrainStyle[IN, OUT] extends TrainStyle[IN, OUT]

    Trainer : Stochastic-Style, Multi-Threaded using Spark.

    Note

    This is not an implementation of the DistBelief paper. This style sits between DistBeliefTrainStyle (DBTS) and SingleThreadTrainStyle (STTS); the major difference is whether "updating" is asynchronous (DBTS) or not (MTTS).

  9. class RAEType extends TreeType

    Input Operation : VectorTree as Input & Recursive Auto-Encoder Training (no output type)

    Example:
    1. var make = new RAEType(error = CrossEntropyErr)
       var corruptedIn = make corrupted in
       var out = make onewayTrip (net, corruptedIn)
    Note

    This implementation is designed as a replica of the traditional RAE in this paper.

    We recommend that you do not apply this method to non-AutoEncoder tasks.

  10. class RandomEqualPartitioner extends Partitioner

    Spark Partitioner that gives almost-equal partitions.

    Note

    Use this with RDD.zipWithUniqueId()
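
    A sketch of the note above, assuming the partitioner is constructed from a partition count (the actual constructor signature is not shown on this page):

    Example:
    1. // Assign unique ids, key by them, then repartition into roughly equal parts.
       val indexed = data.zipWithUniqueId().map(_.swap)               // RDD[(Long, T)]
       val balanced = indexed.partitionBy(new RandomEqualPartitioner(4))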

  11. case class SimpleTrainingCriteria(miniBatch: Int = 100, negSamplingRatio: Int = 0) extends TrainingCriteria with Product with Serializable

    Criteria: How to train (for SingleThreadTrainStyle)

    This case class defines how to train the network; all training parameters are defined here.

    miniBatch

    size of mini-batch (default 100)

    negSamplingRatio

    ratio of negative samples per positive instance. (default 0)
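
    Example (a sketch; the parameter names come from the signature above, the values are illustrative):
    1. val criteria = SimpleTrainingCriteria(miniBatch = 8, negSamplingRatio = 0)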

  12. class SingleThreadTrainStyle[IN, OUT] extends TrainStyle[IN, OUT]

    Trainer : Stochastic-Style, Single-Threaded

  13. class StandardRAEType extends TreeType

    Input Operation : VectorTree as Input & Recursive Auto-Encoder Training (no output type)

    Example:
    1. var make = new StandardRAEType(error = CrossEntropyErr)
       var corruptedIn = make corrupted in
       var out = make onewayTrip (net, corruptedIn)
    Note

    This implementation is designed as a replica of the standard RAE (RAE + normalization) in this paper.

    We recommend that you do not apply this method to non-AutoEncoder tasks.

  14. case class StoppingCriteria(maxIter: Int = 100000, patience: Int = 5000, patienceStep: Int = 2, improveThreshold: Float = 0.995f, lossThreshold: Float = 0.0001f, validationFreq: Int = 100) extends Serializable with Product

    Criteria: When to stop training

    This case class defines when to stop training. Training stops if one of the following conditions is satisfied:

    • #Iteration ≥ maxIter
    • #Iteration ≥ current patience value, which is calculated by max(patience, bestIteration * patienceStep)
    • Amount of loss < lossThreshold

    Validation is done every validationFreq iterations, and whenever the current/best loss ratio falls below improveThreshold, that iteration is marked as the best iteration.

    maxIter

    maximum mini-batch iteration count (default 100,000)

    patience

    default patience count (default 5,000)

    patienceStep

    multiplier for calculating patience (default x2)

    improveThreshold

    threshold on the current/best loss ratio below which an iteration is marked as "improved" (default 99.5% = 0.995)

    lossThreshold

    maximum-tolerant loss value. (default 0.0001)

    validationFreq

    step count for validation (default 100)
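
    Example (a sketch; the parameter names come from the signature above, the values are illustrative):
    1. // Stop after at most 10,000 mini-batches; validate every 50 iterations.
       val stops = StoppingCriteria(
         maxIter = 10000,
         patience = 5000,
         patienceStep = 2,
         improveThreshold = 0.995f,
         lossThreshold = 0.0001f,
         validationFreq = 50)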

  15. trait TrainStyle[IN, OUT] extends Serializable

    Trait that describes style of training

    This trait controls how to train, i.e., single-threaded or distributed.

    IN

    the type of input

    OUT

    the type of output

  16. class Trainer[IN, OUT] extends Serializable

    General Trainer Implementation.

    This class trains a network with the help of a Training Style and an Input Operation.

    IN

    the type of input. Currently, ScalarMatrix and DAG are supported

    OUT

    the type of output. Currently, ScalarMatrix and Null are supported

    Example:
    1. val net: Network = ...

       // Define Manipulation Type: VectorType, AEType, RAEType or URAEType.
       val operation = new VectorType(
         corrupt = GaussianCorruption(variance = 0.1)
       )

       // Define Training Style: SingleThreadTrainStyle vs DistBeliefTrainStyle
       val style = new SingleThreadTrainStyle(
         net = net,
         algorithm = new StochasticGradientDescent(l2decay = 0.0001),
         make = operation,
         param = SimpleTrainingCriteria(miniBatch = 8))

       // Define Trainer
       val train = new Trainer(
         style = style,
         stops = StoppingCriteria(maxIter = 100000))

       // Do Train
       train.train(set, valid)
    Note

    To train an autoencoder, you can provide the same training set as the validation set.

    This trainer is a generalized class; for implementation details, see the individual train styles.

  17. trait TrainingCriteria extends Serializable

    Trait that describes Training Criteria

  18. trait TreeType extends ManipulationType[BinaryTree, Null]

    Trait of Input Operation : VectorTree as Input. This is an abstract implementation.

  19. class URAEType extends TreeType

    Input Operation : VectorTree as Input & Unfolding Recursive Auto Encoder Training (no output type)

    ::Experimental::

    Annotations
    @Experimental()
    Example:
    1. var make = new URAEType(error = CrossEntropyErr)
       var corruptedIn = make corrupted in
       var out = make onewayTrip (net, corruptedIn)
    Note

    This is designed for the Unfolding RAE in this paper.

    This cannot be applied to non-AutoEncoder tasks.

  20. class VectorType extends ManipulationType[ScalarMatrix, ScalarMatrix]

    Input Operation : Vector as Input and output

    Example:
    1. var make = new VectorType(error = CrossEntropyErr)
       var corruptedIn = make corrupted in
       var out = make onewayTrip (net, corruptedIn)

Value Members

  1. object AsyncAwait extends Serializable

    Non-blocking await

  2. object NoCorruption extends Corruption with Product with Serializable

    Input Corruption: Never corrupts input

    Example:
    1. var corrupt = NoCorruption
       var corrupted = corrupt(vector)
  3. object WeightAccumulator extends AccumulatorParam[IndexedSeq[ScalarMatrix]]

    Accumulator Param object for DistBelief Train Style.
