Initial weights (pre-trained or random). weights(i)(j) is the weight for evidence i based on input parameter j, with i = 0..nClasses and j = 0..nInputs, where nInputs = inputLength + 1 (the extra pseudo-input is used for bias handling). Modeled as a flat array of concatenated rows.
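A minimal sketch of this flat, row-major layout (the names and the weightAt helper are illustrative, not part of the actual code):

val nClasses = 3
val inputLength = 4
val nInputs = inputLength + 1                        // +1 pseudo-input used for the bias
val weights = Array.fill(nClasses * nInputs)(0.0)    // nClasses rows concatenated into one flat array

// weight for class/evidence i and input parameter j
def weightAt(i: Int, j: Int): Double = weights(i * nInputs + j)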
Between 0 and 1. As we use AdaGrad, the effective rate will gradually decrease. You can start relatively high: 0.01 - 0.1
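A minimal AdaGrad update sketch, assuming flat weight/gradient arrays (adagradUpdate and its signature are illustrative, not the library's API):

// Per-weight step: the accumulated squared gradients grow monotonically,
// so learningRate / sqrt(accumulator) shrinks as training progresses.
def adagradUpdate(weights: Array[Double],
                  gradients: Array[Double],
                  gradSquaredSum: Array[Double],
                  learningRate: Double,
                  eps: Double = 1e-8): Unit = {
  var k = 0
  while (k < weights.length) {
    gradSquaredSum(k) += gradients(k) * gradients(k)
    val effectiveRate = learningRate / math.sqrt(gradSquaredSum(k) + eps)
    weights(k) -= effectiveRate * gradients(k)
    k += 1
  }
}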
How many more samples to try if there is no improvement. Set based on input size, your patience, and target accuracy.
Mini-batch size for "mini-batch SGD". Most of the time, 1 is the best size.
Use the numerically stable softmax version, which is more tolerant of broad input ranges. Recommended most of the time.
Gradients of the loss function used for backprop weight updates. See ref.
We have nClasses gradient vectors of the form:
grad(w(j)) = -(1/m) * sum(x * (target(j) - predicted(j))) + lambda * w(j), for j = 0..nClasses, where:
  the sum runs over a batch of m examples;
  w(j) = vector of weights for class j;
  x = input(i) = input vector (iterates over the batch);
  target(j) = known value (0 or 1) indicating whether input x belongs to class j (iterates over the batch);
  predicted(j) = predicted likelihood that input x belongs to class j (iterates over the batch);
  lambda > 0 is the weight decay parameter, necessary for convergence.
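A sketch of this gradient for a single class j (the classGradient name and signature are illustrative, not the library's API; inputs are assumed to already include the bias pseudo-input):

def classGradient(batch: Seq[(Array[Double], Int)],      // (input vector x, target class index)
                  predicted: Seq[Array[Double]],         // predicted(k)(j) = likelihood of class j for batch(k)
                  classWeights: Array[Double],           // w(j): weight vector of class j
                  j: Int,
                  lambda: Double): Array[Double] = {
  val m = batch.size
  val grad = Array.fill(classWeights.length)(0.0)
  for (((x, target), p) <- batch.zip(predicted)) {
    val diff = (if (target == j) 1.0 else 0.0) - p(j)    // target(j) - predicted(j)
    var k = 0
    while (k < x.length) { grad(k) -= x(k) * diff / m; k += 1 }   // -(1/m) * sum(x * diff)
  }
  var k = 0
  while (k < grad.length) { grad(k) += lambda * classWeights(k); k += 1 }   // + lambda * w(j)
  grad
}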
Convenience shortcut for feeding a sequence of examples, splitting it into suitable batches.
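A minimal sketch of such a shortcut (trainBatch is a hypothetical per-batch training callback):

def trainAll(examples: Seq[(Array[Double], Int)],
             batchSize: Int)
            (trainBatch: Seq[(Array[Double], Int)] => Unit): Unit =
  examples.grouped(batchSize).foreach(trainBatch)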
Predict the likelihoods of each class given the inputs.
y = softmax(x) = normalize(exp(x)), i.e. y(i) = exp(x(i)) / sum(exp(x))
Naive "by the book" version; only works with normalized, stable input.
The index of the currently processed example from the mini-batch. Used to save memory by writing the result directly to predicted(idx).
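A minimal "by the book" sketch (illustrative; unlike the real method it returns a fresh array instead of writing into predicted(idx)):

def softmaxNaive(x: Array[Double]): Array[Double] = {
  val exps = x.map(math.exp)    // exp(x(i)); can overflow/underflow for large-magnitude inputs
  val sum = exps.sum
  exps.map(_ / sum)             // normalize so the outputs sum to 1
}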
This version works around some numeric issues (overflow/underflow).
Original form: y(i) = exp(x(i)) / sum (exp(x))
Stable form: y(i) = exp( x(i) - logSumExp(x) ) where logSumExp(x) = max(x) + log(sum(exp(x - max(x))))
The index of the currently processed example from the mini-batch. Used to save memory by writing the result directly to predicted(idx).
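A sketch of the stable form above (illustrative; again it returns a new array rather than filling predicted(idx)):

def softmaxStable(x: Array[Double]): Array[Double] = {
  val maxX = x.max
  // logSumExp(x) = max(x) + log(sum(exp(x - max(x)))); shifting by max(x) avoids overflow
  val logSumExp = maxX + math.log(x.map(v => math.exp(v - maxX)).sum)
  x.map(v => math.exp(v - logSumExp))
}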
Softmax (multinomial logistic) regression with SGD and AdaGrad
Ref:
https://en.wikipedia.org/wiki/Multinomial_logistic_regression
http://ufldl.stanford.edu/wiki/index.php/Softmax_Regression
http://ufldl.stanford.edu/wiki/index.php/Exercise:Softmax_Regression
http://blog.datumbox.com/machine-learning-tutorial-the-multinomial-logistic-regression-softmax-regression/
https://xcorr.net/2014/01/23/adagrad-eliminating-learning-rates-in-stochastic-gradient-descent/
https://en.wikipedia.org/wiki/Stochastic_gradient_descent#AdaGrad