org.platanios.tensorflow.api.ops.rnn.attention
Memory to query; usually the output of an RNN encoder. Each tensor in the memory should be shaped [batchSize, maxTime, ...].
Weights tensor with which the memory is multiplied to produce the attention keys.
Sequence lengths for the batch entries in the memory. If provided, the memory tensor rows are masked with zeros for values past the respective sequence lengths.
Scalar tensor with which the scores are multiplied before they are used to compute attention probabilities.
Optional function that converts computed scores to probabilities. Defaults to the softmax function. A potentially useful alternative is the hardmax function.
Mask value to use for the score before passing it to probabilityFn. Defaults to negative infinity. Note that this value is only used if memorySequenceLengths is not null.
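As a rough illustration of how these parameters interact (a NumPy sketch of the computation, not the Scala API; the function and argument names are ours): positions past each batch entry's sequence length are replaced with the mask value before probabilityFn is applied, so the default softmax assigns them zero probability.

```python
import numpy as np

def masked_probabilities(scores, sequence_lengths, score_mask_value=-np.inf):
    """Mask scores past each sequence length, then apply softmax."""
    batch_size, max_time = scores.shape
    # Boolean mask: True for valid positions within each sequence.
    mask = np.arange(max_time)[None, :] < sequence_lengths[:, None]
    masked = np.where(mask, scores, score_mask_value)
    # Softmax over time (the default probabilityFn).
    exp = np.exp(masked - masked.max(axis=1, keepdims=True))
    return exp / exp.sum(axis=1, keepdims=True)

scores = np.array([[1.0, 2.0, 3.0],
                   [0.5, 0.5, 0.5]])
probs = masked_probabilities(scores, np.array([2, 3]))
# Masked positions receive zero probability; each row still sums to 1.
```

With the default mask value of negative infinity, exp of a masked score is exactly zero, which is why the masked positions contribute nothing to the normalization.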
Name prefix to use for all created ops.
Computes an alignment tensor given the provided query and previous alignment tensor.
The previous alignment tensor is important for attention mechanisms that use the previous alignment to calculate the attention at the next time step, such as monotonic attention mechanisms.
TODO: Figure out how to generalize the "next state" functionality.
Query tensor.
Previous alignment tensor.
Tuple containing the alignment tensor and the next attention state.
Initial alignment value.
This is important for attention mechanisms that use the previous alignment to calculate the alignment at the next time step (e.g., monotonic attention).
The default behavior is to return a tensor of all zeros.
Initial state value.
This is important for attention mechanisms that use the previous alignment to calculate the alignment at the next time step (e.g., monotonic attention).
The default behavior is to return the same output as initialAlignment.
Memory to query; usually the output of an RNN encoder. Each tensor in the memory should be shaped [batchSize, maxTime, ...].
Sequence lengths for the batch entries in the memory. If provided, the memory tensor rows are masked with zeros for values past the respective sequence lengths.
Weights tensor with which the memory is multiplied to produce the attention keys.
Name prefix to use for all created ops.
Computes alignment probabilities for score.
Alignment score tensor.
Alignment probabilities tensor.
Optional function that converts computed scores to probabilities. Defaults to the softmax function. A potentially useful alternative is the hardmax function.
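To make the softmax/hardmax distinction concrete, here is a small NumPy sketch (an illustration of the two functions, not the library's implementation): hardmax returns a one-hot vector at the position of the maximum score, whereas softmax spreads probability mass smoothly across all positions.

```python
import numpy as np

def softmax(scores):
    """Smooth probabilities: every position gets non-zero mass."""
    exp = np.exp(scores - scores.max())
    return exp / exp.sum()

def hardmax(scores):
    """One-hot vector at the arg-max position."""
    one_hot = np.zeros_like(scores)
    one_hot[np.argmax(scores)] = 1.0
    return one_hot

scores = np.array([1.0, 3.0, 2.0])
soft = softmax(scores)   # all entries > 0, peaked at index 1
hard = hardmax(scores)   # exactly [0., 1., 0.]
```

Hardmax can be useful when a hard, discrete attention choice is desired, at the cost of a non-differentiable arg-max.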
Scalar tensor with which the scores are multiplied before they are used to compute attention probabilities.
Computes an alignment score for query.
Query tensor.
Score tensor.
Mask value to use for the score before passing it to probabilityFn. Defaults to negative infinity. Note that this value is only used if memorySequenceLengths is not null.
Luong-style (multiplicative) attention scoring.
This attention has two forms. The first is standard Luong attention, as described in: ["Effective Approaches to Attention-based Neural Machine Translation.", EMNLP 2015](https://arxiv.org/abs/1508.04025).
The second is the scaled form inspired partly by the normalized form of Bahdanau attention. To enable the second form, construct the object with weightsScale set to the value of a scalar scaling variable.
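The multiplicative score can be sketched as follows (a NumPy illustration of the math from the paper, not the Scala API; the names are ours): the memory is multiplied by the weights tensor to produce the attention keys, and the score for each time step is the dot product of the query with the corresponding key, optionally multiplied by a scalar scale in the second form.

```python
import numpy as np

def luong_score(query, memory, weights, scale=1.0):
    """Multiplicative (Luong) attention scores.

    query:   [batchSize, numUnits]
    memory:  [batchSize, maxTime, memoryUnits]
    weights: [memoryUnits, numUnits]
    Returns: [batchSize, maxTime]
    """
    keys = memory @ weights                       # [batchSize, maxTime, numUnits]
    scores = np.einsum('bu,btu->bt', query, keys)  # dot product per time step
    return scale * scores                          # scaled form when scale != 1

query = np.ones((1, 4))
memory = np.ones((1, 3, 4))
weights = np.eye(4)
scores = luong_score(query, memory, weights)
# Each score is the dot product of two all-ones vectors of length 4.
```

In the scaled form, the scale argument plays the role of the scalar scaling variable passed as weightsScale.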