LSTMBlock (nd4j-api 1.0.0-beta7 API)

java.lang.Object
- org.nd4j.autodiff.functions.DifferentialFunction
- - org.nd4j.linalg.api.ops.DynamicCustomOp
  - - org.nd4j.linalg.api.ops.impl.layers.recurrent.LSTMBlock

All Implemented Interfaces:

CustomOp
```
public class LSTMBlock
extends DynamicCustomOp
```
LSTM layer implemented as a single operation. Implementation of operation for LSTM layer with optional peep hole connections.
S. Hochreiter and J. Schmidhuber. "Long Short-Term Memory". Neural Computation and https://research.google.com/pubs/archive/43905.pdf
Hasim Sak, Andrew Senior, and Francoise Beaufays. "Long short-term memory recurrent neural network architectures for large scale acoustic modeling." INTERSPEECH, 2014.
See also: https://arxiv.org/pdf/1503.04069.pdf

See also LSTMBlockCell - lstmBlockCell op is used internally at C++ level for computation.

Input arrays:
0: max sequence length; long/int64 scalar
1: input [seqLength, bS, inSize] at time t
2: previous/initial cell state [bS, numUnits]
3: previous/initial output [bS, numUnits]
4: Weights - concatenated (input-to-hidden, hidden-to-hidden weights) weights, [(inSize+numUnits), 4*numUnits]
5: weights - cell peephole (t-1) connections to input modulation gate, [numUnits]
6: weights - cell peephole (t-1) connections to forget gate, [numUnits]
7: weights - cell peephole (t) connections to output gate, [numUnits]
8: biases, shape [4*numUnits]

Input integer arguments: set via LSTMConfiguration
0: if not zero, provide peephole connections
1: Data format - 0=TNS=[seqLen,mb,size]; 1=NST=[mb,size,seqLen]; 2=NTS=[mb,seqLen,size]

Input float arguments: set via LSTMConfiguration
0: the bias added to forget gates in order to reduce the scale of forgetting in the beginning of the training
1: clipping value for cell state, if it is not equal to zero, then cell state is clipped

Output arrays:
0: i - Input modulation gate activations, rank 3, shape as per dataFormat
1: c (cs) - Cell state (pre tanh), rank 3, shape as per dataFormat
2: f - Output - forget gate activations, rank 3, shape as per dataFormat
3: o - Output - output gate activations, rank 3, shape as per dataFormat
4: z (ci) - Output - block input, rank 3, shape as per dataFormat
5: h (co) - Cell state, post tanh, rank 3, shape as per dataFormat
6: y (h) - Current cell output, rank 3, shape as per dataFormat

Author:

Alex Black

Nested Class Summary
- Nested classes/interfaces inherited from class org.nd4j.linalg.api.ops.DynamicCustomOp
  DynamicCustomOp.DynamicCustomOpsBuilder

Field Summary
- Fields inherited from class org.nd4j.linalg.api.ops.DynamicCustomOp
  axis, bArguments, dArguments, iArguments, inplaceCall, inputArguments, outputArguments, outputVariables, tArguments
- Fields inherited from class org.nd4j.autodiff.functions.DifferentialFunction
  dimensions, extraArgs, inPlace, sameDiff, scalarValue

Constructor Summary

Constructors
Constructor and Description
`LSTMBlock()`
`LSTMBlock(INDArray x, INDArray cLast, INDArray yLast, INDArray maxTSLength, LSTMWeights lstmWeights, LSTMConfiguration lstmConfiguration)`
`LSTMBlock(@NonNull SameDiff sameDiff, SDVariable maxTSLength, SDVariable x, SDVariable cLast, SDVariable yLast, LSTMWeights weights, LSTMConfiguration configuration)`

Method Summary

All Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`List<DataType>`	`calculateOutputDataTypes(List<DataType> inputDataTypes)` Calculate the data types for the output arrays.
`List<SDVariable>`	`doDiff(List<SDVariable> grads)` The actual implementation for automatic differentiation.
`void`	`initFromTensorFlow(NodeDef nodeDef, SameDiff initWith, Map<String,AttrValue> attributesForNode, GraphDef graph)` Initialize the function from the given `NodeDef`
`String`	`opName()` This method returns op opName as string
`Map<String,Object>`	`propertiesForFunction()` Returns the properties for a given function
`String`	`tensorflowName()` The opName of this function tensorflow

Methods inherited from class org.nd4j.autodiff.functions.DifferentialFunction
arg, arg, argNames, args, attributeAdaptersForFunction, configFieldName, diff, dup, equals, getNumOutputs, getValue, hashCode, isConfigProperties, larg, mappingsForFunction, onnxNames, outputs, outputVariable, outputVariablesNames, rarg, replaceArg, setInstanceId, setPropertiesForFunction, setValueFor, tensorflowNames

Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait

Methods inherited from interface org.nd4j.linalg.api.ops.CustomOp
isInplaceCall

- Constructor Detail
  - LSTMBlock
```
public LSTMBlock()
```
  - LSTMBlock
```
public LSTMBlock(@NonNull
                 @NonNull SameDiff sameDiff,
                 SDVariable maxTSLength,
                 SDVariable x,
                 SDVariable cLast,
                 SDVariable yLast,
                 LSTMWeights weights,
                 LSTMConfiguration configuration)
```
  - LSTMBlock
```
public LSTMBlock(INDArray x,
                 INDArray cLast,
                 INDArray yLast,
                 INDArray maxTSLength,
                 LSTMWeights lstmWeights,
                 LSTMConfiguration lstmConfiguration)
```
- Method Detail
  - calculateOutputDataTypes
```
public List<DataType> calculateOutputDataTypes(List<DataType> inputDataTypes)
```
    Description copied from class: DifferentialFunction
    
    Calculate the data types for the output arrays. Though datatypes can also be inferred from DifferentialFunction.calculateOutputShape(), this method differs in that it does not require the input arrays to be populated. This is important as it allows us to do greedy datatype inference for the entire net - even if arrays are not available.
    
    Overrides:
    
    calculateOutputDataTypes in class DifferentialFunction
    
    Parameters:
    
    inputDataTypes - The data types of the inputs
    
    Returns:
    
    The data types of the outputs
  - doDiff
```
public List<SDVariable> doDiff(List<SDVariable> grads)
```
    Description copied from class: DifferentialFunction
    
    The actual implementation for automatic differentiation.
    
    Overrides:
    
    doDiff in class DynamicCustomOp
    
    Returns:
  - initFromTensorFlow
```
public void initFromTensorFlow(NodeDef nodeDef,
                               SameDiff initWith,
                               Map<String,AttrValue> attributesForNode,
                               GraphDef graph)
```
    Description copied from class: DifferentialFunction
    
    Initialize the function from the given NodeDef
    
    Overrides:
    
    initFromTensorFlow in class DynamicCustomOp
  - opName
```
public String opName()
```
    Description copied from class: DynamicCustomOp
    
    This method returns op opName as string
    
    Specified by:
    
    opName in interface CustomOp
    
    Overrides:
    
    opName in class DynamicCustomOp
    
    Returns:
  - propertiesForFunction
```
public Map<String,Object> propertiesForFunction()
```
    Description copied from class: DifferentialFunction
    
    Returns the properties for a given function
    
    Overrides:
    
    propertiesForFunction in class DifferentialFunction
    
    Returns:
  - tensorflowName
```
public String tensorflowName()
```
    Description copied from class: DifferentialFunction
    
    The opName of this function tensorflow
    
    Overrides:
    
    tensorflowName in class DynamicCustomOp
    
    Returns:

Class LSTMBlock

Nested Class Summary

Nested classes/interfaces inherited from class org.nd4j.linalg.api.ops.DynamicCustomOp

Field Summary

Fields inherited from class org.nd4j.linalg.api.ops.DynamicCustomOp

Fields inherited from class org.nd4j.autodiff.functions.DifferentialFunction

Constructor Summary

Method Summary

Methods inherited from class org.nd4j.linalg.api.ops.DynamicCustomOp

Methods inherited from class org.nd4j.autodiff.functions.DifferentialFunction

Methods inherited from class java.lang.Object

Methods inherited from interface org.nd4j.linalg.api.ops.CustomOp

Constructor Detail

LSTMBlock

LSTMBlock

LSTMBlock

Method Detail

calculateOutputDataTypes

doDiff

initFromTensorFlow

opName

propertiesForFunction

tensorflowName