public class TransformerEncoderBlock extends AbstractBlock
children, inputNames, inputShapes, parameters, parameterShapeCallbacks, version| Constructor and Description |
|---|
TransformerEncoderBlock(int embeddingSize,
int headCount,
int hiddenSize,
float dropoutProbability,
java.util.function.Function<NDList,NDList> activationFunction)
Creates a transformer encoder block.
|
| Modifier and Type | Method and Description |
|---|---|
NDList |
forward(ParameterStore ps,
NDList inputs,
boolean training)
Applies the operating function of the block once.
|
protected NDList |
forwardInternal(ParameterStore ps,
NDList inputs,
boolean training,
ai.djl.util.PairList<java.lang.String,java.lang.Object> params) |
Shape[] |
getOutputShapes(NDManager manager,
Shape[] inputShapes)
Returns the expected output shapes of the block for the specified input shapes.
|
void |
initializeChildBlocks(NDManager manager,
DataType dataType,
Shape... inputShapes)
Initializes the Child blocks of this block.
|
addChildBlock, addParameter, addParameter, addParameter, beforeInitialize, cast, clear, describeInput, forward, getChildren, getDirectParameters, getParameters, getParameterShape, initialize, isInitialized, loadMetadata, loadParameters, readInputShapes, saveInputShapes, saveMetadata, saveParameters, setInitializer, setInitializer, toStringclone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, waitforward, validateLayoutpublic TransformerEncoderBlock(int embeddingSize,
int headCount,
int hiddenSize,
float dropoutProbability,
java.util.function.Function<NDList,NDList> activationFunction)
embeddingSize - the embedding size for tokensheadCount - number of attention blockshiddenSize - the hidden size for fully connected networksdropoutProbability - dropout probabilityactivationFunction - activation functionpublic Shape[] getOutputShapes(NDManager manager, Shape[] inputShapes)
manager - an NDManagerinputShapes - the shapes of the inputspublic void initializeChildBlocks(NDManager manager, DataType dataType, Shape... inputShapes)
initializeChildBlocks in class AbstractBlockmanager - the manager to use for initializationdataType - the requested data typeinputShapes - the expected input shapes for this blockprotected NDList forwardInternal(ParameterStore ps, NDList inputs, boolean training, ai.djl.util.PairList<java.lang.String,java.lang.Object> params)
forwardInternal in class AbstractBlockpublic NDList forward(ParameterStore ps, NDList inputs, boolean training)
ps - the parameter storeinputs - the input NDListtraining - true for a training forward pass