ScaledDotProductAttentionBlock.Builder (Deep Java Library 0.33.0 API specification)

java.lang.Object

ai.djl.nn.transformer.ScaledDotProductAttentionBlock.Builder

Enclosing class:: ScaledDotProductAttentionBlock

public static final class ScaledDotProductAttentionBlock.Builder extends Object

A builder for ScaledDotProductAttentionBlocks.

Method Summary

Modifier and Type

Method

Description

ScaledDotProductAttentionBlock

build()

Creates a new ScaledDotProductAttentionBlock with the current configuration.

ScaledDotProductAttentionBlock.Builder

optAttentionProbsDropoutProb(float attentionProbsDropoutProb)

Sets the probability of applying dropout to the attention probability distribution.

ScaledDotProductAttentionBlock.Builder

setEmbeddingSize(int embeddingSize)

Sets the embedding Size to be used for the internal token representation.

ScaledDotProductAttentionBlock.Builder

setHeadCount(int headCount)

Sets the number of attention Heads, must divide the embedding size without rest.

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Method Details
- setEmbeddingSize
  
  public ScaledDotProductAttentionBlock.Builder setEmbeddingSize(int embeddingSize)
  
  Sets the embedding Size to be used for the internal token representation.
  
  Parameters:
  
  embeddingSize - the embedding Size to be used for the internal token representation.
  
  Returns:
  
  this builder
- setHeadCount
  
  public ScaledDotProductAttentionBlock.Builder setHeadCount(int headCount)
  
  Sets the number of attention Heads, must divide the embedding size without rest. I.e. if embeddingSize = 10, a headCount of 3 would not be valid, a headCount of 1, 2 or 5 would be.
  
  Parameters:
  
  headCount - the number of attention Heads
  
  Returns:
  
  this builder
- optAttentionProbsDropoutProb
  
  public ScaledDotProductAttentionBlock.Builder optAttentionProbsDropoutProb(float attentionProbsDropoutProb)
  
  Sets the probability of applying dropout to the attention probability distribution. This dropout can randomly remove a complete token from the result at a position.
  
  Parameters:
  
  attentionProbsDropoutProb - the probability of applying dropout to the attention probability distribution
  
  Returns:
  
  this builder
- build
  
  public ScaledDotProductAttentionBlock build()
  
  Creates a new ScaledDotProductAttentionBlock with the current configuration.
  
  Returns:
  
  a new ScaledDotProductAttentionBlock with the current configuration.