Packages

case class TransformerEncoderBlock(attention: MultiheadAttention, layerNorm1: LayerNorm, layerNorm2: LayerNorm, w1: Constant, b1: Constant, w2: Constant, b2: Constant, dropout: Double, train: Boolean) extends GenericModule[(Variable, STen), Variable] with Product with Serializable

A single block of the transformer encoder as defined in Fig 10.7.1 in d2l v0.16

Linear Supertypes
Serializable, Product, Equals, GenericModule[(Variable, STen), Variable], AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. TransformerEncoderBlock
  2. Serializable
  3. Product
  4. Equals
  5. GenericModule
  6. AnyRef
  7. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. Protected

Instance Constructors

  1. new TransformerEncoderBlock(attention: MultiheadAttention, layerNorm1: LayerNorm, layerNorm2: LayerNorm, w1: Constant, b1: Constant, w2: Constant, b2: Constant, dropout: Double, train: Boolean)

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##: Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. def apply[S](a: (Variable, STen))(implicit arg0: Sc[S]): Variable

    Alias of forward

    Alias of forward

    Definition Classes
    GenericModule
  5. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  6. val attention: MultiheadAttention
  7. val b1: Constant
  8. val b2: Constant
  9. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.CloneNotSupportedException]) @native()
  10. val dropout: Double
  11. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  12. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.Throwable])
  13. def forward[S](x: (Variable, STen))(implicit arg0: Sc[S]): Variable

    The implementation of the function.

    The implementation of the function.

    In addition of x it can also use all the state to compute its value.

    Definition Classes
    TransformerEncoderBlockGenericModule
  14. final def getClass(): Class[_ <: AnyRef]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  15. final def gradients(loss: Variable, zeroGrad: Boolean = true): Seq[Option[STen]]

    Computes the gradient of loss with respect to the parameters.

    Computes the gradient of loss with respect to the parameters.

    Definition Classes
    GenericModule
  16. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  17. val layerNorm1: LayerNorm
  18. val layerNorm2: LayerNorm
  19. final def learnableParameters: Long

    Returns the total number of optimizable parameters.

    Returns the total number of optimizable parameters.

    Definition Classes
    GenericModule
  20. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  21. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  22. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  23. final def parameters: Seq[(Constant, PTag)]

    Returns the state variables which need gradient computation.

    Returns the state variables which need gradient computation.

    Definition Classes
    GenericModule
  24. def productElementNames: Iterator[String]
    Definition Classes
    Product
  25. def state: List[(Constant, LeafTag)]

    List of optimizable, or non-optimizable, but stateful parameters

    List of optimizable, or non-optimizable, but stateful parameters

    Stateful means that the state is carried over the repeated forward calls.

    Definition Classes
    TransformerEncoderBlockGenericModule
  26. final def synchronized[T0](arg0: => T0): T0
    Definition Classes
    AnyRef
  27. val train: Boolean
  28. val w1: Constant
  29. val w2: Constant
  30. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException])
  31. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException])
  32. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException]) @native()
  33. final def zeroGrad(): Unit
    Definition Classes
    GenericModule

Inherited from Serializable

Inherited from Product

Inherited from Equals

Inherited from GenericModule[(Variable, STen), Variable]

Inherited from AnyRef

Inherited from Any

Ungrouped