case class AttentionDecoder[T, M <: StatefulModule[Variable, Variable, T], M0 <: Module](decoder: M with StatefulModule[Variable, Variable, T], embedding: M0 with Module, stateToKey: (T) => Variable, keyValue: Variable, tokens: Variable, padToken: Long) extends StatefulModule[Variable, Variable, T] with Product with Serializable
- Alphabetic
- By Inheritance
- AttentionDecoder
- Serializable
- Product
- Equals
- GenericModule
- AnyRef
- Any
- Hide All
- Show All
- Public
- Protected
Instance Constructors
Value Members
- final def !=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
- final def ##: Int
- Definition Classes
- AnyRef → Any
- final def ==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
- def apply[S](a: (Variable, T))(implicit arg0: Sc[S]): (Variable, T)
Alias of forward
Alias of forward
- Definition Classes
- GenericModule
- final def asInstanceOf[T0]: T0
- Definition Classes
- Any
- def clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.CloneNotSupportedException]) @native()
- val decoder: M with StatefulModule[Variable, Variable, T]
- val embedding: M0 with Module
- final def eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
- def finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.Throwable])
- def forward[S](x: (Variable, T))(implicit arg0: Sc[S]): (Variable, T)
The implementation of the function.
The implementation of the function.
In addition of
x
it can also use all thestate to compute its value.
- Definition Classes
- AttentionDecoder → GenericModule
- def forward1[S](x: Variable, state: T)(implicit arg0: Sc[S]): (Variable, T)
- final def getClass(): Class[_ <: AnyRef]
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
- final def gradients(loss: Variable, zeroGrad: Boolean = true): Seq[Option[STen]]
Computes the gradient of loss with respect to the parameters.
Computes the gradient of loss with respect to the parameters.
- Definition Classes
- GenericModule
- final def isInstanceOf[T0]: Boolean
- Definition Classes
- Any
- val keyValue: Variable
- final def learnableParameters: Long
Returns the total number of optimizable parameters.
Returns the total number of optimizable parameters.
- Definition Classes
- GenericModule
- final def ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
- final def notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
- final def notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
- val padToken: Long
- final def parameters: Seq[(Constant, PTag)]
Returns the state variables which need gradient computation.
Returns the state variables which need gradient computation.
- Definition Classes
- GenericModule
- def productElementNames: Iterator[String]
- Definition Classes
- Product
- def state: Seq[(Constant, PTag)]
List of optimizable, or non-optimizable, but stateful parameters
List of optimizable, or non-optimizable, but stateful parameters
Stateful means that the state is carried over the repeated forward calls.
- Definition Classes
- AttentionDecoder → GenericModule
- val stateToKey: (T) => Variable
- final def synchronized[T0](arg0: => T0): T0
- Definition Classes
- AnyRef
- val tokens: Variable
- final def wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException])
- final def wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException])
- final def wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException]) @native()
- final def zeroGrad(): Unit
- Definition Classes
- GenericModule