case class TransformerEncoder(blocks: Seq[TransformerEncoderBlock]) extends GenericModule[(Variable, STen), Variable]
TransformerEncoder module
Input is (data, tokens)
where data
is (batch, num tokens, in dimension),
double tensor tokens
is (batch,num tokens) long tensor.
Output is (bach, num tokens, out dimension)
The sole purpose of tokens
is to carry over the padding
- Companion:
- object
trait Serializable
trait Product
trait Equals
class Object
trait Matchable
class Any
Value members
Inherited methods
Computes the gradient of loss with respect to the parameters.
Computes the gradient of loss with respect to the parameters.
- Inherited from:
- GenericModule
Returns the total number of optimizable parameters.
Returns the total number of optimizable parameters.
- Inherited from:
- GenericModule
Returns the state variables which need gradient computation.
Returns the state variables which need gradient computation.
- Inherited from:
- GenericModule