org.platanios.tensorflow.api.ops.rnn.attention
Memory to query; usually the output of an RNN encoder. Each tensor in the memory should be shaped [batchSize, maxTime, ...].
Weights tensor with which the memory is multiplied to produce the attention keys.
Weights tensor with which the query is multiplied to produce the attention query.
Weights tensor with which the score components are multiplied before being summed.
Sequence lengths for the batch entries in the memory. If provided, the memory tensor rows are masked with zeros for values past the respective sequence lengths.
Scalar tensor used to normalize the alignment score energy term; usually a trainable variable initialized to sqrt(1 / numUnits).
Vector bias added to the alignment scores prior to applying the non-linearity; usually a variable initialized to zeros.
Optional function that converts computed scores to probabilities. Defaults to the softmax function. A potentially useful alternative is the hardmax function.
Mask value to use for the score before passing it to probabilityFn. Defaults to negative infinity. Note that this value is only used if memorySequenceLengths is not null.
Name prefix to use for all created ops.
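As an illustration of how scoreMaskValue and memorySequenceLengths interact (a NumPy sketch, not the Scala API; the function name is illustrative): score positions past each entry's sequence length are replaced with the mask value, so that after the probability function they receive zero attention weight.

```python
import numpy as np

def masked_softmax(scores, sequence_lengths, score_mask_value=-np.inf):
    # scores: [batchSize, maxTime]; positions at or past each entry's
    # sequence length are replaced with the mask value before softmax.
    max_time = scores.shape[1]
    mask = np.arange(max_time)[None, :] < np.asarray(sequence_lengths)[:, None]
    masked = np.where(mask, scores, score_mask_value)
    e = np.exp(masked - masked.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

# One batch entry of length 2 inside a maxTime of 3: the padded
# third position ends up with exactly zero probability.
probs = masked_softmax(np.array([[1.0, 2.0, 3.0]]), [2])
```

With the default mask value of negative infinity, exp of the masked score is exactly zero, so the remaining probabilities still sum to one over the valid positions.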
Computes an alignment tensor given the provided query and previous alignment tensor.
The previous alignment tensor is important for attention mechanisms that use the previous alignment to calculate the attention at the next time step, such as monotonic attention mechanisms.
TODO: Figure out how to generalize the "next state" functionality.
Query tensor.
Previous alignment tensor.
Tuple containing the alignment tensor and the next attention state.
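The alignment computation can be sketched as follows for the multiplicative (Luong-style) case (a NumPy illustration under assumed shapes, not the Scala API; the function name is hypothetical). The score is a dot product between the query and every memory key, and for plain (non-monotonic) attention the next attention state is simply the alignment itself:

```python
import numpy as np

def luong_align(query, keys):
    # query: [batchSize, numUnits]; keys: [batchSize, maxTime, numUnits].
    # Multiplicative score: dot product of the query with every memory key.
    scores = np.einsum('bu,btu->bt', query, keys)
    # Softmax over the time dimension yields the alignment.
    e = np.exp(scores - scores.max(axis=1, keepdims=True))
    alignment = e / e.sum(axis=1, keepdims=True)
    # For non-monotonic attention the next state is the alignment itself.
    return alignment, alignment

alignment, next_state = luong_align(np.ones((1, 4)), np.ones((1, 3, 4)))
```

A monotonic mechanism would instead use the previous alignment to constrain where the new alignment may place mass, which is why the state is threaded through the tuple.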
Initial alignment value.
This is important for attention mechanisms that use the previous alignment to calculate the alignment at the next time step (e.g., monotonic attention).
The default behavior is to return a tensor of all zeros.
Initial state value.
This is important for attention mechanisms that use the previous alignment to calculate the alignment at the next time step (e.g., monotonic attention).
The default behavior is to return the same output as initialAlignment.
Memory to query; usually the output of an RNN encoder. Each tensor in the memory should be shaped [batchSize, maxTime, ...].
Sequence lengths for the batch entries in the memory. If provided, the memory tensor rows are masked with zeros for values past the respective sequence lengths.
Weights tensor with which the memory is multiplied to produce the attention keys.
Name prefix to use for all created ops.
Vector bias added to the alignment scores prior to applying the non-linearity; usually a variable initialized to zeros.
Scalar tensor used to normalize the alignment score energy term; usually a trainable variable initialized to sqrt(1 / numUnits).
Computes alignment probabilities for score.
Alignment score tensor.
Alignment probabilities tensor.
Optional function that converts computed scores to probabilities. Defaults to the softmax function. A potentially useful alternative is the hardmax function.
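The two probability functions mentioned above differ in how they spread mass over the memory positions. A NumPy sketch (illustrative, not the Scala API): softmax distributes probability smoothly, while hardmax puts all mass on the single best-scoring position.

```python
import numpy as np

def softmax(scores):
    # Smooth distribution over all positions.
    e = np.exp(scores - scores.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def hardmax(scores):
    # All probability mass on the highest-scoring position (one-hot argmax).
    return np.eye(scores.shape[-1])[np.argmax(scores, axis=-1)]

scores = np.array([[1.0, 3.0, 2.0]])
soft = softmax(scores)
hard = hardmax(scores)
```

Hardmax can be useful when a discrete, interpretable alignment is desired, at the cost of a non-differentiable argmax.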
Weights tensor with which the query is multiplied to produce the attention query.
Computes an alignment score for query.
Query tensor.
Score tensor.
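The additive (Bahdanau-style) score combines the processed query and the attention keys as described above. A hedged NumPy sketch under assumed shapes (the function name is illustrative, not the Scala API):

```python
import numpy as np

def additive_score(processed_query, keys, score_weights, score_bias=0.0):
    # processed_query: [batchSize, numUnits] (query times queryWeights),
    # keys: [batchSize, maxTime, numUnits] (memory times memoryWeights),
    # score_weights: [numUnits]. The components are summed, passed through
    # tanh, multiplied by the score weights, and reduced over the units axis.
    activation = np.tanh(keys + processed_query[:, None, :] + score_bias)
    return np.sum(score_weights * activation, axis=-1)

score = additive_score(np.zeros((1, 4)), np.zeros((1, 2, 4)), np.ones(4))
```

The result has shape [batchSize, maxTime], one score per memory position, ready to be masked and passed to probabilityFn.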
Mask value to use for the score before passing it to probabilityFn. Defaults to negative infinity. Note that this value is only used if memorySequenceLengths is not null.
Weights tensor with which the score components are multiplied before being summed.
Luong-style (multiplicative) attention scoring.
This attention has two forms. The first is standard Luong attention, as described in: ["Effective Approaches to Attention-based Neural Machine Translation.", EMNLP 2015](https://arxiv.org/abs/1508.04025).
The second is the scaled form inspired partly by the normalized form of Bahdanau attention. To enable the second form, construct the object with weightsScale set to the value of a scalar scaling variable.

Bahdanau-style (additive) attention scoring.

This attention has two forms. The first is Bahdanau attention, as described in: ["Neural Machine Translation by Jointly Learning to Align and Translate.", ICLR 2015](https://arxiv.org/abs/1409.0473).
The second is a normalized form inspired by the weight normalization method described in: ["Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks.", NIPS 2016](https://arxiv.org/abs/1602.07868).
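The normalized form can be sketched as follows (a NumPy illustration of the weight-normalization idea, not the Scala API; names are hypothetical). The score weights vector v is reduced to its direction, and a learned scalar g controls its magnitude, so the effective weights are g * v / ||v||:

```python
import numpy as np

def normalized_additive_score(processed_query, keys, score_weights, scale, bias):
    # Weight normalization: keep only the direction of score_weights and let
    # the scalar scale g control the magnitude (effective weights g*v/||v||).
    normalized = scale * score_weights / np.linalg.norm(score_weights)
    activation = np.tanh(keys + processed_query[:, None, :] + bias)
    return np.sum(normalized * activation, axis=-1)

# With v = [3, 4] (norm 5) and g = 5, the effective weights are [3, 4].
score = normalized_additive_score(
    np.zeros((1, 2)), np.ones((1, 1, 2)), np.array([3.0, 4.0]), 5.0, np.zeros(2))
```

Decoupling direction from magnitude in this way is the reparameterization trick from the cited weight-normalization paper, applied here to the score weights.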