LSTM

Instance Constructors

new LSTM(inputSize: Int, hiddenSize: Int, p: Double = 0, wRegularizer: Regularizer[T] = null, uRegularizer: Regularizer[T] = null, bRegularizer: Regularizer[T] = null)(implicit arg0: ClassTag[T], ev: TensorNumeric[T])

inputSize
the size of each input vector
hiddenSize
Hidden unit size in the LSTM
p
is used for Dropout probability. For more details about RNN dropouts, please refer to [RnnDrop: A Novel Dropout for RNNs in ASR] (http://www.stat.berkeley.edu/~tsmoon/files/Conference/asru2015.pdf) [A Theoretically Grounded Application of Dropout in Recurrent Neural Networks] (https://arxiv.org/pdf/1512.05287.pdf)

Value Members

final def !=(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def accGradParameters(input: Table, gradOutput: Table): Unit

Computing the gradient of the module with respect to its own parameters.
Computing the gradient of the module with respect to its own parameters. Many modules do not perform this step as they do not have any parameters. The state variable name for the parameters is module dependent. The module is expected to accumulate the gradients with respect to the parameters in some variable.

Definition Classes
Cell → AbstractModule
def addTimes(other: Cell[T]): Unit

Definition Classes
Cell
def apply(name: String): Option[AbstractModule[Activity, Activity, T]]

Find a module with given name.
Find a module with given name. If there is no module with given name, it will return None. If there are multiple modules with the given name, an exception will be thrown.

Definition Classes
AbstractModule
final def asInstanceOf[T0]: T0

Definition Classes
Any
var bRegularizer: Regularizer[T]
def backward(input: Table, gradOutput: Table): Table

Performs a back-propagation step through the module, with respect to the given input.
Performs a back-propagation step through the module, with respect to the given input. In general this method makes the assumption forward(input) has been called before, with the same input. This is necessary for optimization reasons. If you do not respect this rule, backward() will compute incorrect gradients.
input
input data
gradOutput
gradient of next layer
returns
gradient corresponding to input data

Definition Classes
Cell → AbstractModule
var backwardTime: Long

Attributes
protected
Definition Classes
AbstractModule
var backwardTimes: Array[Long]

Definition Classes
Cell
def buildGates()(input1: ModuleNode[T], input2: ModuleNode[T]): (ModuleNode[T], ModuleNode[T], ModuleNode[T], ModuleNode[T])
def buildLSTM(): Graph[T]
def buildModel(): Sequential[T]
def canEqual(other: Any): Boolean

Definition Classes
LSTM → AbstractModule
var cell: AbstractModule[Activity, Activity, T]

Any recurrent kernels should have a cell member variable which represents the module in the kernel.
Any recurrent kernels should have a cell member variable which represents the module in the kernel.
The cell receive an input with a format of T(input, preHiddens), and the output should be a format of T(output, hiddens). The hiddens represents the kernel's output hiddens at the current time step, which will be transferred to next time step. For instance, a simple RnnCell, hiddens is h, for LSTM, hiddens is T(h, c), and for both of them, the output variable represents h. Similarly the preHiddens is the kernel's output hiddens at the previous time step.

Definition Classes
LSTM → Cell
var cellLayer: Sequential[T]
def checkEngineType(): LSTM.this.type

get execution engine type
get execution engine type

Definition Classes
AbstractModule
def clearState(): LSTM.this.type

Clear cached activities to save storage space or network bandwidth.
Clear cached activities to save storage space or network bandwidth. Note that we use Tensor.set to keep some information like tensor share
The subclass should override this method if it allocate some extra resource, and call the super.clearState in the override method

Definition Classes
AbstractModule
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( ... )
def cloneModule(): AbstractModule[Table, Table, T]

Definition Classes
AbstractModule
def copyStatus(src: Module[T]): LSTM.this.type

Copy the useful running status from src to this.
Copy the useful running status from src to this.
The subclass should override this method if it has some parameters besides weight and bias. Such as runningMean and runningVar of BatchNormalization.
src
source Module
returns
this

Definition Classes
AbstractModule
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(other: Any): Boolean

Definition Classes
LSTM → AbstractModule → AnyRef → Any
def evaluate(dataSet: LocalDataSet[MiniBatch[T]], vMethods: Array[ValidationMethod[T]]): Array[(ValidationResult, ValidationMethod[T])]

Definition Classes
AbstractModule
def evaluate(dataset: RDD[Sample[T]], vMethods: Array[ValidationMethod[T]], batchSize: Option[Int] = None): Array[(ValidationResult, ValidationMethod[T])]

use ValidationMethod to evaluate module
use ValidationMethod to evaluate module
dataset
dataset for test
vMethods
validation methods
batchSize
total batchsize of all partitions, optional param and default 4 * partitionNum of dataset

Definition Classes
AbstractModule
def evaluate(): LSTM.this.type

Definition Classes
AbstractModule
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
final def forward(input: Table): Table

Takes an input object, and computes the corresponding output of the module.
Takes an input object, and computes the corresponding output of the module. After a forward, the output state variable should have been updated to the new value.
input
input data
returns
output data

Definition Classes
AbstractModule
var forwardTime: Long

Attributes
protected
Definition Classes
AbstractModule
var forwardTimes: Array[Long]

Definition Classes
Cell
def freeze(names: String*): LSTM.this.type

freeze the module, i.e.
freeze the module, i.e. their parameters(weight/bias, if exists) are not changed in training process if names is not empty, set an array of layers that match the given names to be "freezed",
names
an array of layer names
returns
current graph model

Definition Classes
AbstractModule
var gates: Sequential[T]
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
def getName(): String

Get the module name, default name is className@namePostfix
Get the module name, default name is className@namePostfix

Definition Classes
AbstractModule
def getNamePostfix: String

Definition Classes
AbstractModule
def getNumericType(): TensorDataType

returns
Float or Double

Definition Classes
AbstractModule
def getParameters(): (Tensor[T], Tensor[T])

This method compact all parameters and gradients of the model into two tensors.
This method compact all parameters and gradients of the model into two tensors. So it's easier to use optim method

Definition Classes
AbstractModule
def getParametersTable(): Table

This function returns a table contains ModuleName, the parameter names and parameter value in this module.
This function returns a table contains ModuleName, the parameter names and parameter value in this module. The result table is a structure of Table(ModuleName -> Table(ParameterName -> ParameterValue)), and the type is Table[String, Table[String, Tensor[T]]].
For example, get the weight of a module named conv1: table[Table]("conv1")[Tensor[T]]("weight").
Custom modules should override this function if they have parameters.
returns
Table

Definition Classes
Cell → AbstractModule
def getPrintName(): String

Attributes
protected
Definition Classes
AbstractModule
def getScaleB(): Double

Get the scale of gradientBias
Get the scale of gradientBias

Definition Classes
AbstractModule
def getScaleW(): Double

Get the scale of gradientWeight
Get the scale of gradientWeight

Definition Classes
AbstractModule
def getTimes(): Array[(AbstractModule[_ <: Activity, _ <: Activity, T], Long, Long)]

Definition Classes
Cell → AbstractModule
def getWeightsBias(): Array[Tensor[T]]

Get weight and bias for the module
Get weight and bias for the module
returns
array of weights and bias

Definition Classes
AbstractModule
var gradInput: Table

The cached gradient of activities.
The cached gradient of activities. So we don't compute it again when need it

Definition Classes
AbstractModule
def hasName: Boolean

Definition Classes
AbstractModule
def hashCode(): Int

Definition Classes
LSTM → AbstractModule → AnyRef → Any
def hidResize(hidden: Activity, batchSize: Int, stepShape: Array[Int]): Activity

resize the hidden parameters wrt the batch size, hiddens shapes.
resize the hidden parameters wrt the batch size, hiddens shapes.
e.g. RnnCell contains 1 hidden parameter (H), thus it will return Tensor(size) LSTM contains 2 hidden parameters (C and H) and will return T(Tensor(), Tensor())\ and recursively intialize all the tensors in the Table.
batchSize
batchSize
stepShape
For rnn/lstm/gru, it's embedding size. For convlstm/ convlstm3D, it's a list of outputPlane, length, width, height

Definition Classes
Cell
val hiddenSize: Int

Hidden unit size in the LSTM
def hiddenSizeOfPreTopo: Int

Definition Classes
LSTM → Cell
val hiddensShape: Array[Int]

represents the shape of hiddens which would be transferred to the next recurrent time step.
represents the shape of hiddens which would be transferred to the next recurrent time step. E.g. For RnnCell, it should be Array(hiddenSize) For LSTM, it should be Array(hiddenSize, hiddenSize) (because each time step a LSTM return two hiddens h and c in order, which have the same size.)

Definition Classes
Cell
val inputSize: Int

the size of each input vector
def inputs(first: (ModuleNode[T], Int), nodesWithIndex: (ModuleNode[T], Int)*): ModuleNode[T]

Build graph: some other modules point to current module
Build graph: some other modules point to current module
first
distinguish from another inputs when input parameter list is empty
nodesWithIndex
upstream module nodes and the output tensor index. The start index is 1.
returns
node containing current module

Definition Classes
AbstractModule
def inputs(nodes: ModuleNode[T]*): ModuleNode[T]

Build graph: some other modules point to current module
Build graph: some other modules point to current module
nodes
upstream module nodes
returns
node containing current module

Definition Classes
AbstractModule
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
final def isTraining(): Boolean

Definition Classes
AbstractModule
var line: String

Attributes
protected
Definition Classes
AbstractModule
def loadModelWeights(srcModel: Module[Float], matchAll: Boolean = true): LSTM.this.type

copy weights from another model, mapping by layer name
copy weights from another model, mapping by layer name
srcModel
model to copy from
matchAll
whether to match all layers' weights and bias,
returns
current module

Definition Classes
AbstractModule
def loadWeights(weightPath: String, matchAll: Boolean = true): LSTM.this.type

load pretrained weights and bias to current module
load pretrained weights and bias to current module
weightPath
file to store weights and bias
matchAll
whether to match all layers' weights and bias, if not, only load existing pretrained weights and bias
returns
current module

Definition Classes
AbstractModule
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
var output: Table

The cached output.
The cached output. So we don't compute it again when need it

Definition Classes
AbstractModule
val p: Double

is used for Dropout probability.
is used for Dropout probability. For more details about RNN dropouts, please refer to [RnnDrop: A Novel Dropout for RNNs in ASR] (http://www.stat.berkeley.edu/~tsmoon/files/Conference/asru2015.pdf) [A Theoretically Grounded Application of Dropout in Recurrent Neural Networks] (https://arxiv.org/pdf/1512.05287.pdf)
def parameters(): (Array[Tensor[T]], Array[Tensor[T]])

This function returns two arrays.
This function returns two arrays. One for the weights and the other the gradients Custom modules should override this function if they have parameters
returns
(Array of weights, Array of grad)

Definition Classes
Cell → AbstractModule
var preTopology: TensorModule[T]

The preTopology defines operations to pre-process the input when it is not dependent on the time dimension.
The preTopology defines operations to pre-process the input when it is not dependent on the time dimension. For example, the i2h in SimpleRNN Cell can be calculated before the recurrence since all the input slices are independent.
This is particular useful to boost the performance of the recurrent layer.
Please define your own preTopology according to your Cell structure. Please refer to SimpleRNN or LSTM for reference.

Definition Classes
LSTM → Cell
def predict(dataset: RDD[Sample[T]], batchSize: Int = 1, shareBuffer: Boolean = false): RDD[Activity]

module predict, return the probability distribution
module predict, return the probability distribution
dataset
dataset for prediction
batchSize
total batchSize for all partitions. if -1, default is 4 * partitionNumber of datatset
shareBuffer
whether to share same memory for each batch predict results

Definition Classes
AbstractModule
def predictClass(dataset: RDD[Sample[T]], batchSize: Int = 1): RDD[Int]

module predict, return the predict label
module predict, return the predict label
dataset
dataset for prediction
batchSize
total batchSize for all partitions. if -1, default is 4 * partitionNumber of dataset

Definition Classes
AbstractModule
def quantize(): Module[T]

Definition Classes
AbstractModule
def regluarized(isRegularized: Boolean): Unit

Use this method to set the whether the recurrent cell is regularized
Use this method to set the whether the recurrent cell is regularized
isRegularized
whether to be regularized or not

Definition Classes
Cell
var regularizers: Array[Regularizer[T]]

If the subclass has regularizers, it need to put the regularizers into an array and pass the array into the Cell constructor as an argument.
If the subclass has regularizers, it need to put the regularizers into an array and pass the array into the Cell constructor as an argument. See LSTM as a concrete example.

Definition Classes
Cell
def reset(): Unit

Definition Classes
LSTM → Cell → AbstractModule
def resetTimes(): Unit

Definition Classes
Cell → AbstractModule
def saveCaffe(prototxtPath: String, modelPath: String, useV2: Boolean = true, overwrite: Boolean = false): LSTM.this.type

Definition Classes
AbstractModule
def saveDefinition(path: String, overWrite: Boolean = false): LSTM.this.type

Save this module definition to path.
Save this module definition to path.
path
path to save module, local file system, HDFS and Amazon S3 is supported. HDFS path should be like "hdfs://[host]:[port]/xxx" Amazon S3 path should be like "s3a://bucket/xxx"
overWrite
if overwrite
returns
self

Definition Classes
AbstractModule
def saveModule(path: String, overWrite: Boolean = false): LSTM.this.type

Save this module to path with protobuf format
Save this module to path with protobuf format
path
path to save module, local file system, HDFS and Amazon S3 is supported. HDFS path should be like "hdfs://[host]:[port]/xxx" Amazon S3 path should be like "s3a://bucket/xxx"
overWrite
if overwrite
returns
self

Definition Classes
AbstractModule
def saveTF(inputs: Seq[(String, Seq[Int])], path: String, byteOrder: ByteOrder = ByteOrder.LITTLE_ENDIAN, dataFormat: TensorflowDataFormat = TensorflowDataFormat.NHWC): LSTM.this.type

Definition Classes
AbstractModule
def saveTorch(path: String, overWrite: Boolean = false): LSTM.this.type

Definition Classes
AbstractModule
def saveWeights(path: String, overWrite: Boolean): Unit

save weights and bias to file
save weights and bias to file
path
file to save
overWrite
whether to overwrite or not

Definition Classes
AbstractModule
var scaleB: Double

Attributes
protected
Definition Classes
AbstractModule
var scaleW: Double

The scale of gradient weight and gradient bias before gradParameters being accumulated.
The scale of gradient weight and gradient bias before gradParameters being accumulated.

Attributes
protected
Definition Classes
AbstractModule
def setLine(line: String): LSTM.this.type

Definition Classes
AbstractModule
def setName(name: String): LSTM.this.type

Set the module name
Set the module name

Definition Classes
AbstractModule
def setNamePostfix(namePostfix: String): Unit

Definition Classes
AbstractModule
def setScaleB(b: Double): LSTM.this.type

Set the scale of gradientBias
Set the scale of gradientBias
b
the value of the scale of gradientBias
returns
this

Definition Classes
AbstractModule
def setScaleW(w: Double): LSTM.this.type

Set the scale of gradientWeight
Set the scale of gradientWeight
w
the value of the scale of gradientWeight
returns
this

Definition Classes
AbstractModule
def setWeightsBias(newWeights: Array[Tensor[T]]): LSTM.this.type

Set weight and bias for the module
Set weight and bias for the module
newWeights
array of weights and bias

Definition Classes
AbstractModule
var subModules: Array[AbstractModule[_ <: Activity, _ <: Activity, T]]

Definition Classes
Cell
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
var times: Array[(AbstractModule[_ <: Activity, _ <: Activity, T], Long, Long)]

Definition Classes
Cell
def toGraph(startNodes: ModuleNode[T]*): Graph[T]

Generate graph module with start nodes
Generate graph module with start nodes

Definition Classes
AbstractModule
def toString(): String

Definition Classes
LSTM → AbstractModule → AnyRef → Any
var train: Boolean

Module status.
Module status. It is useful for modules like dropout/batch normalization

Attributes
protected
Definition Classes
AbstractModule
def training(): LSTM.this.type

Definition Classes
AbstractModule
var uRegularizer: Regularizer[T]
def unFreeze(names: String*): LSTM.this.type

"unfreeze" module, i.e.
"unfreeze" module, i.e. make the module parameters(weight/bias, if exists) to be trained(updated) in training process if names is not empty, unfreeze layers that match given names
names
array of module names to unFreeze

Definition Classes
AbstractModule
def updateGradInput(input: Table, gradOutput: Table): Table

Computing the gradient of the module with respect to its own input.
Computing the gradient of the module with respect to its own input. This is returned in gradInput. Also, the gradInput state variable is updated accordingly.

Definition Classes
Cell → AbstractModule
def updateOutput(input: Table): Table

Computes the output using the current parameter set of the class and input.
Computes the output using the current parameter set of the class and input. This function returns the result which is stored in the output field.

Definition Classes
Cell → AbstractModule
def updateParameters(learningRate: T): Unit

Definition Classes
Cell → AbstractModule
var wRegularizer: Regularizer[T]
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
def zeroGradParameters(): Unit

If the module has parameters, this will zero the accumulation of the gradients with respect to these parameters.
If the module has parameters, this will zero the accumulation of the gradients with respect to these parameters. Otherwise, it does nothing.

Definition Classes
Cell → AbstractModule

Deprecated Value Members

def save(path: String, overWrite: Boolean = false): LSTM.this.type

Save this module to path.
Save this module to path.
path
path to save module, local file system, HDFS and Amazon S3 is supported. HDFS path should be like "hdfs://[host]:[port]/xxx" Amazon S3 path should be like "s3a://bucket/xxx"
overWrite
if overwrite
returns
self

Definition Classes
AbstractModule
Annotations
@deprecated
Deprecated
please use recommended saveModule(path, overWrite)

Related Docs: object LSTM | package nn

class LSTM[T] extends Cell[T]

Instance Constructors

new LSTM(inputSize: Int, hiddenSize: Int, p: Double = 0, wRegularizer: Regularizer[T] = null, uRegularizer: Regularizer[T] = null, bRegularizer: Regularizer[T] = null)(implicit arg0: ClassTag[T], ev: TensorNumeric[T])

Value Members

final def !=(arg0: Any): Boolean

final def ##(): Int

final def ==(arg0: Any): Boolean

def accGradParameters(input: Table, gradOutput: Table): Unit

def addTimes(other: Cell[T]): Unit

def apply(name: String): Option[AbstractModule[Activity, Activity, T]]

final def asInstanceOf[T0]: T0

var bRegularizer: Regularizer[T]

def backward(input: Table, gradOutput: Table): Table

var backwardTime: Long

var backwardTimes: Array[Long]

def buildGates()(input1: ModuleNode[T], input2: ModuleNode[T]): (ModuleNode[T], ModuleNode[T], ModuleNode[T], ModuleNode[T])

def buildLSTM(): Graph[T]

def buildModel(): Sequential[T]

def canEqual(other: Any): Boolean

var cell: AbstractModule[Activity, Activity, T]

var cellLayer: Sequential[T]

def checkEngineType(): LSTM.this.type

def clearState(): LSTM.this.type

def clone(): AnyRef

def cloneModule(): AbstractModule[Table, Table, T]

def copyStatus(src: Module[T]): LSTM.this.type

final def eq(arg0: AnyRef): Boolean

def equals(other: Any): Boolean

def evaluate(dataSet: LocalDataSet[MiniBatch[T]], vMethods: Array[ValidationMethod[T]]): Array[(ValidationResult, ValidationMethod[T])]

def evaluate(dataset: RDD[Sample[T]], vMethods: Array[ValidationMethod[T]], batchSize: Option[Int] = None): Array[(ValidationResult, ValidationMethod[T])]

def evaluate(): LSTM.this.type

def finalize(): Unit

final def forward(input: Table): Table

var forwardTime: Long

var forwardTimes: Array[Long]

def freeze(names: String*): LSTM.this.type

var gates: Sequential[T]

final def getClass(): Class[_]

def getName(): String

def getNamePostfix: String

def getNumericType(): TensorDataType

def getParameters(): (Tensor[T], Tensor[T])

def getParametersTable(): Table

def getPrintName(): String

def getScaleB(): Double

def getScaleW(): Double

def getTimes(): Array[(AbstractModule[_ <: Activity, _ <: Activity, T], Long, Long)]

def getWeightsBias(): Array[Tensor[T]]

var gradInput: Table

def hasName: Boolean

def hashCode(): Int

def hidResize(hidden: Activity, batchSize: Int, stepShape: Array[Int]): Activity

val hiddenSize: Int

def hiddenSizeOfPreTopo: Int

val hiddensShape: Array[Int]

val inputSize: Int

def inputs(first: (ModuleNode[T], Int), nodesWithIndex: (ModuleNode[T], Int)*): ModuleNode[T]

def inputs(nodes: ModuleNode[T]*): ModuleNode[T]

final def isInstanceOf[T0]: Boolean

final def isTraining(): Boolean

var line: String

def loadModelWeights(srcModel: Module[Float], matchAll: Boolean = true): LSTM.this.type

def loadWeights(weightPath: String, matchAll: Boolean = true): LSTM.this.type

final def ne(arg0: AnyRef): Boolean

final def notify(): Unit

final def notifyAll(): Unit

var output: Table

val p: Double

def parameters(): (Array[Tensor[T]], Array[Tensor[T]])

var preTopology: TensorModule[T]

def predict(dataset: RDD[Sample[T]], batchSize: Int = 1, shareBuffer: Boolean = false): RDD[Activity]

def predictClass(dataset: RDD[Sample[T]], batchSize: Int = 1): RDD[Int]

def quantize(): Module[T]

def regluarized(isRegularized: Boolean): Unit

var regularizers: Array[Regularizer[T]]

def reset(): Unit

def resetTimes(): Unit

def saveCaffe(prototxtPath: String, modelPath: String, useV2: Boolean = true, overwrite: Boolean = false): LSTM.this.type