A meta pipeline which for a particular value of the layer parameters, returns a data pipe which takes as input Stream of Tuple2 consisting of delta's and gradients of activation function with respect to their local fields (calculated via Activation.grad).
A data pipeline which takes Tuple3 consisting of output layer activations, targets and gradients of output activations with respect to their local fields, respectively and returns the output layer delta values and the loss.
A data pipeline which takes as input a Stream of Tuple2 whose first element is the activation and second element the delta value and outputs the gradient of the layer parameters.
Performs the actual update to the layer parameters after all the gradients have been calculated.
Solve the optimization problem of determining NeuralStack weights, from training data.
Solve the optimization problem of determining NeuralStack weights, from training data.
The number of training data points
Training data
The initial NeuralStack before training
A NeuralStack with the learned layer weights and biases.
Set fraction of data to be used for each SGD iteration.
Set fraction of data to be used for each SGD iteration. Default 1.0 (corresponding to deterministic/classical gradient descent)
Set the number of iterations for SGD.
Set the number of iterations for SGD. Default 100.
Set the regularization parameter.
Set the regularization parameter. Default 0.0.
Set the initial step size of SGD for the first step.
Set the initial step size of SGD for the first step. Default 1.0. In subsequent steps, the step size will decrease with stepSize/sqrt(t)
The type of the parameters for each layer
The type of input/output patterns.