Package org.deeplearning4j.nn.layers
Class BaseOutputLayer<LayerConfT extends BaseOutputLayer>
- java.lang.Object
-
- org.deeplearning4j.nn.layers.AbstractLayer<LayerConfT>
-
- org.deeplearning4j.nn.layers.BaseLayer<LayerConfT>
-
- org.deeplearning4j.nn.layers.BaseOutputLayer<LayerConfT>
-
- All Implemented Interfaces:
Serializable,Cloneable,Classifier,Layer,IOutputLayer,Model,Trainable
- Direct Known Subclasses:
CenterLossOutputLayer,OCNNOutputLayer,OutputLayer,RnnOutputLayer
public abstract class BaseOutputLayer<LayerConfT extends BaseOutputLayer> extends BaseLayer<LayerConfT> implements Serializable, IOutputLayer
- See Also:
- Serialized Form
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from interface org.deeplearning4j.nn.api.Layer
Layer.TrainingMode, Layer.Type
-
-
Field Summary
Fields Modifier and Type Field Description protected INDArrayinputMaskArrayprotected MaskStateinputMaskArrayStateprotected INDArraylabels-
Fields inherited from class org.deeplearning4j.nn.layers.BaseLayer
gradient, gradientsFlattened, gradientViews, optimizer, params, paramsFlattened, score, weightNoiseParams
-
Fields inherited from class org.deeplearning4j.nn.layers.AbstractLayer
cacheMode, conf, dataType, dropoutApplied, epochCount, index, input, inputModificationAllowed, iterationCount, maskArray, maskState, preOutput, trainingListeners
-
-
Constructor Summary
Constructors Constructor Description BaseOutputLayer(NeuralNetConfiguration conf, DataType dataType)
-
Method Summary
All Methods Instance Methods Abstract Methods Concrete Methods Modifier and Type Method Description INDArrayactivate(INDArray input, boolean training, LayerWorkspaceMgr workspaceMgr)Perform forward pass and return the activations array with the specified inputprotected voidapplyMask(INDArray to)Pair<Gradient,INDArray>backpropGradient(INDArray epsilon, LayerWorkspaceMgr workspaceMgr)Calculate the gradient relative to the error in the next layervoidclear()Clear inputvoidcomputeGradientAndScore(LayerWorkspaceMgr workspaceMgr)Update the scoredoublecomputeScore(double fullNetRegTerm, boolean training, LayerWorkspaceMgr workspaceMgr)Compute score after labels and input have been set.INDArraycomputeScoreForExamples(double fullNetRegTerm, LayerWorkspaceMgr workspaceMgr)Compute the score for each example individually, after labels and input have been set.doublef1Score(INDArray examples, INDArray labels)Returns the f1 score for the given examples.doublef1Score(DataSet data)Sets the input and labels and returns a score for the prediction wrt true labelsvoidfit(INDArray examples, int[] labels)Fit the modelvoidfit(INDArray data, LayerWorkspaceMgr workspaceMgr)Fit the model to the given datavoidfit(INDArray input, INDArray labels)Fit the modelvoidfit(DataSet data)Fit the modelvoidfit(DataSetIterator iter)Train the model based on the datasetiteratorINDArraygetLabels()Get the labels array previously set withIOutputLayer.setLabels(INDArray)protected abstract INDArraygetLabels2d(LayerWorkspaceMgr workspaceMgr, ArrayType arrayType)Gradientgradient()Gets the gradient from one training iterationPair<Gradient,Double>gradientAndScore()Get the gradient and scorebooleanhasBias()Does this layer have no bias term? Many layers (dense, convolutional, output, embedding) have biases by default, but no-bias versions are possible via configurationbooleanisPretrainLayer()Returns true if the layer can be trained in an unsupervised/pretrain manner (AE, VAE, etc)booleanneedsLabels()Returns true if labels are required for this output layerintnumLabels()Returns the number of possible labelsint[]predict(INDArray input)Returns the predictions for each example in the datasetList<String>predict(DataSet dataSet)Return predicted label namesprotected INDArraypreOutput2d(boolean training, LayerWorkspaceMgr workspaceMgr)voidsetLabels(INDArray labels)Set the labels array for this output layerprotected voidsetScoreWithZ(INDArray z)-
Methods inherited from class org.deeplearning4j.nn.layers.BaseLayer
activate, calcRegularizationScore, clearNoiseWeightParams, clone, fit, getGradientsViewArray, getOptimizer, getParam, getParamWithNoise, hasLayerNorm, layerConf, numParams, params, paramTable, paramTable, preOutput, preOutputWithPreNorm, score, setBackpropGradientsViewArray, setParam, setParams, setParams, setParamsViewArray, setParamTable, toString, update, update
-
Methods inherited from class org.deeplearning4j.nn.layers.AbstractLayer
addListeners, allowInputModification, applyConstraints, applyDropOutIfNecessary, assertInputSet, backpropDropOutIfPresent, batchSize, close, conf, feedForwardMaskArray, getConfig, getEpochCount, getHelper, getIndex, getInput, getInputMiniBatchSize, getListeners, getMaskArray, init, input, layerId, numParams, setCacheMode, setConf, setEpochCount, setIndex, setInput, setInputMiniBatchSize, setListeners, setListeners, setMaskArray, type, updaterDivideByMinibatch
-
Methods inherited from class java.lang.Object
equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
-
Methods inherited from interface org.deeplearning4j.nn.api.Layer
activate, allowInputModification, calcRegularizationScore, clearNoiseWeightParams, feedForwardMaskArray, getEpochCount, getHelper, getIndex, getInputMiniBatchSize, getIterationCount, getListeners, getMaskArray, setCacheMode, setEpochCount, setIndex, setInput, setInputMiniBatchSize, setIterationCount, setListeners, setListeners, setMaskArray, type
-
Methods inherited from interface org.deeplearning4j.nn.api.Model
addListeners, applyConstraints, batchSize, close, conf, fit, getGradientsViewArray, getOptimizer, getParam, init, input, numParams, numParams, params, paramTable, paramTable, score, setBackpropGradientsViewArray, setConf, setParam, setParams, setParamsViewArray, setParamTable, update, update
-
Methods inherited from interface org.deeplearning4j.nn.api.Trainable
getConfig, getGradientsViewArray, numParams, params, paramTable, updaterDivideByMinibatch
-
-
-
-
Constructor Detail
-
BaseOutputLayer
public BaseOutputLayer(NeuralNetConfiguration conf, DataType dataType)
-
-
Method Detail
-
computeScore
public double computeScore(double fullNetRegTerm, boolean training, LayerWorkspaceMgr workspaceMgr)Compute score after labels and input have been set.- Specified by:
computeScorein interfaceIOutputLayer- Parameters:
fullNetRegTerm- Regularization score term for the entire networktraining- whether score should be calculated at train or test time (this affects things like application of dropout, etc)- Returns:
- score (loss function)
-
needsLabels
public boolean needsLabels()
Description copied from interface:IOutputLayerReturns true if labels are required for this output layer- Specified by:
needsLabelsin interfaceIOutputLayer- Returns:
- true if this output layer needs labels or not
-
computeScoreForExamples
public INDArray computeScoreForExamples(double fullNetRegTerm, LayerWorkspaceMgr workspaceMgr)
Compute the score for each example individually, after labels and input have been set.- Specified by:
computeScoreForExamplesin interfaceIOutputLayer- Parameters:
fullNetRegTerm- Regularization score term for the entire network (or, 0.0 to not include regularization)- Returns:
- A column INDArray of shape [numExamples,1], where entry i is the score of the ith example
-
computeGradientAndScore
public void computeGradientAndScore(LayerWorkspaceMgr workspaceMgr)
Description copied from interface:ModelUpdate the score- Specified by:
computeGradientAndScorein interfaceModel- Overrides:
computeGradientAndScorein classBaseLayer<LayerConfT extends BaseOutputLayer>
-
setScoreWithZ
protected void setScoreWithZ(INDArray z)
- Overrides:
setScoreWithZin classBaseLayer<LayerConfT extends BaseOutputLayer>
-
gradientAndScore
public Pair<Gradient,Double> gradientAndScore()
Description copied from interface:ModelGet the gradient and score- Specified by:
gradientAndScorein interfaceModel- Overrides:
gradientAndScorein classAbstractLayer<LayerConfT extends BaseOutputLayer>- Returns:
- the gradient and score
-
backpropGradient
public Pair<Gradient,INDArray> backpropGradient(INDArray epsilon, LayerWorkspaceMgr workspaceMgr)
Description copied from interface:LayerCalculate the gradient relative to the error in the next layer- Specified by:
backpropGradientin interfaceLayer- Overrides:
backpropGradientin classBaseLayer<LayerConfT extends BaseOutputLayer>- Parameters:
epsilon- w^(L+1)*delta^(L+1). Or, equiv: dC/da, i.e., (dC/dz)*(dz/da) = dC/da, where C is cost function a=sigma(z) is activation.workspaceMgr- Workspace manager- Returns:
- Pair
where Gradient is gradient for this layer, INDArray is epsilon (activation gradient) needed by next layer, but before element-wise multiply by sigmaPrime(z). So for standard feed-forward layer, if this layer is L, then return.getSecond() == dL/dIn = (w^(L)*(delta^(L))^T)^T. Note that the returned array should be placed in the ArrayType.ACTIVATION_GRADworkspace via the workspace manager
-
gradient
public Gradient gradient()
Gets the gradient from one training iteration- Specified by:
gradientin interfaceModel- Overrides:
gradientin classBaseLayer<LayerConfT extends BaseOutputLayer>- Returns:
- the gradient (bias and weight matrix)
-
activate
public INDArray activate(INDArray input, boolean training, LayerWorkspaceMgr workspaceMgr)
Description copied from interface:LayerPerform forward pass and return the activations array with the specified input- Specified by:
activatein interfaceLayer- Overrides:
activatein classAbstractLayer<LayerConfT extends BaseOutputLayer>- Parameters:
input- the input to usetraining- train or test modeworkspaceMgr- Workspace manager.- Returns:
- Activations array. Note that the returned array should be placed in the
ArrayType.ACTIVATIONSworkspace via the workspace manager
-
f1Score
public double f1Score(DataSet data)
Sets the input and labels and returns a score for the prediction wrt true labels- Specified by:
f1Scorein interfaceClassifier- Parameters:
data- the data to score- Returns:
- the score for the given input,label pairs
-
f1Score
public double f1Score(INDArray examples, INDArray labels)
Returns the f1 score for the given examples.- Specified by:
f1Scorein interfaceClassifier- Parameters:
examples- te the examples to classify (one example in each row)labels- the true labels- Returns:
- the scores for each ndarray
-
numLabels
public int numLabels()
Returns the number of possible labels- Specified by:
numLabelsin interfaceClassifier- Returns:
- the number of possible labels for this classifier
-
fit
public void fit(DataSetIterator iter)
Description copied from interface:ClassifierTrain the model based on the datasetiterator- Specified by:
fitin interfaceClassifier- Parameters:
iter- the iterator to train on
-
predict
public int[] predict(INDArray input)
Returns the predictions for each example in the dataset- Specified by:
predictin interfaceClassifier- Parameters:
input- the matrix to predict- Returns:
- the prediction for the dataset
-
predict
public List<String> predict(DataSet dataSet)
Return predicted label names- Specified by:
predictin interfaceClassifier- Parameters:
dataSet- to predict- Returns:
- the predicted labels for the dataSet
-
fit
public void fit(INDArray input, INDArray labels)
Fit the model- Specified by:
fitin interfaceClassifier- Parameters:
input- the examples to classify (one example in each row)labels- the example labels(a binary outcome matrix)
-
fit
public void fit(DataSet data)
Fit the model- Specified by:
fitin interfaceClassifier- Parameters:
data- the data to train on
-
fit
public void fit(INDArray examples, int[] labels)
Fit the model- Specified by:
fitin interfaceClassifier- Parameters:
examples- the examples to classify (one example in each row)labels- the labels for each example (the number of labels must match
-
clear
public void clear()
Description copied from interface:ModelClear input- Specified by:
clearin interfaceModel- Overrides:
clearin classBaseLayer<LayerConfT extends BaseOutputLayer>
-
fit
public void fit(INDArray data, LayerWorkspaceMgr workspaceMgr)
Description copied from interface:ModelFit the model to the given data- Specified by:
fitin interfaceModel- Overrides:
fitin classBaseLayer<LayerConfT extends BaseOutputLayer>- Parameters:
data- the data to fit the model to
-
getLabels
public INDArray getLabels()
Description copied from interface:IOutputLayerGet the labels array previously set withIOutputLayer.setLabels(INDArray)- Specified by:
getLabelsin interfaceIOutputLayer- Returns:
- Labels array, or null if it has not been set
-
setLabels
public void setLabels(INDArray labels)
Description copied from interface:IOutputLayerSet the labels array for this output layer- Specified by:
setLabelsin interfaceIOutputLayer- Parameters:
labels- Labels array to set
-
preOutput2d
protected INDArray preOutput2d(boolean training, LayerWorkspaceMgr workspaceMgr)
-
applyMask
protected void applyMask(INDArray to)
- Overrides:
applyMaskin classAbstractLayer<LayerConfT extends BaseOutputLayer>
-
getLabels2d
protected abstract INDArray getLabels2d(LayerWorkspaceMgr workspaceMgr, ArrayType arrayType)
-
isPretrainLayer
public boolean isPretrainLayer()
Description copied from interface:LayerReturns true if the layer can be trained in an unsupervised/pretrain manner (AE, VAE, etc)- Specified by:
isPretrainLayerin interfaceLayer- Returns:
- true if the layer can be pretrained (using fit(INDArray), false otherwise
-
hasBias
public boolean hasBias()
Description copied from class:BaseLayerDoes this layer have no bias term? Many layers (dense, convolutional, output, embedding) have biases by default, but no-bias versions are possible via configuration- Overrides:
hasBiasin classBaseLayer<LayerConfT extends BaseOutputLayer>- Returns:
- True if a bias term is present, false otherwise
-
-