Maxent

java.lang.Object
- smile.classification.Maxent

All Implemented Interfaces:

java.io.Serializable, java.util.function.ToDoubleFunction<int[]>, java.util.function.ToIntFunction<int[]>, Classifier<int[]>, OnlineClassifier<int[]>, SoftClassifier<int[]>
```
public class Maxent
extends java.lang.Object
implements SoftClassifier<int[]>, OnlineClassifier<int[]>
```
Maximum Entropy Classifier. Maximum entropy is a technique for learning probability distributions from data. In maximum entropy models, the observed data itself is assumed to be the testable information. Maximum entropy models don't assume anything about the probability distribution other than what have been observed and always choose the most uniform distribution subject to the observed constraints.
Basically, maximum entropy classifier is another name of multinomial logistic regression applied to categorical independent variables, which are converted to binary dummy variables. Maximum entropy models are widely used in natural language processing. Here, we provide an implementation which assumes that binary features are stored in a sparse array, of which entries are the indices of nonzero features.
See Also:
References A. L. Berger, S. D. Pietra, and V. J. D. Pietra. A maximum entropy approach to natural language processing. Computational Linguistics 22(1):39-71, 1996.
, Serialized Form

Constructor Summary

Constructors
Constructor and Description
`Maxent(double L, double[] w)` Constructor of binary maximum entropy classifier.
`Maxent(double L, double[][] W)` Constructor of multi-class maximum entropy classifier.
`Maxent(double L, double[][] W, smile.util.IntSet labels)` Constructor of multi-class maximum entropy classifier.
`Maxent(double L, double[] w, smile.util.IntSet labels)` Constructor of binary maximum entropy classifier.

Method Summary

All Methods Static Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`int`	`dimension()` Returns the dimension of input space.
`static Maxent`	`fit(int p, int[][] x, int[] y)` Learn maximum entropy classifier.
`static Maxent`	`fit(int p, int[][] x, int[] y, double lambda, double tol, int maxIter)` Learn maximum entropy classifier.
`static Maxent`	`fit(int p, int[][] x, int[] y, java.util.Properties prop)` Learn maximum entropy classifier.
`double`	`getLearningRate()` Returns the learning rate of stochastic gradient descent.
`double`	`loglikelihood()` Returns the log-likelihood of model.
`int`	`predict(int[] x)` Predicts the class label of an instance.
`int`	`predict(int[] x, double[] posteriori)` Predicts the class label of an instance and also calculate a posteriori probabilities.
`void`	`setLearningRate(double rate)` Sets the learning rate of stochastic gradient descent.
`void`	`update(int[] x, int y)` Online update the classifier with a new training instance.

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Methods inherited from interface smile.classification.OnlineClassifier
update

Methods inherited from interface smile.classification.Classifier
applyAsDouble, applyAsInt, f, predict

- Constructor Detail
  - Maxent
```
public Maxent(double L,
              double[] w)
```
    Constructor of binary maximum entropy classifier.
    
    Parameters:
    
    L - the log-likelihood of learned model.
    
    w - the weights.
  - Maxent
```
public Maxent(double L,
              double[] w,
              smile.util.IntSet labels)
```
    Constructor of binary maximum entropy classifier.
    
    Parameters:
    
    L - the log-likelihood of learned model.
    
    w - the weights.
    
    labels - class labels
  - Maxent
```
public Maxent(double L,
              double[][] W)
```
    Constructor of multi-class maximum entropy classifier.
    
    Parameters:
    
    L - the log-likelihood of learned model.
    
    W - the weights of first k - 1 classes.
  - Maxent
```
public Maxent(double L,
              double[][] W,
              smile.util.IntSet labels)
```
    Constructor of multi-class maximum entropy classifier.
    
    Parameters:
    
    L - the log-likelihood of learned model.
    
    W - the weights of first k - 1 classes.
    
    labels - class labels
- Method Detail
  - fit
```
public static Maxent fit(int p,
                         int[][] x,
                         int[] y)
```
    Learn maximum entropy classifier.
    
    Parameters:
    
    p - the dimension of feature space.
    
    x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
    
    y - training labels in [0, k), where k is the number of classes.
  - fit
```
public static Maxent fit(int p,
                         int[][] x,
                         int[] y,
                         java.util.Properties prop)
```
    Learn maximum entropy classifier.
    
    Parameters:
    
    p - the dimension of feature space.
    
    x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
    
    y - training labels in [0, k), where k is the number of classes.
  - fit
```
public static Maxent fit(int p,
                         int[][] x,
                         int[] y,
                         double lambda,
                         double tol,
                         int maxIter)
```
    Learn maximum entropy classifier.
    
    Parameters:
    
    p - the dimension of feature space.
    
    x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
    
    y - training labels in [0, k), where k is the number of classes.
    
    lambda - λ > 0 gives a "regularized" estimate of linear weights which often has superior generalization performance, especially when the dimensionality is high.
    
    tol - the tolerance for stopping iterations.
    
    maxIter - maximum number of iterations.
  - dimension
```
public int dimension()
```
    Returns the dimension of input space.
    
    Returns:
    
    the dimension of input space.
  - update
```
public void update(int[] x,
                   int y)
```
    Description copied from interface: OnlineClassifier
    
    Online update the classifier with a new training instance. In general, this method may be NOT multi-thread safe.
    
    Specified by:
    
    update in interface OnlineClassifier<int[]>
    
    Parameters:
    
    x - training instance.
    
    y - training label.
  - setLearningRate
```
public void setLearningRate(double rate)
```
    Sets the learning rate of stochastic gradient descent. It is a good practice to adapt the learning rate for different data sizes. For example, it is typical to set the learning rate to eta/n, where eta is in [0.1, 0.3] and n is the size of the training data.
    
    Parameters:
    
    rate - the learning rate.
  - getLearningRate
```
public double getLearningRate()
```
    Returns the learning rate of stochastic gradient descent.
  - loglikelihood
```
public double loglikelihood()
```
    Returns the log-likelihood of model.
  - predict
```
public int predict(int[] x)
```
    Description copied from interface: Classifier
    
    Predicts the class label of an instance.
    
    Specified by:
    
    predict in interface Classifier<int[]>
    
    Parameters:
    
    x - the instance to be classified.
    
    Returns:
    
    the predicted class label.
  - predict
```
public int predict(int[] x,
                   double[] posteriori)
```
    Description copied from interface: SoftClassifier
    
    Predicts the class label of an instance and also calculate a posteriori probabilities. Classifiers may NOT support this method since not all classification algorithms are able to calculate such a posteriori probabilities.
    
    Specified by:
    
    predict in interface SoftClassifier<int[]>
    
    Parameters:
    
    x - an instance to be classified.
    
    posteriori - the array to store a posteriori probabilities on output.
    
    Returns:
    
    the predicted class label

Class Maxent

References

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Methods inherited from interface smile.classification.OnlineClassifier

Methods inherited from interface smile.classification.Classifier

Constructor Detail

Maxent

Maxent

Maxent

Maxent

Method Detail

fit

fit

fit

dimension

update

setLearningRate

getLearningRate

loglikelihood

predict

predict