All Classes and Interfaces
Abstract base class of classifiers.
The accuracy is the proportion of true results (both true positives and
true negatives) in the population.
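In standard confusion-matrix notation (TP, TN, FP, FN for true/false positives/negatives), this is
accuracy = (TP + TN) / (TP + TN + FP + FN).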
The activation function in hidden layers.
The activation function.
AdaBoost (Adaptive Boosting) classifier with decision trees.
Adaptive Moment optimizer.
Adjusted Mutual Information (AMI) for comparing clusterings.
The normalization method.
Adjusted Rand Index.
Autoregressive model.
The fitting method.
Association Rule Mining.
Autoregressive moving-average model.
Association rule object.
The area under the curve (AUC).
A bag of randomly selected samples.
The bag-of-words feature of text used in natural language
processing and information retrieval.
Balanced Box-Decomposition Tree.
The response variable is of Bernoulli distribution.
Encodes categorical features using a sparse one-hot scheme.
The response variable is of Binomial distribution.
Balanced Iterative Reducing and Clustering using Hierarchies.
The bootstrap is a general tool for assessing statistical accuracy.
Portmanteau test of whether several autocorrelations of a time series
are jointly zero.
The type of test.
Classification and regression tree.
In centroid-based clustering, clusters are represented by a central vector,
which may not necessarily be a member of the data set.
Clustering Large Applications based upon RANdomized Search.
An abstract interface to measure the classification performance.
The classification validation metrics.
Classification model validation results.
Classification model validation results.
A classifier assigns an input object into one of a given number of categories.
The classifier trainer.
Map arbitrary class labels to [0, k), where k is the number of classes.
An abstract interface to measure the clustering performance.
Complete linkage.
The confusion matrix of truth and predictions.
The contingency table.
Neural network cost function.
First-order linear conditional random field.
First-order CRF sequence labeler.
Cross entropy generalizes the log loss metric to multiclass problems.
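With one-hot labels y and predicted class probabilities p over k classes, the usual multiclass form is
cross entropy = -(1/n) * sum over samples i and classes j of y(i,j) * log p(i,j).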
Cross-validation is a technique for assessing how the results of a
statistical analysis will generalize to an independent data set.
Classification trait on DataFrame.
The classifier trainer.
Regression trait on DataFrame.
The regression trainer.
Density-Based Spatial Clustering of Applications with Noise.
A leaf node in a decision tree.
Decision tree.
DENsity CLUstering.
Deterministic annealing clustering.
Naive Bayes classifier for document classification in NLP.
The generation models of naive Bayes classifier.
The connection between neurons.
Elastic Net regularization.
The number of errors in the population.
Fall-out, false alarm rate, or false positive rate (FPR).
The false discovery rate (FDR) is the ratio of false positives
to combined true and false positives, i.e. 1 - precision.
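Writing TP and FP for the true and false positive counts, FDR = FP / (TP + FP).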
Fisher's linear discriminant.
Frequent item set mining based on the FP-growth (frequent pattern growth)
algorithm, which employs an extended prefix-tree (FP-tree) structure to
store the database in a compressed form.
FP-tree data structure used in FP-growth (frequent pattern growth)
algorithm for frequent item set mining.
The F-score (or F-measure) considers both the precision and the recall of the test
to compute the score.
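In its most common F1 form, with precision p and recall r, the score is the harmonic mean
F1 = 2 * p * r / (p + r).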
Genetic algorithm based feature selection.
Gaussian Process for Regression.
Generalized Hebbian Algorithm.
Generalized linear models.
G-Means clustering algorithm, an extended K-Means which tries to
automatically determine the number of clusters by normality test.
Gradient boosting for classification.
Gradient boosting for regression.
Growing Neural Gas.
Feature hashing, also known as the hashing trick, is a fast and
space-efficient way of vectorizing features, i.e. turning arbitrary
features into indices in a vector or matrix.
A hidden layer in the neural network.
The builder of hidden layers.
Agglomerative Hierarchical Clustering.
First-order Hidden Markov Model.
First-order Hidden Markov Model sequence labeler.
Hyperparameter configuration.
Information Value (IV) measures the predictive strength of a feature
for a binary dependent variable.
An input layer in the neural network.
An internal node in CART.
Isolation forest is an unsupervised learning algorithm for anomaly
detection that works on the principle of isolating anomalies.
Isolation tree.
Isometric feature mapping.
Kruskal's non-metric MDS.
A method to calibrate decision function values to probabilities.
A set of items.
Kernel machines.
The learning methods building on kernels.
Kernel PCA transform.
K-Means clustering.
Missing value imputation by K-Medoids clustering.
K-Modes clustering.
K-nearest neighbor classifier.
Missing value imputation with k-nearest neighbors.
Kernel principal component analysis.
Laplacian Eigenmap.
Lasso (least absolute shrinkage and selection operator) regression.
LASVM is an approximate SVM solver that uses online approximation.
A layer in the neural network.
The builder of layers.
Linear discriminant analysis.
A leaf node in a decision tree.
The leaky rectifier activation function max(x, ax) where 0 <= a < 1.
Linear kernel machine.
Linear model.
A measure of dissimilarity between clusters (i.e. sets of observations).
Locally Linear Embedding.
Logistic regression.
Binomial logistic regression.
Multinomial logistic regression.
Log loss is an evaluation metric for binary classifiers, and it is
sometimes also the optimization objective, as in logistic regression
and neural networks.
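For binary labels y in {0, 1} and predicted probabilities p, a common form is
log loss = -(1/n) * sum of [y * log(p) + (1 - y) * log(1 - p)] over the n samples.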
Leave-one-out cross validation.
Regression loss function.
The type of loss.
Mean absolute deviation error.
Matthews correlation coefficient.
Scales each feature by its maximum absolute value.
Maximum Entropy Classifier.
Binomial maximum entropy classifier.
Multinomial maximum entropy classifier.
Classical multidimensional scaling, also known as principal coordinates
analysis.
Non-parametric Minimum Conditional Entropy Clustering.
Fully connected multilayer perceptron neural network for classification.
Fully connected multilayer perceptron neural network for regression.
The GLM model specification.
Model selection criteria.
Mean squared error.
Fully connected multilayer perceptron neural network.
Mutual Information for comparing clusterings.
Naive Bayes classifier.
The neighborhood function for 2-dimensional lattice topology (e.g. a self-organizing map).
Neural Gas soft competitive learning algorithm.
NeuralMap is an efficient competitive learning algorithm inspired by growing
neural gas and BIRCH.
The neuron vertex in the growing neural gas network.
CART tree node.
A node with a nominal split variable.
The data about a potential split for a leaf node.
Normalized Mutual Information (NMI) for comparing clusterings.
The normalization method.
Normalize samples individually to unit norm.
Vector norm.
One-class support vector machine.
Ordinary least squares.
One-vs-one strategy for reducing the problem of
multiclass classification to multiple binary classification problems.
One-vs-rest (or one-vs-all) strategy for reducing the problem of
multiclass classification to multiple binary classification problems.
The neural network optimizer.
A node with an ordinal split variable (real-valued or ordinal categorical value).
The data about a potential split for a leaf node.
The output function of neural networks.
The output layer in the neural network.
The builder of output layers.
Partition clustering.
Principal component analysis.
Platt scaling or Platt calibration is a way of transforming the outputs
of a classification model into a probability distribution over classes.
The response variable is of Poisson distribution.
The precision or positive predictive value (PPV) is the ratio of true positives
to combined true and false positives, which is different from sensitivity.
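In confusion-matrix terms, precision = TP / (TP + FP).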
An abstract interface to measure the probabilistic classification performance.
Probabilistic principal component analysis.
A projection is a kind of feature extraction technique that transforms data
from the input space to a feature space, linearly or non-linearly.
Quadratic discriminant analysis.
R2, the coefficient of determination.
Rand Index.
Random forest for classification.
Random forest for regression.
The base model.
The base model.
Random projection is a promising dimensionality reduction technique for
learning mixtures of Gaussians.
A neuron in radial basis function network.
Radial basis function networks.
Radial basis function network.
Regularized discriminant analysis.
In the information retrieval area, sensitivity is called recall.
Regression analysis includes any techniques for modeling and analyzing
the relationship between a dependent variable and one or more independent
variables.
The regression trainer.
An abstract interface to measure the regression performance.
The regression validation metrics.
A leaf node in a regression tree.
Regression tree.
Regression model validation results.
Regression model validation results.
The rectifier activation function max(0, x).
Ridge Regression.
Root mean squared error.
RMSProp optimizer with adaptive learning rate.
Robustly standardizes numeric features by subtracting
the median and dividing by the IQR.
Residual sum of squares.
Sammon's mapping is an iterative technique for making interpoint
distances in the low-dimensional projection as close as possible to the
interpoint distances in the original high-dimensional data.
Scales the numeric variables into the range [0, 1].
Sensitivity or true positive rate (TPR) (also called hit rate or recall) is a
statistical measure of the performance of a binary classification test.
A sequence labeler assigns a class label to each position of the sequence.
Stochastic gradient descent (with momentum) optimizer.
SHAP (SHapley Additive exPlanations) is a game theoretic approach to
explain the output of any machine learning model.
The Sequential Information Bottleneck algorithm.
Logistic sigmoid function: sigmoid(v)=1/(1+exp(-v)).
The signal-to-noise ratio (S2N) is a univariate feature ranking metric,
which can be used as a feature selection criterion for binary classification
problems.
A simple algorithm that replaces missing values with a constant value
along each column.
Single linkage.
Softmax for the multi-class cross entropy objective function.
Self-Organizing Map.
Encodes numeric and categorical features into a sparse array
with one-hot encoding of categorical variables.
Logistic regression on sparse data.
Binomial logistic regression.
Multinomial logistic regression.
Specificity (SPC) or true negative rate is a statistical measure of the
performance of a binary classification test.
Spectral Clustering.
The data about a potential split for a leaf node.
The criterion to choose variable to split instances.
Standardizes numeric features to zero mean and unit variance.
The ratio of between-groups to within-groups sum of squares is a univariate
feature ranking metric, which can be used as a feature selection criterion
for multi-class classification problems.
Support vector.
Missing value imputation with singular value decomposition.
One-class support vector machines for novelty detection.
Support vector machines for classification.
Epsilon support vector regression.
Epsilon support vector regression.
Hyperbolic tangent activation function.
Time series utility functions.
SHAP of ensemble tree methods.
The t-distributed stochastic neighbor embedding.
Uniform Manifold Approximation and Projection.
Unweighted Pair Group Method with Arithmetic mean (also known as average linkage).
Unweighted Pair Group Method using Centroids (also known as centroid linkage).
Vector quantizer with competitive learning.
Ward's linkage.
Scales all numeric variables into the range [0, 1].
Weighted Pair Group Method with Arithmetic mean.
Weighted Pair Group Method using Centroids (also known as median linkage).
X-Means clustering algorithm, an extended K-Means which tries to
automatically determine the number of clusters based on BIC scores.