Class for a bucketizer model.
Class for an element wise product model.
Class for an element wise product model.
vector for scaling feature vectors
Class for hashing token frequencies into a vector.
Class for hashing token frequencies into a vector.
Source adapted from: Apache Spark Utils and HashingTF, see NOTICE for contributors
size of feature vector to hash into
Class for MaxAbs Scaler model.
Class for MinMax Scaler Transformer
Class for MinMax Scaler Transformer
MinMax Scaler will use the Min/Max values to scale input features.
minimum values from training features
maximum values from training features
Created by mikhail on 9/29/16.
Class for storing a normalizer model.
Class for a one hot encoder model.
Class for a one hot encoder model.
One hot encoders are used to vectorize nominal features in preparation for models such as linear regression or logistic regression where binary and not multinomial features are supported in the feature vector.
size of the output one hot vectors
Class for principal components analysis model.
Class for principal components analysis model.
matrix of principal components
Created by mikhail on 10/16/16.
Created by mikhail on 10/16/16.
Class for a reverse string indexer model.
Class for a reverse string indexer model.
This model reverses the StringIndexerModel model. Use this to go from an integer representation of a label to a string.
labels for reverse string indexing
Class for standard scaler models.
Class for standard scaler models.
Standard scaler will use stddev, mean, or both to scale a feature vector down.
optional standard deviations of features
optional means of features
Created by mikhail on 10/16/16.
Class for string indexer model.
Class for string indexer model.
String indexer converts a string into an integer representation.
list of labels that can be indexed
Class for a tokenizer model.
Class for a tokenizer model.
Default regular expression for tokenizing strings is defined by TokenizerModel.defaultTokenizer
regular expression used for tokenizing strings
Class for a vector assembler model.
Class for a vector assembler model.
Vector assemblers take an input set of doubles and vectors and create a new vector out of them. This is primarily used to get all desired features into one vector before training a model.
Companion object for defaults.
Companion object for defaults.
Class for a bucketizer model.
Bucketizer will place incoming feature into a bucket.
splits used to determine bucket