Package

ml.combust.mleap.core

feature

Permalink

package feature

Visibility
  1. Public
  2. All

Type Members

  1. case class BinarizerModel(threshold: Double, inputShape: DataShape) extends Model with Product with Serializable

    Permalink

    Created by mikhail on 10/16/16.

    Created by mikhail on 10/16/16.

    Annotations
    @SparkCode()
  2. sealed trait BinaryOperation extends Serializable

    Permalink

    Created by hollinwilkins on 12/27/16.

  3. case class BucketedRandomProjectionLSHModel(randomUnitVectors: Seq[Vector], bucketLength: Double, inputSize: Int) extends LSHModel with Product with Serializable

    Permalink

    Created by hollinwilkins on 12/28/16.

  4. case class BucketizerModel(splits: Array[Double]) extends Model with Product with Serializable

    Permalink

    Class for a bucketizer model.

    Class for a bucketizer model.

    Bucketizer will place incoming feature into a bucket.

    splits

    splits used to determine bucket

    Annotations
    @SparkCode()
  5. case class ChiSqSelectorModel(filterIndices: Seq[Int], inputSize: Int) extends Model with Product with Serializable

    Permalink

    Created by hollinwilkins on 12/27/16.

    Created by hollinwilkins on 12/27/16.

    Annotations
    @SparkCode()
  6. case class CoalesceModel(nullableInputs: Seq[Boolean]) extends Model with Product with Serializable

    Permalink

    Created by hollinwilkins on 1/5/17.

  7. case class CountVectorizerModel(vocabulary: Array[String], binary: Boolean, minTf: Double) extends Model with Product with Serializable

    Permalink

    Created by hollinwilkins on 12/28/16.

    Created by hollinwilkins on 12/28/16.

    Annotations
    @SparkCode()
  8. case class DCTModel(inverse: Boolean, inputSize: Int) extends Model with Product with Serializable

    Permalink

    Created by hollinwilkins on 12/28/16.

  9. case class ElementwiseProductModel(scalingVec: Vector) extends Model with Product with Serializable

    Permalink

    Class for an element wise product model.

    Class for an element wise product model.

    scalingVec

    vector for scaling feature vectors

    Annotations
    @SparkCode()
  10. case class FeatureEncoder(numFeatures: Array[Int]) extends Product with Serializable

    Permalink
  11. sealed trait HandleInvalid extends AnyRef

    Permalink
  12. case class HashingTermFrequencyModel(numFeatures: Int = 1 << 18, binary: Boolean = false) extends Model with Product with Serializable

    Permalink

    Class for hashing token frequencies into a vector.

    Class for hashing token frequencies into a vector.

    Source adapted from: Apache Spark Utils and HashingTF, see NOTICE for contributors

    numFeatures

    size of feature vector to hash into

    Annotations
    @SparkCode()
  13. case class IDFModel(idf: Vector) extends Model with Product with Serializable

    Permalink

    Created by hollinwilkins on 12/28/16.

    Created by hollinwilkins on 12/28/16.

    Annotations
    @SparkCode()
  14. case class ImputerModel(surrogateValue: Double, missingValue: Double, strategy: String, nullableInput: Boolean = true) extends Model with Product with Serializable

    Permalink

    Created by mikhail on 12/18/16.

  15. case class InteractionModel(featuresSpec: Array[Array[Int]], inputShapes: Seq[DataShape]) extends Model with Product with Serializable

    Permalink

    Created by hollinwilkins on 4/26/17.

    Created by hollinwilkins on 4/26/17.

    Annotations
    @SparkCode()
  16. trait LSHModel extends Model

    Permalink

    Created by hollinwilkins on 12/28/16.

  17. case class LabeledPoint(label: Double, features: Vector) extends Product with Serializable

    Permalink

    Created by hollinwilkins on 12/25/16.

  18. case class MathBinaryModel(operation: BinaryOperation, da: Option[Double] = None, db: Option[Double] = None) extends Model with Product with Serializable

    Permalink
  19. case class MathUnaryModel(operation: UnaryOperation) extends Model with Product with Serializable

    Permalink
  20. case class MaxAbsScalerModel(maxAbs: Vector) extends Model with Product with Serializable

    Permalink

    Class for MaxAbs Scaler model.

    Class for MaxAbs Scaler model.

    maxAbs

    max absolute value

    Annotations
    @SparkCode()
  21. case class MinHashLSHModel(randomCoefficients: Seq[(Int, Int)], inputSize: Int) extends LSHModel with Product with Serializable

    Permalink
  22. case class MinMaxScalerModel(originalMin: Vector, originalMax: Vector) extends Model with Product with Serializable

    Permalink

    Class for MinMax Scaler Transformer

    Class for MinMax Scaler Transformer

    MinMax Scaler will use the Min/Max values to scale input features.

    originalMin

    minimum values from training features

    originalMax

    maximum values from training features

    Annotations
    @SparkCode()
  23. case class MultinomialLabelerModel(threshold: Double, indexer: ReverseStringIndexerModel) extends Model with Product with Serializable

    Permalink
  24. case class NGramModel(n: Int) extends Model with Product with Serializable

    Permalink

    Created by mikhail on 9/29/16.

  25. case class NormalizerModel(pNorm: Double, inputSize: Int) extends Model with Product with Serializable

    Permalink

    Class for storing a normalizer model.

    Class for storing a normalizer model.

    pNorm

    p normalization param

    Annotations
    @SparkCode()
  26. case class OneHotEncoderModel(size: Int, dropLast: Boolean = true) extends Model with Product with Serializable

    Permalink

    Class for a one hot encoder model.

    Class for a one hot encoder model.

    One hot encoders are used to vectorize nominal features in preparation for models such as linear regression or logistic regression where binary and not multinomial features are supported in the feature vector.

    size

    size of the output one hot vectors

  27. case class PcaModel(principalComponents: DenseMatrix) extends Model with Product with Serializable

    Permalink

    Class for principal components analysis model.

    Class for principal components analysis model.

    principalComponents

    matrix of principal components

  28. case class PolynomialExpansionModel(degree: Int, inputSize: Int) extends Model with Product with Serializable

    Permalink

    Created by mikhail on 10/16/16.

    Created by mikhail on 10/16/16.

    Annotations
    @SparkCode()
  29. case class RegexTokenizerModel(regex: Regex, matchGaps: Boolean = true, tokenMinLength: Int = 1, lowercaseText: Boolean = true) extends Model with Product with Serializable

    Permalink
  30. case class ReverseStringIndexerModel(labels: Seq[String]) extends Model with Product with Serializable

    Permalink

    Class for a reverse string indexer model.

    Class for a reverse string indexer model.

    This model reverses the StringIndexerModel model. Use this to go from an integer representation of a label to a string.

    labels

    labels for reverse string indexing

  31. case class StandardScalerModel(std: Option[Vector], mean: Option[Vector]) extends Model with Product with Serializable

    Permalink

    Class for standard scaler models.

    Class for standard scaler models.

    Standard scaler will use stddev, mean, or both to scale a feature vector down.

    std

    optional standard deviations of features

    mean

    optional means of features

    Annotations
    @SparkCode()
  32. case class StopWordsRemoverModel(stopWords: Seq[String], caseSensitive: Boolean) extends Model with Product with Serializable

    Permalink

    Created by mikhail on 10/16/16.

  33. case class StringIndexerModel(labels: Seq[String], handleInvalid: HandleInvalid = HandleInvalid.Error) extends Model with Product with Serializable

    Permalink

    Class for string indexer model.

    Class for string indexer model.

    String indexer converts a string into an integer representation.

    labels

    list of labels that can be indexed

    handleInvalid

    how to handle invalid values (unseen or NULL labels): 'error' (throw an error), 'skip' (doesn't work in MLeap Runtime and also throws an error) or 'keep' (put invalid data in a special bucket at index labels.size

  34. case class StringMapModel(labels: Map[String, Double]) extends Model with Product with Serializable

    Permalink

    Created by hollinwilkins on 1/5/17.

  35. case class TokenizerModel(regex: String = "\\s") extends Model with Product with Serializable

    Permalink

    Class for a tokenizer model.

    Class for a tokenizer model.

    regex

    regular expression used for tokenizing strings

  36. sealed trait UnaryOperation extends AnyRef

    Permalink

    Created by hollinwilkins on 12/27/16.

  37. case class VectorAssemblerModel(inputShapes: Seq[DataShape]) extends Model with Product with Serializable

    Permalink

    Class for a vector assembler model.

    Class for a vector assembler model.

    Vector assemblers take an input set of doubles and vectors and create a new vector out of them. This is primarily used to get all desired features into one vector before training a model.

    Annotations
    @SparkCode()
  38. case class VectorIndexerModel(numFeatures: Int, categoryMaps: Map[Int, Map[Double, Int]]) extends Model with Product with Serializable

    Permalink

    Created by hollinwilkins on 12/28/16.

    Created by hollinwilkins on 12/28/16.

    Annotations
    @SparkCode()
  39. case class VectorSlicerModel(indices: Array[Int], namedIndices: Array[(String, Int)] = Array(), inputSize: Int) extends Model with Product with Serializable

    Permalink

    Created by hollinwilkins on 12/28/16.

    Created by hollinwilkins on 12/28/16.

    Annotations
    @SparkCode()
  40. case class WordLengthFilterModel(length: Int = 3) extends Model with Product with Serializable

    Permalink

    Created by mageswarand on 14/2/17.

  41. case class WordToVectorModel(wordIndex: Map[String, Int], wordVectors: Array[Double]) extends Model with Product with Serializable

    Permalink

    Created by hollinwilkins on 12/28/16.

Value Members

  1. object BinaryOperation extends Serializable

    Permalink
  2. object HandleInvalid

    Permalink
  3. object HashingTermFrequencyModel extends Serializable

    Permalink
  4. object MinHashLSHModel extends Serializable

    Permalink

    Created by hollinwilkins on 12/28/16.

  5. object MultinomialLabelerModel extends Serializable

    Permalink

    Created by hollinwilkins on 1/18/17.

  6. object TokenizerModel extends Serializable

    Permalink

    Companion object for defaults.

  7. object UnaryOperation

    Permalink

Ungrouped