org.allenai.nlpstack.parse.poly.ml

TrainingData

case class TrainingData(labeledVectors: Iterable[(FeatureVector, Double)]) extends Product with Serializable

Abstraction for a set of labeled feature vectors.

Provides various serialization options for different machine learning tools.

labeledVectors

a sequence of feature vectors labeled with doubles

Linear Supertypes
Serializable, Serializable, Product, Equals, AnyRef, Any
Known Subclasses
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. TrainingData
  2. Serializable
  3. Serializable
  4. Product
  5. Equals
  6. AnyRef
  7. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new TrainingData(labeledVectors: Iterable[(FeatureVector, Double)])

    labeledVectors

    a sequence of feature vectors labeled with doubles

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  7. def asSvmLight(signature: FeatureEncoding): String

    Expresses this training data in "SVMlight" format, which is <line> .

    Expresses this training data in "SVMlight" format, which is <line> .=. <target> <feature>:<value> ... <feature>:<value> # <info> <target> .=. +1 | -1 | 0 | <float> <feature> .=. <integer> | "qid" <value> .=. <float> <info> .=. <string>

    signature

    the signature to use for encoding feature names as integer

    returns

    the training data in SVMlight format

  8. def binarize(margin: Double): BinaryTrainingData

    Creates "positive" and "negative" feature vectors according to whether the feature cost is greater than margin or less than -margin, respectively.

    Creates "positive" and "negative" feature vectors according to whether the feature cost is greater than margin or less than -margin, respectively.

    Feature vectors that are within margin of zero are filtered from the traing data.

    margin

    the absolute threshold that determines whether a vector is kept

    returns

    a TrainingData instance where all costs are -1 or 1

  9. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  10. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  11. lazy val featureNames: Set[FeatureName]

    The set of feature names found in the training data.

  12. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  13. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  14. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  15. val labeledVectors: Iterable[(FeatureVector, Double)]

    a sequence of feature vectors labeled with doubles

  16. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  17. final def notify(): Unit

    Definition Classes
    AnyRef
  18. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  19. def svmLightLabel(label: Double): String

    Attributes
    protected
  20. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  21. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  22. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  23. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from Serializable

Inherited from Serializable

Inherited from Product

Inherited from Equals

Inherited from AnyRef

Inherited from Any

Ungrouped