Class

com.databricks.labs.automl.pipeline

FeaturePipeline

Related Doc: package pipeline

class FeaturePipeline extends DataValidation

Linear Supertypes
DataValidation, AnyRef, Any

Instance Constructors

  1. new FeaturePipeline(data: DataFrame, isInferenceRun: Boolean = false)

Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. def _allowableCardinalilties: List[String]

    Definition Classes
    DataValidation
  5. def _allowableCategoricalFilterModes: List[String]

    Definition Classes
    DataValidation
  6. def _allowableDateTimeConversions: List[String]

    Definition Classes
    DataValidation
  7. def applyOneHotEncoding(featureColumns: Array[String], totalFields: Array[String]): (DataFrame, Array[String], Array[String])

  8. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  9. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  10. def convertDateAndTime(df: DataFrame, dateFields: List[String], timeFields: List[String], mode: String): (DataFrame, List[String])

    Definition Classes
    DataValidation
  11. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  12. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  13. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  14. def generateAssembly(numericColumns: List[String], characterColumns: List[String], featureCol: String): (Array[StringIndexer], Array[String], VectorAssembler)

    Definition Classes
    DataValidation
  15. def getCardinalityCheckMode: String

  16. def getCardinalityLimit: Int

  17. def getCardinalityPrecision: Double

  18. def getCardinalitySwitchSetting: Boolean

  19. def getCardinalityType: String

  20. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  21. def getDateTimeConversionType: String

  22. def getFeatureCol: String

  23. def getLabelCol: String

  24. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  25. def indexStrings(categoricalFields: List[String]): (Array[StringIndexer], Array[String])

    Definition Classes
    DataValidation
  26. def invalidateSelection(value: String, allowances: Seq[String]): String

    Definition Classes
    DataValidation
  27. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  28. def makeFeaturePipeline(ignoreList: Array[String]): (DataFrame, Array[String], Array[String])

    Public method for creating a feature vector. Tasks that are covered:

    1. Checking types and ensuring that the label column specified in the config is present in the DataFrame
    2. Separating numeric types from categorical types
    3. Performing cardinality validation on categorical types
    4. String-indexing the available fields
    5. Converting DateTime fields to numeric types
    6. Assembling the indexers into a vector assembler to create the feature vector

    ignoreList

    Fields in the DataFrame to ignore for processing

    returns

    The DataFrame with a feature vector (the first element of the returned tuple).

  29. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  30. final def notify(): Unit

    Definition Classes
    AnyRef
  31. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  32. def oneHotEncodeStrings(stringIndexedFields: List[String]): (OneHotEncoderEstimator, Array[String])

    Definition Classes
    DataValidation
  33. def setCardinalityCheck(value: Boolean): FeaturePipeline.this.type

  34. def setCardinalityCheckMode(value: String): FeaturePipeline.this.type

  35. def setCardinalityLimit(value: Int): FeaturePipeline.this.type

  36. def setCardinalityPrecision(value: Double): FeaturePipeline.this.type

  37. def setCardinalityType(value: String): FeaturePipeline.this.type

  38. def setDateTimeConversionType(value: String): FeaturePipeline.this.type

  39. def setFeatureCol(value: String): FeaturePipeline.this.type

  40. def setLabelCol(value: String): FeaturePipeline.this.type

  41. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  42. def toString(): String

    Definition Classes
    AnyRef → Any
  43. def validateCardinality(df: DataFrame, stringFields: List[String], cardinalityLimit: Int = 500, parallelism: Int = 20): ValidatedCategoricalFields

    Definition Classes
    DataValidation
  44. def validateFieldPresence(df: DataFrame, column: String): Unit

    Definition Classes
    DataValidation
  45. def validateInputDataframe(df: DataFrame): Unit

    Definition Classes
    DataValidation
  46. def validateLabelAndFeatures(df: DataFrame, labelCol: String, featureCol: String): Unit

    Definition Classes
    DataValidation
  47. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  48. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  49. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
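
Usage sketch

The constructor, the chained setters, and makeFeaturePipeline listed above might be combined roughly as follows. This is a minimal sketch, not the library's documented workflow: the toy data, the column names, and the setter values are hypothetical, and the names firstFields and secondFields are placeholders, since this page does not describe the two Array[String] elements of the returned tuple. It assumes Spark and the AutoML toolkit are on the classpath.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import com.databricks.labs.automl.pipeline.FeaturePipeline

object FeaturePipelineSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("feature-pipeline-sketch")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical toy data; column names are illustrative only.
    val raw: DataFrame = Seq(
      (1L, "a", 1.0, 0),
      (2L, "b", 2.5, 1),
      (3L, "a", 0.7, 0)
    ).toDF("id", "category", "amount", "label")

    // Each setter returns FeaturePipeline.this.type, so calls can be chained.
    // The values passed here are assumptions, not recommended settings.
    val pipeline = new FeaturePipeline(raw)
      .setLabelCol("label")
      .setFeatureCol("features")
      .setCardinalityCheck(true)
      .setCardinalityLimit(200)

    // Signature: makeFeaturePipeline(ignoreList: Array[String])
    //   : (DataFrame, Array[String], Array[String])
    // "id" is passed as a field to ignore for processing.
    val (featurized, firstFields, secondFields) =
      pipeline.makeFeaturePipeline(Array("id"))

    featurized.printSchema()
    spark.stop()
  }
}
```

The second constructor argument, isInferenceRun, defaults to false and is left at its default here.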
