Class

com.databricks.labs.automl.sanitize

FeatureCorrelationDetection

Related Doc: package sanitize

Permalink

class FeatureCorrelationDetection extends SparkSessionWrapper

Linear Supertypes
SparkSessionWrapper, Serializable, Serializable, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. FeatureCorrelationDetection
  2. SparkSessionWrapper
  3. Serializable
  4. Serializable
  5. AnyRef
  6. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new FeatureCorrelationDetection(data: DataFrame, fieldListing: Array[String])

    Permalink

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  5. def buildFeaturePairs(): Array[FieldPairs]

    Permalink

    Create the left/right testing pairs to be used in determining correlation between feature fields

    Create the left/right testing pairs to be used in determining correlation between feature fields

    returns

    Array of distinct pairs of feature fields to test

    Since

    0.6.2

  6. def calculateFeatureCorrelation: Array[FieldCorrelationPayload]

    Permalink

    Method for calculating all of the pairwise correlation calculations for the feature fields

    Method for calculating all of the pairwise correlation calculations for the feature fields

    returns

    Array of FieldCorrelationPayload data (left/right name pairs and the correlation value)

    Since

    0.6.2

  7. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  8. def determineFieldsToDrop: FieldRemovalPayload

    Permalink

    Method for determining which columns need to be dropped from the feature set based on the correlation cutoff settings

    Method for determining which columns need to be dropped from the feature set based on the correlation cutoff settings

    returns

    FieldRemovalPayload that contains the removal and retain fields.

    Annotations
    @throws( classOf[RuntimeException] )
    Since

    0.6.2

    Exceptions thrown

    FeatureCorrelationException: totalFields, removedFields

  9. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  10. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  11. def filterFeatureCorrelation(): DataFrame

    Permalink
  12. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  13. def generateFeatureCorrelationReport: DataFrame

    Permalink

    Debug method to allow for an inspection of the correlation between each feature value to one another

    Debug method to allow for an inspection of the correlation between each feature value to one another

    returns

    DataFrame that contains the pair information and the correlation values of those pairs

    Since

    0.6.2

  14. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  15. def getCorrelationCutoffHigh: Double

    Permalink
  16. def getCorrelationCutoffLow: Double

    Permalink
  17. def getLabelCol: String

    Permalink
  18. def getParallelism: Int

    Permalink
  19. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  20. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  21. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  22. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  23. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  24. lazy val sc: SparkContext

    Permalink
    Definition Classes
    SparkSessionWrapper
  25. def setCorrelationCutoffHigh(value: Double): FeatureCorrelationDetection.this.type

    Permalink
  26. def setCorrelationCutoffLow(value: Double): FeatureCorrelationDetection.this.type

    Permalink
  27. def setLabelCol(value: String): FeatureCorrelationDetection.this.type

    Permalink
  28. def setParallelism(value: Int): FeatureCorrelationDetection.this.type

    Permalink
  29. lazy val spark: SparkSession

    Permalink
    Definition Classes
    SparkSessionWrapper
  30. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  31. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  32. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  33. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  34. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from SparkSessionWrapper

Inherited from Serializable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped