Package

com.databricks.labs.automl

pipeline

Permalink

package pipeline

Visibility
  1. Public
  2. All

Type Members

  1. abstract class AbstractTransformer extends Transformer with HasAutoMlIdColumn with HasDebug with HasPipelineId

    Permalink

  2. class AutoMlOutputDatasetTransformer extends AbstractTransformer with DefaultParamsWritable with HasLabelColumn with HasFeaturesColumns

    Permalink

  3. class CardinalityLimitColumnPrunerTransformer extends AbstractTransformer with DefaultParamsWritable with HasLabelColumn with HasTransformCalculated

    Permalink

  4. class ColumnNameTransformer extends Transformer with DefaultParamsWritable with HasInputCols with HasOutputCols with HasDebug with HasPipelineId

    Permalink

  5. class CovarianceFilterTransformer extends AbstractTransformer with DefaultParamsWritable with HasLabelColumn with HasFeaturesColumns with HasFieldsRemoved with HasTransformCalculated with HasFeatureColumn

    Permalink

  6. class DataSanitizerTransformer extends AbstractTransformer with DefaultParamsWritable with HasLabelColumn with HasFeatureColumn

    Permalink

  7. class DatasetsUnionTransformer extends AbstractTransformer with DefaultParamsWritable

    Permalink

  8. class DateFieldTransformer extends AbstractTransformer with DefaultParamsWritable with DataValidation with HasLabelColumn

    Permalink

  9. class DropColumnsTransformer extends AbstractTransformer with DefaultParamsWritable with HasInputCols

    Permalink

  10. class DropTempTableTransformer extends AbstractTransformer with DefaultParamsWritable with WithNoopsStage

    Permalink

  11. final case class FeatureEngineeringOutput(pipelineModel: PipelineModel, originalDfViewName: String, decidedModel: String, transformedForTrainingDf: DataFrame) extends Product with Serializable

    Permalink
  12. class FeaturePipeline extends DataValidation

    Permalink
  13. trait HasAutoMlIdColumn extends Params

    Permalink

  14. trait HasDebug extends Params

    Permalink

    Base trait for setting/accessing debug flags.

    Base trait for setting/accessing debug flags. Meant to be extended by all pipeline stages, which inherit pipeline stage logging by default

  15. trait HasFeatureColumn extends Params

    Permalink

  16. trait HasFeaturesColumns extends Params

    Permalink

  17. trait HasFieldsRemoved extends Params

    Permalink

  18. trait HasFieldsToIgnore extends Params

    Permalink

  19. trait HasInteractionColumns extends Params

    Permalink

    Trait for defining whether interaction columns have been set for the application of Feature Interactions

    Trait for defining whether interaction columns have been set for the application of Feature Interactions

    Since

    0.6.2

  20. trait HasLabelColumn extends Params

    Permalink

  21. trait HasPipelineId extends Params

    Permalink

    Since

    0.6.1 trait for decorating all pipeline stages with pipeline ID. Helpful when troubleshooting logs with a given pipeline ID (eg, fetched from MLflow)

  22. trait HasTransformCalculated extends Params

    Permalink

  23. class InteractionTransformer extends AbstractTransformer with DefaultParamsWritable with HasInteractionColumns

    Permalink

    Transformer for creating interacted feature fields based on FeatureInteraction module

    Transformer for creating interacted feature fields based on FeatureInteraction module

    Since

    0.6.2

  24. trait IsTrainingStage extends AnyRef

    Permalink

  25. class MlFlowLoggingValidationStageTransformer extends AbstractTransformer with DefaultParamsWritable with WithNoopsStage

    Permalink

  26. class OutlierFilterTransformer extends AbstractTransformer with DefaultParamsWritable with HasLabelColumn with HasFieldsToIgnore with IsTrainingStage

    Permalink

  27. class PearsonFilterTransformer extends AbstractTransformer with DefaultParamsWritable with HasLabelColumn with HasFeatureColumn with HasFeaturesColumns with HasFieldsRemoved with HasTransformCalculated

    Permalink

  28. class RegisterTempTableTransformer extends AbstractTransformer with DefaultParamsWritable

    Permalink

  29. class RepartitionTransformer extends AbstractTransformer with WithNoopsStage with DefaultParamsWritable with HasDebug

    Permalink

    A WithNoopsStage transformer stage that is helpful to repartition a DataFrame coming out of any pipeline stage

  30. class RoundUpDoubleTransformer extends AbstractTransformer with DefaultParamsWritable with HasTransformCalculated with HasInputCols

    Permalink

  31. class SQLWrapperTransformer extends AbstractTransformer with DefaultParamsWritable

    Permalink

  32. class SyntheticFeatureGenTransformer extends AbstractTransformer with HasLabelColumn with HasFeatureColumn with HasFieldsToIgnore with DefaultParamsWritable with IsTrainingStage

    Permalink

  33. class VarianceFilterTransformer extends AbstractTransformer with DefaultParamsWritable with HasLabelColumn with HasFeatureColumn with HasTransformCalculated

    Permalink

  34. final case class VectorizationOutput(pipelineModel: PipelineModel, vectorizedCols: Array[String]) extends Product with Serializable

    Permalink
  35. trait WithNoopsStage extends AnyRef

    Permalink

    Marker interface to signify any transformer extending this trait will not alter an input dataset.

    Marker interface to signify any transformer extending this trait will not alter an input dataset. This is only for the edge cases where it is required to do an external Ops before pipeline execution can continue. An example would be to do Mlflow params Validation before training continues. Helpful in scenarios where fail-fast feature is needed Example transformers are DropTempTableTransformer, MlFlowLoggingValidationStageTransformer.

    NOTE: Noops implies no changes to the input Dataset, but the implementation can result in a change to an external state

  36. class ZipRegisterTempTransformer extends AbstractTransformer with DefaultParamsWritable with HasLabelColumn with HasFeaturesColumns

    Permalink

Value Members

  1. object AutoMlOutputDatasetTransformer extends DefaultParamsReadable[AutoMlOutputDatasetTransformer] with Serializable

    Permalink
  2. object CardinalityLimitColumnPrunerTransformer extends DefaultParamsReadable[CardinalityLimitColumnPrunerTransformer] with Serializable

    Permalink
  3. object ColumnNameTransformer extends DefaultParamsReadable[ColumnNameTransformer] with Serializable

    Permalink
  4. object CovarianceFilterTransformer extends DefaultParamsReadable[CovarianceFilterTransformer] with Serializable

    Permalink
  5. object DataSanitizerTransformer extends DefaultParamsReadable[DataSanitizerTransformer] with Serializable

    Permalink
  6. object DatasetsUnionTransformer extends DefaultParamsReadable[DatasetsUnionTransformer] with Serializable

    Permalink
  7. object DateFieldTransformer extends DefaultParamsReadable[DateFieldTransformer] with Serializable

    Permalink
  8. object DropColumnsTransformer extends DefaultParamsReadable[DropColumnsTransformer] with Serializable

    Permalink
  9. object DropTempTableTransformer extends DefaultParamsReadable[DropTempTableTransformer] with Serializable

    Permalink
  10. object FeatureEngineeringPipelineContext

    Permalink
  11. object InteractionTransformer extends DefaultParamsReadable[InteractionTransformer] with Serializable

    Permalink
  12. object MlFlowLoggingValidationStageTransformer extends DefaultParamsReadable[MlFlowLoggingValidationStageTransformer] with Serializable

    Permalink
  13. object OutlierFilterTransformer extends DefaultParamsReadable[OutlierFilterTransformer] with Serializable

    Permalink
  14. object PearsonFilterTransformer extends DefaultParamsReadable[PearsonFilterTransformer] with Serializable

    Permalink
  15. object PipelineEnums extends Enumeration

    Permalink
  16. object PipelineMlFlowProgressReporter

    Permalink

    Since

    0.6.1 Utility for reporting pipeline progress to MLflow

  17. object PipelineStateCache

    Permalink

  18. object PipelineVars extends Enumeration

    Permalink
  19. object RegisterTempTableTransformer extends DefaultParamsReadable[RegisterTempTableTransformer] with Serializable

    Permalink
  20. object RepartitionTransformer extends DefaultParamsReadable[RepartitionTransformer] with Serializable

    Permalink
  21. object RoundUpDoubleTransformer extends DefaultParamsReadable[RoundUpDoubleTransformer] with Serializable

    Permalink
  22. object SQLWrapperTransformer extends DefaultParamsReadable[SQLWrapperTransformer] with Serializable

    Permalink
  23. object SyntheticFeatureGenTransformer extends DefaultParamsReadable[SyntheticFeatureGenTransformer] with Serializable

    Permalink
  24. object VarianceFilterTransformer extends DefaultParamsReadable[VarianceFilterTransformer] with Serializable

    Permalink
  25. object ZipRegisterTempTransformer extends DefaultParamsReadable[ZipRegisterTempTransformer] with Serializable

    Permalink
  26. package inference

    Permalink

Ungrouped