Package

io.smartdatalake

definitions

Permalink

package definitions

Visibility
  1. Public
  2. All

Type Members

  1. sealed trait ExecutionMode extends AnyRef

    Permalink

    Execution mode's defines how data is selected when running a data pipeline.

  2. case class PartitionDiffMode(partitionColNb: Option[Int] = None, mainInputId: Option[String] = None, mainOutputId: Option[String] = None, nbOfPartitionValuesPerRun: Option[Int] = None) extends ExecutionMode with Product with Serializable

    Permalink

    Partition difference execution mode lists partitions on input & output DataObject and starts loading all missing partitions.

    Partition difference execution mode lists partitions on input & output DataObject and starts loading all missing partitions. Partition columns to be used for comparision need to be a common 'init' of input and output partition columns.

    partitionColNb

    optional number of partition columns to use as a common 'init'.

    mainInputId

    optional selection of inputId to be used for partition comparision. Only needed if there are multiple input DataObject's.

    mainOutputId

    optional selection of outputId to be used for partition comparision. Only needed if there are multiple output DataObject's.

    nbOfPartitionValuesPerRun

    optional restriction of the number of partition values per run.

Value Members

  1. object DateColumnType extends Enumeration

    Permalink

    Datatype for date columns in Hive

  2. object Environment

    Permalink

    Set this environment dependent configurations at the beginning of the io.smartdatalake.app.SmartDataLakeBuilder implementation for your environment.

  3. object HiveConventions

    Permalink

    Hive conventions

  4. object HiveTableLocationSuffix extends Enumeration

    Permalink

    Suffix used for alternating parquet HDFS paths (usually in TickTockHiveTableDataObject for integration layer)

  5. object OutputType extends Enumeration

    Permalink

    Options for HDFS output

  6. object TechnicalTableColumn extends Enumeration

    Permalink

    Column names specific to historization of Hive tables

Ungrouped