Execution modes define how data is selected when running a data pipeline.
The partition difference execution mode lists the partitions of the input and output DataObjects and starts loading all missing partitions.
Datatype for date columns in Hive
Set these environment-dependent configurations at the beginning of the io.smartdatalake.app.SmartDataLakeBuilder implementation for your environment.
Hive conventions
Suffix used for alternating Parquet HDFS paths (usually in TickTockHiveTableDataObject for the integration layer)
Options for HDFS output
Column names specific to historization of Hive tables
The partition difference execution mode lists the partitions of the input and output DataObjects and starts loading all missing partitions. The partition columns used for the comparison need to be a common 'init' of the input and output partition columns, i.e. a leading sequence of columns that both share.
optional number of partition columns to use as a common 'init'.
optional selection of the inputId to be used for partition comparison. Only needed if there are multiple input DataObjects.
optional selection of the outputId to be used for partition comparison. Only needed if there are multiple output DataObjects.
optional restriction of the number of partition values per run.
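
The partition comparison described above can be illustrated with a minimal Scala sketch. It is not the Smart Data Lake Builder implementation: the object PartitionDiffSketch, the PartitionValues alias and the parameter names nbOfCols and maxPerRun are hypothetical, and partitions are modelled as plain maps from column name to value. The sketch assumes that 'init' means the leading columns shared by input and output.

```scala
// Hypothetical sketch of a partition difference; not the library's actual code.
object PartitionDiffSketch {
  // One partition, e.g. Map("dt" -> "20240101", "type" -> "a")
  type PartitionValues = Map[String, String]

  /** Leading partition columns shared by input and output (the common 'init'). */
  def commonInit(inputCols: Seq[String], outputCols: Seq[String]): Seq[String] =
    inputCols.zip(outputCols).takeWhile { case (in, out) => in == out }.map(_._1)

  /** Partitions present on the input but missing on the output. */
  def missingPartitions(
      inputCols: Seq[String],
      outputCols: Seq[String],
      inputPartitions: Seq[PartitionValues],
      outputPartitions: Seq[PartitionValues],
      nbOfCols: Option[Int] = None,  // assumed: number of columns of the common 'init' to use
      maxPerRun: Option[Int] = None  // assumed: cap on partition values per run
  ): Seq[PartitionValues] = {
    val init = commonInit(inputCols, outputCols)
    val cols = nbOfCols.map(n => init.take(n)).getOrElse(init)
    require(cols.nonEmpty, "input and output share no leading partition columns")

    // Reduce a partition to the comparison columns only.
    def project(pv: PartitionValues): PartitionValues =
      pv.filter { case (col, _) => cols.contains(col) }

    val existing = outputPartitions.map(project).toSet
    val missing = inputPartitions.map(project).distinct.filterNot(existing.contains)
    maxPerRun.map(n => missing.take(n)).getOrElse(missing)
  }
}
```

For example, with input partition columns Seq("dt", "type") and output partition columns Seq("dt"), the comparison uses only "dt". If the input holds dt values 20240101, 20240102 and 20240103 and the output holds only 20240101, the sketch returns the partitions for 20240102 and 20240103; with maxPerRun = Some(1) it returns only 20240102.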