List of transform tasks to execute
Area where the data is located. When using the BigQuery engine, teh area corresponds to the dataset name we will be working on in this job. When using the Spark engine, this is folder where the data should be store. Default value is "business"
output file format when using Spark engine. Ingored for BigQuery. Default value is "parquet"
When outputting files, should we coalesce it to a single file. Useful when CSV is the output format.
: Register UDFs written in this JVM class when using Spark engine Register UDFs stored at this location when using BigQuery engine
: Create temporary views using where the key is the view name and the map the SQL request corresponding to this view using the SQL engine supported syntax.
: SPARK or BQ. Default value is SPARK.
Area where the data is located.
Area where the data is located. When using the BigQuery engine, teh area corresponds to the dataset name we will be working on in this job. When using the Spark engine, this is folder where the data should be store. Default value is "business"
When outputting files, should we coalesce it to a single file.
When outputting files, should we coalesce it to a single file. Useful when CSV is the output format.
: SPARK or BQ.
: SPARK or BQ. Default value is SPARK.
output file format when using Spark engine.
output file format when using Spark engine. Ingored for BigQuery. Default value is "parquet"
List of transform tasks to execute
: Register UDFs written in this JVM class when using Spark engine Register UDFs stored at this location when using BigQuery engine
: Create temporary views using where the key is the view name and the map the SQL request corresponding to this view using the SQL engine supported syntax.
A job is a set of transform tasks executed using the specified engine.
List of transform tasks to execute
Area where the data is located. When using the BigQuery engine, teh area corresponds to the dataset name we will be working on in this job. When using the Spark engine, this is folder where the data should be store. Default value is "business"
output file format when using Spark engine. Ingored for BigQuery. Default value is "parquet"
When outputting files, should we coalesce it to a single file. Useful when CSV is the output format.
: Register UDFs written in this JVM class when using Spark engine Register UDFs stored at this location when using BigQuery engine
: Create temporary views using where the key is the view name and the map the SQL request corresponding to this view using the SQL engine supported syntax.
: SPARK or BQ. Default value is SPARK.