the name of the Excel Sheet to read from/write to. This option is required.
the number of rows in the excel spreadsheet to skip before any data is read. This option must not be set for writing.
the first column in the specified Excel Sheet to read from (1-based indexing). This option must not be set for writing.
TODO: this is not used anymore as far as I can tell --> crealytics now uses dataAddress.
Limit the number of rows being returned on read to the first rowLimit
rows.
This is applied after numLinesToSkip
.
If true
, the first row of the excel sheet specifies the column names.
This option is required (default: true).
Empty cells are parsed as null
values (default: true).
Infer the schema of the excel sheet automatically (default: true).
A format string specifying the format to use when writing timestamps (default: dd-MM-yyyy HH:mm:ss).
A format string specifying the format to use when writing dates.
The number of rows that are stored in memory. If set, a streaming reader is used which can help with big files.
Sample size for schema inference.
A format string specifying the format to use when writing dates.
TODO: this is not used anymore as far as I can tell --> crealytics now uses dataAddress.
Sample size for schema inference.
Infer the schema of the excel sheet automatically (default: true).
The number of rows that are stored in memory.
The number of rows that are stored in memory. If set, a streaming reader is used which can help with big files.
the number of rows in the excel spreadsheet to skip before any data is read.
the number of rows in the excel spreadsheet to skip before any data is read. This option must not be set for writing.
Limit the number of rows being returned on read to the first rowLimit
rows.
Limit the number of rows being returned on read to the first rowLimit
rows.
This is applied after numLinesToSkip
.
the name of the Excel Sheet to read from/write to.
the name of the Excel Sheet to read from/write to. This option is required.
the first column in the specified Excel Sheet to read from (1-based indexing).
the first column in the specified Excel Sheet to read from (1-based indexing). This option must not be set for writing.
A format string specifying the format to use when writing timestamps (default: dd-MM-yyyy HH:mm:ss).
Empty cells are parsed as null
values (default: true).
If true
, the first row of the excel sheet specifies the column names.
If true
, the first row of the excel sheet specifies the column names.
This option is required (default: true).
Options passed to org.apache.spark.sql.DataFrameReader and org.apache.spark.sql.DataFrameWriter for reading and writing Microsoft Excel files. Excel support is provided by the spark-excel project (see link below).
the name of the Excel Sheet to read from/write to. This option is required.
the number of rows in the excel spreadsheet to skip before any data is read. This option must not be set for writing.
the first column in the specified Excel Sheet to read from (1-based indexing). This option must not be set for writing.
TODO: this is not used anymore as far as I can tell --> crealytics now uses dataAddress.
Limit the number of rows being returned on read to the first
rowLimit
rows. This is applied afternumLinesToSkip
.If
true
, the first row of the excel sheet specifies the column names. This option is required (default: true).Empty cells are parsed as
null
values (default: true).Infer the schema of the excel sheet automatically (default: true).
A format string specifying the format to use when writing timestamps (default: dd-MM-yyyy HH:mm:ss).
A format string specifying the format to use when writing dates.
The number of rows that are stored in memory. If set, a streaming reader is used which can help with big files.
Sample size for schema inference.
https://github.com/crealytics/spark-excel