com.github.mjakubowski84.parquet4s
Configuration of the Parquet writer. Please refer to the Parquet documentation to understand what each configuration entry is responsible for. In addition to the options specific to the Parquet file format, the following options are available:
can be used to programmatically set Hadoop's Configuration
used when encoding time-based data; the local machine's time zone is used by default
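To illustrate why the time zone setting matters, here is a minimal sketch that uses only the JDK and does not call parquet4s. The object name `TimeZoneEffect` and the helper `epochMillis` are hypothetical, and the sketch assumes that encoding a zone-less timestamp involves interpreting it in some zone, which is what the option description above suggests.

```scala
import java.time.LocalDateTime
import java.util.TimeZone

// Hypothetical helper object, not part of parquet4s.
object TimeZoneEffect {

  // Epoch milliseconds of a wall-clock timestamp interpreted in the given zone.
  def epochMillis(local: LocalDateTime, zone: TimeZone): Long =
    local.atZone(zone.toZoneId).toInstant.toEpochMilli

  def main(args: Array[String]): Unit = {
    val wallClock = LocalDateTime.of(2020, 1, 1, 12, 0)
    val utc       = TimeZone.getTimeZone("UTC")
    val tokyo     = TimeZone.getTimeZone("Asia/Tokyo") // UTC+9, no daylight saving

    // The same wall-clock value maps to two different instants.
    val diffHours = (epochMillis(wallClock, utc) - epochMillis(wallClock, tokyo)) / 3600000L
    println(s"UTC instant is $diffHours hours later than the Tokyo instant")
  }
}
```

Since the default is the local machine's time zone, passing an explicit zone such as `TimeZone.getTimeZone("UTC")` is one way to keep the written values independent of where the writer runs.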
Writes an iterable collection of data as Parquet files at the given path. The path can represent a local file or directory, HDFS, AWS S3, Google Storage, Azure, etc. Please refer to the Hadoop client documentation or to your data provider to learn how to configure the connection.
type of data; also used to resolve the schema of the Parquet files
URI where the data will be written to
Collection of T that will be written in Parquet file format
configuration of writer, see ParquetWriter.Options
ParquetWriterFactory that will be used to create an instance of writer
Default instance of ParquetWriterFactory
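The parameters above might be combined as in the following sketch. Several names are assumptions, because this section does not state them: the method name `writeAndClose` (some releases may expose it as `write`), the `Options` field names `hadoopConf` and `timeZone`, and the hypothetical `User` record type and `WriteUsers` helper. The sketch also assumes that parquet4s and a Hadoop client are on the classpath and that the default `ParquetWriterFactory` is resolved implicitly for a case class.

```scala
import com.github.mjakubowski84.parquet4s.ParquetWriter
import org.apache.hadoop.conf.Configuration
import java.util.TimeZone

// Hypothetical record type; the Parquet schema is resolved from it.
case class User(id: Long, name: String)

// Hypothetical helper object, not part of parquet4s.
object WriteUsers {

  def writeUsers(path: String, users: Iterable[User]): Unit = {
    // Assumed field names: hadoopConf and timeZone.
    val options = ParquetWriter.Options(
      hadoopConf = new Configuration(),       // set connection properties here if needed
      timeZone   = TimeZone.getTimeZone("UTC") // explicit zone instead of the machine default
    )

    // Assumed method name; the default ParquetWriterFactory is picked up implicitly.
    ParquetWriter.writeAndClose(path, users, options)
  }
}
```

A call such as `WriteUsers.writeUsers("/tmp/users.parquet", List(User(1L, "Ada")))` would then target the local file system. For HDFS, S3 or another store, the path scheme and the entries set on the Hadoop `Configuration` depend on the provider.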