com.github.mjakubowski84.parquet4s
Configuration settings that are used during decoding or reading Parquet files
set it to the TimeZone that was used to encode the time-based data you want to read; the machine's time zone is used by default
use it to programmatically override Hadoop's Configuration
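The time zone setting matters because a wall-clock value with no zone attached maps to different instants depending on the zone used to interpret it. The sketch below illustrates that general effect with `java.time` only; it does not reproduce parquet4s internals, and the object name `TimeZoneEffect`, the sample date, and the chosen zones are illustrative assumptions.

```scala
import java.time.{Duration, Instant, LocalDateTime, ZoneId}

object TimeZoneEffect {
  // A wall-clock value with no zone attached (illustrative sample only).
  val wallClock: LocalDateTime = LocalDateTime.of(2020, 1, 15, 12, 0)

  // Interpret the same wall-clock value in two different zones.
  val inUtc: Instant    = wallClock.atZone(ZoneId.of("UTC")).toInstant
  val inWarsaw: Instant = wallClock.atZone(ZoneId.of("Europe/Warsaw")).toInstant

  // Gap between the two interpretations of the same wall-clock value.
  val shift: Duration = Duration.between(inWarsaw, inUtc)

  def main(args: Array[String]): Unit =
    println(s"UTC: $inUtc, Warsaw: $inWarsaw, shift: $shift")
}
```

If data was written on a machine in one zone and read on a machine in another, leaving the default in place can shift decoded timestamps by a gap like the one computed here.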
Creates a new ParquetIterable over data from the given path.
The path can point to a local file or directory, HDFS, AWS S3, Google Storage, Azure, etc.
Refer to the Hadoop client documentation or your data provider to learn how to configure the connection.
type of data that represents the schema of the Parquet file, e.g.:
case class MyData(id: Long, name: String, created: java.sql.Timestamp)
URI to Parquet files, e.g.:
"file:///data/users"
configuration of how Parquet files should be read
optional before-read filtering; no filtering is applied by default; check Filter for more details
Remember to call close() on the iterable in order to free resources!
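A minimal sketch of reading and then closing the iterable, assuming parquet4s is on the classpath and Parquet data matching `MyData` exists at the sample URI. The object name `ReadUsers` is hypothetical, and the positional argument order (path, then options; time zone as the first option) is an assumption taken from the order of the descriptions above, so check it against the signatures of your library version.

```scala
import java.util.TimeZone

import com.github.mjakubowski84.parquet4s.ParquetReader

// Schema type, copied from the example in the type parameter description.
case class MyData(id: Long, name: String, created: java.sql.Timestamp)

object ReadUsers {
  def main(args: Array[String]): Unit = {
    // Assumption: the time zone is the first field of Options.
    val options = ParquetReader.Options(TimeZone.getTimeZone("UTC"))

    // Sample URI from the docs; replace it with the location of real data.
    val users = ParquetReader.read[MyData]("file:///data/users", options)
    try {
      users.foreach(user => println(s"${user.id}: ${user.name}"))
    } finally {
      // Release the underlying resources even if iteration fails.
      users.close()
    }
  }
}
```

Wrapping the iteration in `try`/`finally` keeps the `close()` call from being skipped when decoding a record throws.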
Default implementation of ParquetReader.
(Since version 0.3.0) Please use the read function or the ParquetReader type class