ParquetWriter

Type Members

case class Options(writeMode: Mode = ParquetFileWriter.Mode.CREATE, compressionCodecName: CompressionCodecName = ..., dictionaryEncodingEnabled: Boolean = ..., dictionaryPageSize: Int = ..., maxPaddingSize: Int = ..., pageSize: Int = ..., rowGroupSize: Int = ..., validationEnabled: Boolean = ..., hadoopConf: Configuration = new Configuration(), timeZone: TimeZone = TimeZone.getDefault) extends Product with Serializable

Configuration of parquet writer.
Configuration of parquet writer. Please have a look at documentation of Parquet to understand what every configuration entry is responsible for. Apart from options specific for Parquet file format there are some other - what follows:
hadoopConf
can be used to programmatically set Hadoop's Configuration
timeZone
used when encoding time-based data, local machine's time zone is used by default
type ParquetWriterFactory[T] = (String, Options) ⇒ ParquetWriter[T]

Value Members

final def !=(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def asInstanceOf[T0]: T0

Definition Classes
Any
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( ... )
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
def hashCode(): Int

Definition Classes
AnyRef → Any
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def toString(): String

Definition Classes
AnyRef → Any
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
def writeAndClose[T](path: String, data: Iterable[T], options: Options = ParquetWriter.Options())(implicit writerFactory: ParquetWriterFactory[T]): Unit

Writes iterable collection of data as a Parquet files at given path.
Writes iterable collection of data as a Parquet files at given path. Path can represent local file or directory, HDFS, AWS S3, Google Storage, Azure, etc. Please refer to Hadoop client documentation or your data provider in order to know how to configure the connection.
T
type of data, will be used also to resolve the schema of Parquet files
path
URI where the data will be written to
data
Collection of T that will be written in Parquet file format
options
configuration of writer, see ParquetWriter.Options
writerFactory
ParquetWriterFactory that will be used to create an instance of writer
def writer[T](path: String, options: Options = ParquetWriter.Options())(implicit writerFactory: ParquetWriterFactory[T]): ParquetWriter[T]
implicit def writerFactory[T](implicit arg0: ParquetRecordEncoder[T], arg1: ParquetSchemaResolver[T]): ParquetWriterFactory[T]

Default instance of ParquetWriterFactory

Inherited from AnyRef

Inherited from Any

Ungrouped

Related Docs: trait ParquetWriter | package parquet4s

object ParquetWriter

Type Members

type ParquetWriterFactory[T] = (String, Options) ⇒ ParquetWriter[T]

Value Members

final def !=(arg0: Any): Boolean

final def ##(): Int

final def ==(arg0: Any): Boolean

final def asInstanceOf[T0]: T0

def clone(): AnyRef

final def eq(arg0: AnyRef): Boolean

def equals(arg0: Any): Boolean

def finalize(): Unit

final def getClass(): Class[_]

def hashCode(): Int

final def isInstanceOf[T0]: Boolean

final def ne(arg0: AnyRef): Boolean

final def notify(): Unit

final def notifyAll(): Unit

final def synchronized[T0](arg0: ⇒ T0): T0

def toString(): String

final def wait(): Unit

final def wait(arg0: Long, arg1: Int): Unit

final def wait(arg0: Long): Unit

def writeAndClose[T](path: String, data: Iterable[T], options: Options = ParquetWriter.Options())(implicit writerFactory: ParquetWriterFactory[T]): Unit

def writer[T](path: String, options: Options = ParquetWriter.Options())(implicit writerFactory: ParquetWriterFactory[T]): ParquetWriter[T]

implicit def writerFactory[T](implicit arg0: ParquetRecordEncoder[T], arg1: ParquetSchemaResolver[T]): ParquetWriterFactory[T]

Inherited from AnyRef

Inherited from Any

Ungrouped