| Interface | Description |
|---|---|
| BatchReadable<KEY,VALUE> | Interface for datasets that can be input to a batch job. |
| BatchWritable<KEY,VALUE> | Interface for datasets that a batch job can output to. |
| DatasetOutputCommitter | Implemented by a dataset whose output must be committed or rolled back at the end of a batch job (MapReduce, Spark, ...), depending on whether the job succeeded. |
| InputContext | Exposes information about the input configured for this task. |
| InputFormatProvider | Interface to be implemented by datasets used as input to a MapReduce job. |
| OutputFormatProvider | Interface to be implemented by datasets used as output of a MapReduce job. |
| PartitionedFileSetInputContext | Exposes information about the PartitionedFileSet input configured for this task. |
| RecordScannable<RECORD> | Represents datasets that can be processed in batches, as a series of records (as opposed to key/value pairs). |
| RecordWritable<RECORD> | Interface for a dataset that a batch job can output to, as a series of records (as opposed to key/value pairs). |
| Scannables.RecordMaker<KEY,VALUE,RECORD> | Provides a way to convert a key and a value, as provided by a split reader, into a single record object. |
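
The key/value-to-record conversion described for Scannables.RecordMaker can be illustrated with a minimal, library-free sketch. The names `PairToRecord`, `UserRecord`, and `toRecords` are hypothetical stand-ins invented for this example; they are not part of this package, and the real interface signatures may differ.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class RecordMakerSketch {

  /** Hypothetical stand-in for the record-maker idea; NOT the actual library interface. */
  public interface PairToRecord<K, V, R> {
    R makeRecord(K key, V value);
  }

  /** Record type used only for this illustration. */
  public static final class UserRecord {
    public final String id;
    public final int age;

    public UserRecord(String id, int age) {
      this.id = id;
      this.age = age;
    }

    @Override
    public String toString() {
      return id + ":" + age;
    }
  }

  /** Converts every key/value pair into one record, preserving iteration order. */
  public static <K, V, R> List<R> toRecords(Map<K, V> pairs, PairToRecord<K, V, R> maker) {
    List<R> records = new ArrayList<>();
    for (Map.Entry<K, V> entry : pairs.entrySet()) {
      records.add(maker.makeRecord(entry.getKey(), entry.getValue()));
    }
    return records;
  }

  public static void main(String[] args) {
    // LinkedHashMap keeps insertion order, so the output order is deterministic.
    Map<String, Integer> pairs = new LinkedHashMap<>();
    pairs.put("alice", 30);
    pairs.put("bob", 25);

    // The constructor reference acts as the (key, value) -> record function.
    List<UserRecord> records = toRecords(pairs, UserRecord::new);
    System.out.println(records); // prints [alice:30, bob:25]
  }
}
```

Keeping the conversion behind a single-method interface lets the same key/value reader serve record-oriented consumers without changing the reader itself.
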

| Class | Description |
|---|---|
| Input | Defines input to a program, such as MapReduce. |
| Input.DatasetInput | |
| Input.InputFormatProviderInput | An implementation of Input, which defines an InputFormatProvider as an input. |
| IteratorBasedSplitReader<KEY,VALUE> | Handy implementation of SplitReader backed by an Iterator. |
| Output | Defines output of a program, such as MapReduce. |
| Output.DatasetOutput | |
| Output.OutputFormatProviderOutput | An implementation of Output, which defines an OutputFormatProvider as an output. |
| RecordScanner<RECORD> | Defines a reader of a dataset Split. |
| Scannables | Utility methods for record scanners. |
| SimpleSplit | Handy implementation of Split. |
| Split | Defines a split of the dataset. |
| SplitReader<KEY,VALUE> | Defines a reader of a dataset Split. |
| SplitReaderAdapter<FROM_KEY,TO_KEY,FROM_VALUE,TO_VALUE> | Handy adapter for SplitReader to convert types. |
| SplitReaderBase<KEY,VALUE> | Provides an abstract implementation of SplitReader. |
| Splits | Utility class to serialize and deserialize Split. |
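
The relationship between a split and an iterator-backed split reader can be sketched without the library on the classpath. Everything below (`RangeSplit`, `PairReader`, `IteratorReader`, `makeSplits`, `createReader`) is a hypothetical illustration of the pattern, under the assumption that a split is an index range over an in-memory list; it does not reproduce the actual Split or SplitReader signatures.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.Map;

public class SplitReaderSketch {

  /** Hypothetical split: a half-open index range [start, end) over a list. */
  public static final class RangeSplit {
    public final int start;
    public final int end;

    public RangeSplit(int start, int end) {
      this.start = start;
      this.end = end;
    }
  }

  /** Hypothetical reader contract; the real SplitReader API may differ. */
  public interface PairReader<K, V> {
    boolean next(); // advances; returns false once the split is exhausted
    K key();
    V value();
  }

  /** Reader backed by a plain Iterator over key/value entries. */
  public static final class IteratorReader<K, V> implements PairReader<K, V> {
    private final Iterator<Map.Entry<K, V>> iterator;
    private Map.Entry<K, V> current;

    public IteratorReader(Iterator<Map.Entry<K, V>> iterator) {
      this.iterator = iterator;
    }

    @Override
    public boolean next() {
      if (!iterator.hasNext()) {
        current = null;
        return false;
      }
      current = iterator.next();
      return true;
    }

    @Override
    public K key() {
      return current.getKey();
    }

    @Override
    public V value() {
      return current.getValue();
    }
  }

  /** Cuts [0, total) into consecutive splits of at most splitSize elements. */
  public static List<RangeSplit> makeSplits(int total, int splitSize) {
    if (splitSize <= 0) {
      throw new IllegalArgumentException("splitSize must be positive");
    }
    List<RangeSplit> splits = new ArrayList<>();
    for (int start = 0; start < total; start += splitSize) {
      splits.add(new RangeSplit(start, Math.min(start + splitSize, total)));
    }
    return splits;
  }

  /** Creates a reader that yields (index, element) pairs for one split. */
  public static PairReader<Integer, String> createReader(List<String> data, RangeSplit split) {
    List<Map.Entry<Integer, String>> entries = new ArrayList<>();
    for (int i = split.start; i < split.end; i++) {
      entries.add(new SimpleEntry<>(i, data.get(i)));
    }
    return new IteratorReader<>(entries.iterator());
  }

  public static void main(String[] args) {
    List<String> data = Arrays.asList("a", "b", "c", "d", "e");
    // Each split can be read independently, which is what lets a batch job parallelize.
    for (RangeSplit split : makeSplits(data.size(), 2)) {
      PairReader<Integer, String> reader = createReader(data, split);
      while (reader.next()) {
        System.out.println(reader.key() + " -> " + reader.value());
      }
    }
  }
}
```

In this sketch each split carries only the boundaries of its range, so it stays cheap to create and pass around, while the reader does the actual data access.
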
Copyright © 2024 Cask Data, Inc. Licensed under the Apache License, Version 2.0.