Package | Description |
---|---|
org.apache.hadoop.mapred | |
org.apache.hadoop.mapred.join | |
org.apache.hadoop.mapred.lib | |
org.apache.hadoop.mapred.lib.db |
Modifier and Type | Class and Description |
---|---|
class |
FileInputFormat<K,V>
A base class for file-based
InputFormat . |
class |
FixedLengthInputFormat
FixedLengthInputFormat is an input format used to read input files
which contain fixed length records.
|
class |
KeyValueTextInputFormat
An
InputFormat for plain text files. |
class |
MultiFileInputFormat<K,V>
An abstract
InputFormat that returns MultiFileSplit 's
in MultiFileInputFormat.getSplits(JobConf, int) method. |
class |
SequenceFileAsBinaryInputFormat
InputFormat reading keys, values from SequenceFiles in binary (raw)
format.
|
class |
SequenceFileAsTextInputFormat
This class is similar to SequenceFileInputFormat,
except it generates SequenceFileAsTextRecordReader
which converts the input keys and values to their
String forms by calling toString() method.
|
class |
SequenceFileInputFilter<K,V>
A class that allows a map/red job to work on a sample of sequence files.
|
class |
SequenceFileInputFormat<K,V>
An
InputFormat for SequenceFile s. |
class |
TextInputFormat
An
InputFormat for plain text files. |
Modifier and Type | Method and Description |
---|---|
InputFormat |
JobConf.getInputFormat()
Get the
InputFormat implementation for the map-reduce job,
defaults to TextInputFormat if not specified explicity. |
Modifier and Type | Method and Description |
---|---|
void |
JobConf.setInputFormat(Class<? extends InputFormat> theClass)
Set the
InputFormat implementation for the map-reduce job. |
Modifier and Type | Interface and Description |
---|---|
interface |
ComposableInputFormat<K extends org.apache.hadoop.io.WritableComparable,V extends org.apache.hadoop.io.Writable>
Refinement of InputFormat requiring implementors to provide
ComposableRecordReader instead of RecordReader.
|
Modifier and Type | Class and Description |
---|---|
class |
CompositeInputFormat<K extends org.apache.hadoop.io.WritableComparable>
An InputFormat capable of performing joins over a set of data sources sorted
and partitioned the same way.
|
static class |
Parser.Node |
Modifier and Type | Method and Description |
---|---|
static String |
CompositeInputFormat.compose(Class<? extends InputFormat> inf,
String path)
Convenience method for constructing composite formats.
|
static String |
CompositeInputFormat.compose(String op,
Class<? extends InputFormat> inf,
org.apache.hadoop.fs.Path... path)
Convenience method for constructing composite formats.
|
static String |
CompositeInputFormat.compose(String op,
Class<? extends InputFormat> inf,
String... path)
Convenience method for constructing composite formats.
|
Modifier and Type | Class and Description |
---|---|
class |
CombineFileInputFormat<K,V>
|
class |
CombineSequenceFileInputFormat<K,V>
Input format that is a
CombineFileInputFormat -equivalent for
SequenceFileInputFormat . |
class |
CombineTextInputFormat
Input format that is a
CombineFileInputFormat -equivalent for
TextInputFormat . |
class |
DelegatingInputFormat<K,V>
An
InputFormat that delegates behaviour of paths to multiple other
InputFormats. |
class |
NLineInputFormat
NLineInputFormat which splits N lines of input as one split.
|
Modifier and Type | Method and Description |
---|---|
K[] |
InputSampler.Sampler.getSample(InputFormat<K,V> inf,
JobConf job)
For a given job, collect and return a subset of the keys from the
input data.
|
K[] |
InputSampler.SplitSampler.getSample(InputFormat<K,V> inf,
JobConf job)
From each split sampled, take the first numSamples / numSplits records.
|
K[] |
InputSampler.RandomSampler.getSample(InputFormat<K,V> inf,
JobConf job)
Randomize the split order, then take the specified number of keys from
each split sampled, where each key is selected with the specified
probability and possibly replaced by a subsequently selected key when
the quota of keys from that split is satisfied.
|
K[] |
InputSampler.IntervalSampler.getSample(InputFormat<K,V> inf,
JobConf job)
For each split sampled, emit when the ratio of the number of records
retained to the total record count is less than the specified
frequency.
|
Modifier and Type | Method and Description |
---|---|
static void |
MultipleInputs.addInputPath(JobConf conf,
org.apache.hadoop.fs.Path path,
Class<? extends InputFormat> inputFormatClass)
Add a
Path with a custom InputFormat to the list of
inputs for the map-reduce job. |
static void |
MultipleInputs.addInputPath(JobConf conf,
org.apache.hadoop.fs.Path path,
Class<? extends InputFormat> inputFormatClass,
Class<? extends Mapper> mapperClass)
|
Modifier and Type | Class and Description |
---|---|
class |
DBInputFormat<T extends DBWritable> |
Copyright © 2017 Apache Software Foundation. All Rights Reserved.