@InterfaceAudience.Public @InterfaceStability.Stable public class SequenceFileInputFilter<K,V> extends SequenceFileInputFormat<K,V>
Modifier and Type | Class and Description |
---|---|
static interface |
SequenceFileInputFilter.Filter
filter interface
|
static class |
SequenceFileInputFilter.FilterBase
base class for Filters
|
static class |
SequenceFileInputFilter.MD5Filter
This class returns a set of records by examing the MD5 digest of its
key against a filtering frequency f.
|
static class |
SequenceFileInputFilter.PercentFilter
This class returns a percentage of records
The percentage is determined by a filtering frequency f using
the criteria record# % f == 0.
|
static class |
SequenceFileInputFilter.RegexFilter
Records filter by matching key to regex
|
FileInputFormat.Counter
INPUT_DIR_RECURSIVE, LOG, NUM_INPUT_FILES
Constructor and Description |
---|
SequenceFileInputFilter() |
Modifier and Type | Method and Description |
---|---|
RecordReader<K,V> |
getRecordReader(InputSplit split,
JobConf job,
Reporter reporter)
Create a record reader for the given split
|
static void |
setFilterClass(org.apache.hadoop.conf.Configuration conf,
Class filterClass)
set the filter class
|
listStatus
addInputPath, addInputPathRecursively, addInputPaths, computeSplitSize, getBlockIndex, getInputPathFilter, getInputPaths, getSplitHosts, getSplits, isSplitable, makeSplit, makeSplit, setInputPathFilter, setInputPaths, setInputPaths, setMinSplitSize
public RecordReader<K,V> getRecordReader(InputSplit split, JobConf job, Reporter reporter) throws IOException
getRecordReader
in interface InputFormat<K,V>
getRecordReader
in class SequenceFileInputFormat<K,V>
split
- file splitjob
- job configurationreporter
- reporter who sends report to task trackerIOException
public static void setFilterClass(org.apache.hadoop.conf.Configuration conf, Class filterClass)
conf
- application configurationfilterClass
- filter classCopyright © 2020 Apache Software Foundation. All rights reserved.