public class EsInputFormat<K,V> extends org.apache.hadoop.mapreduce.InputFormat<K,V> implements org.apache.hadoop.mapred.InputFormat<K,V>
InputFormat for streaming data (typically based on a query) from ElasticSearch.
Returns the document ID as key and its content as value.
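As a rough illustration of what "based on a query" can mean in practice, the sketch below collects the kind of settings a read job would usually carry. The property names `es.nodes`, `es.resource` and `es.query` are taken from the elasticsearch-hadoop configuration documentation, not from this page, and the helper `readSettings`, the class name and the sample values are hypothetical. A plain `Map` stands in for the Hadoop job configuration so the example runs without Hadoop on the classpath.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Sketch only: a plain Map stands in for the Hadoop job configuration.
// In a real job these key/value pairs would be set on that configuration.
final class EsReadSettingsSketch {

    // Hypothetical helper, not part of EsInputFormat.
    static Map<String, String> readSettings(String nodes, String resource, String query) {
        Map<String, String> settings = new LinkedHashMap<>();
        settings.put("es.nodes", nodes);       // where the cluster is reachable (assumed key)
        settings.put("es.resource", resource); // index to read from (assumed key)
        settings.put("es.query", query);       // query selecting the documents to stream (assumed key)
        return settings;
    }

    public static void main(String[] args) {
        // Sample values are placeholders for illustration.
        Map<String, String> settings = readSettings("localhost:9200", "radio/artists", "?q=me*");
        settings.forEach((key, value) -> System.out.println(key + "=" + value));
    }
}
```

With settings of this kind in place, each record handed to the job would carry the document ID as key and the document content as value, as described above.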
This class implements both the "old" (org.apache.hadoop.mapred) and the "new" (org.apache.hadoop.mapreduce) API.

Modifier and Type | Class and Description |
---|---|
protected static class | EsInputFormat.AbstractWritableEsInputRecordReader<V> |
protected static class | EsInputFormat.EsInputRecordReader<K,V> |
protected static class | EsInputFormat.EsInputSplit |
protected static class | EsInputFormat.JsonWritableEsInputRecordReader |
protected static class | EsInputFormat.WritableEsInputRecordReader |
Constructor and Description |
---|
EsInputFormat() |
Modifier and Type | Method and Description |
---|---|
EsInputFormat.EsInputRecordReader<K,V> | createRecordReader(InputSplit split, TaskAttemptContext context) |
EsInputFormat.EsInputRecordReader<K,V> | getRecordReader(InputSplit split, JobConf job, Reporter reporter) |
InputSplit[] | getSplits(JobConf job, int numSplits) |
List<InputSplit> | getSplits(JobContext context) |
protected boolean | isOutputAsJson(Configuration cfg) |
public List<InputSplit> getSplits(JobContext context) throws IOException
Specified by: getSplits in class org.apache.hadoop.mapreduce.InputFormat<K,V>
Throws: IOException
public EsInputFormat.EsInputRecordReader<K,V> createRecordReader(InputSplit split, TaskAttemptContext context)
Specified by: createRecordReader in class org.apache.hadoop.mapreduce.InputFormat<K,V>
public InputSplit[] getSplits(JobConf job, int numSplits) throws IOException
Specified by: getSplits in interface org.apache.hadoop.mapred.InputFormat<K,V>
Throws: IOException
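The two getSplits variants differ mainly in shape: the "new" API returns a List, the "old" API returns an array. The sketch below shows one way a class serving both APIs could share a single split computation. It makes no claim about how EsInputFormat does this internally; `Split`, `getSplitsNewApi` and `getSplitsOldApi` are hypothetical stand-ins that avoid a Hadoop dependency.

```java
import java.util.ArrayList;
import java.util.List;

// Sketch only: one split computation exposed in both return shapes.
final class DualApiSplitsSketch {

    // Hypothetical stand-in for a Hadoop InputSplit.
    static final class Split {
        final String label;

        Split(String label) {
            this.label = label;
        }
    }

    // "New" API shape: the splits are returned as a List.
    static List<Split> getSplitsNewApi(int partitions) {
        List<Split> splits = new ArrayList<>();
        for (int i = 0; i < partitions; i++) {
            splits.add(new Split("partition-" + i));
        }
        return splits;
    }

    // "Old" API shape: the same splits, converted to an array.
    static Split[] getSplitsOldApi(int partitions) {
        return getSplitsNewApi(partitions).toArray(new Split[0]);
    }

    public static void main(String[] args) {
        for (Split split : getSplitsOldApi(3)) {
            System.out.println(split.label);
        }
    }
}
```

Delegating one variant to the other keeps the two APIs from drifting apart, since only one method holds the actual partitioning logic.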
public EsInputFormat.EsInputRecordReader<K,V> getRecordReader(InputSplit split, JobConf job, Reporter reporter)
Specified by: getRecordReader in interface org.apache.hadoop.mapred.InputFormat<K,V>
protected boolean isOutputAsJson(Configuration cfg)
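This page does not show the body of isOutputAsJson. A minimal sketch of how such a check could work follows, assuming the flag is stored under the `es.output.json` property (the name used in the elasticsearch-hadoop configuration documentation) and defaults to false. `java.util.Properties` stands in for the Hadoop `Configuration`, and the class name is hypothetical.

```java
import java.util.Properties;

// Sketch only: reads a boolean flag from a configuration-like object.
final class OutputAsJsonSketch {

    // Assumed property name; not confirmed by this page.
    static final String OUTPUT_JSON_KEY = "es.output.json";

    // Returns true only when the flag is explicitly set to "true" (case-insensitive).
    static boolean isOutputAsJson(Properties cfg) {
        return Boolean.parseBoolean(cfg.getProperty(OUTPUT_JSON_KEY, "false"));
    }

    public static void main(String[] args) {
        Properties cfg = new Properties();
        System.out.println(isOutputAsJson(cfg)); // prints false (flag unset)

        cfg.setProperty(OUTPUT_JSON_KEY, "true");
        System.out.println(isOutputAsJson(cfg)); // prints true
    }
}
```

Because the method is protected, a subclass could override it to force one output mode regardless of the job configuration.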