public class StreamingStep extends Object
See also: Hadoop Streaming
 AWSCredentials credentials = new BasicAWSCredentials(accessKey, secretKey);
 AmazonElasticMapReduce emr = new AmazonElasticMapReduceClient(credentials);
 HadoopJarStepConfig config = new StreamingStep()
     .withInputs("s3://elasticmapreduce/samples/wordcount/input")
     .withOutput("s3://my-bucket/output/")
     .withMapper("s3://elasticmapreduce/samples/wordcount/wordSplitter.py")
     .withReducer("aggregate")
     .toHadoopJarStepConfig();
 StepConfig wordCount = new StepConfig()
     .withName("Word Count")
     .withActionOnFailure("TERMINATE_JOB_FLOW")
     .withHadoopJarStep(config);
 RunJobFlowRequest request = new RunJobFlowRequest()
     .withName("Word Count")
     .withSteps(wordCount)
     .withLogUri("s3://log-bucket/")
     .withInstances(new JobFlowInstancesConfig()
         .withEc2KeyName("keypairt")
         .withHadoopVersion("0.20")
         .withInstanceCount(5)
         .withKeepJobFlowAliveWhenNoSteps(true)
         .withMasterInstanceType("m1.small")
         .withSlaveInstanceType("m1.small"));
 RunJobFlowResult result = emr.runJobFlow(request);
 | Constructor and Description | 
|---|
StreamingStep()
Creates a new default StreamingStep. 
 | 
| Modifier and Type | Method and Description | 
|---|---|
Map<String,String> | 
getHadoopConfig()
Get the Hadoop config overrides (-D values). 
 | 
List<String> | 
getInputs()
Get list of step input paths. 
 | 
String | 
getMapper()
Get the mapper. 
 | 
String | 
getOutput()
Get output path. 
 | 
String | 
getReducer()
Get the reducer 
 | 
void | 
setHadoopConfig(Map<String,String> hadoopConfig)
Set the Hadoop config overrides (-D values). 
 | 
void | 
setInputs(Collection<String> inputs)
Set the list of step input paths. 
 | 
void | 
setMapper(String mapper)
Set the mapper. 
 | 
void | 
setOutput(String output)
Set the output path for this step. 
 | 
void | 
setReducer(String reducer)
Set the reducer 
 | 
HadoopJarStepConfig | 
toHadoopJarStepConfig()
Creates the final HadoopJarStepConfig once you are done configuring the step. 
 | 
StreamingStep | 
withHadoopConfig(String key,
                String value)
Add a Hadoop config override (-D value). 
 | 
StreamingStep | 
withInputs(String... inputs)
Add more input paths to this step. 
 | 
StreamingStep | 
withMapper(String mapper)
Set the mapper 
 | 
StreamingStep | 
withOutput(String output)
Set the output path for this step. 
 | 
StreamingStep | 
withReducer(String reducer)
Set the reducer 
 | 
public List<String> getInputs()
public void setInputs(Collection<String> inputs)
inputs - List of step inputs.public StreamingStep withInputs(String... inputs)
inputs - A list of inputs to this step.public String getOutput()
public void setOutput(String output)
output - Output path.public StreamingStep withOutput(String output)
output - Output pathpublic String getMapper()
public void setMapper(String mapper)
mapper - Mapperpublic StreamingStep withMapper(String mapper)
mapper - Mapperpublic String getReducer()
public void setReducer(String reducer)
reducer - Reducerpublic StreamingStep withReducer(String reducer)
reducer - Reducerpublic Map<String,String> getHadoopConfig()
public void setHadoopConfig(Map<String,String> hadoopConfig)
hadoopConfig - Hadoop config.public StreamingStep withHadoopConfig(String key, String value)
key - Hadoop configuration key.value - Configuration value.public HadoopJarStepConfig toHadoopJarStepConfig()
Copyright © 2017. All rights reserved.