@InterfaceAudience.Public
@InterfaceStability.Stable
public class InputSampler<K,V>
extends org.apache.hadoop.conf.Configured
implements org.apache.hadoop.util.Tool
TotalOrderPartitioner
.Modifier and Type | Class and Description |
---|---|
static class |
InputSampler.IntervalSampler<K,V>
Sample from s splits at regular intervals.
|
static class |
InputSampler.RandomSampler<K,V>
Sample from random points in the input.
|
static interface |
InputSampler.Sampler<K,V>
Interface to sample using an
InputFormat . |
static class |
InputSampler.SplitSampler<K,V>
Samples the first n records from s splits.
|
Constructor and Description |
---|
InputSampler(org.apache.hadoop.conf.Configuration conf) |
Modifier and Type | Method and Description |
---|---|
static void |
main(String[] args) |
int |
run(String[] args)
Driver for InputSampler from the command line.
|
static <K,V> void |
writePartitionFile(Job job,
InputSampler.Sampler<K,V> sampler)
Write a partition file for the given job, using the Sampler provided.
|
public static <K,V> void writePartitionFile(Job job, InputSampler.Sampler<K,V> sampler) throws IOException, ClassNotFoundException, InterruptedException
TotalOrderPartitioner.getPartitionFile(org.apache.hadoop.conf.Configuration)
.public int run(String[] args) throws Exception
writePartitionFile(org.apache.hadoop.mapreduce.Job, org.apache.hadoop.mapreduce.lib.partition.InputSampler.Sampler<K, V>)
.run
in interface org.apache.hadoop.util.Tool
Exception
Copyright © 2017 Apache Software Foundation. All Rights Reserved.