public static class InputSampler.IntervalSampler<K,V> extends Object implements InputSampler.Sampler<K,V>
| Modifier and Type | Field and Description |
|---|---|
| protected double | freq |
| protected int | maxSplitsSampled |
| Constructor and Description |
|---|
| IntervalSampler(double freq) Create a new IntervalSampler sampling all splits. |
| IntervalSampler(double freq, int maxSplitsSampled) Create a new IntervalSampler. |
| Modifier and Type | Method and Description |
|---|---|
| K[] | getSample(InputFormat<K,V> inf, Job job) For each split sampled, emit when the ratio of the number of records retained to the total record count is less than the specified frequency. |
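
The retention rule in the getSample description can be illustrated with a minimal, Hadoop-free sketch. The class `IntervalSampleSketch` and its `sample` helper are hypothetical names introduced here for illustration only; they are not part of the Hadoop API. The sketch also simplifies by treating the input as one in-memory list instead of reading records split by split.

```java
import java.util.ArrayList;
import java.util.List;

public class IntervalSampleSketch {

    // Hypothetical helper (not a Hadoop method): keeps a key whenever the
    // ratio of keys kept so far to records seen so far is below freq.
    static <K> List<K> sample(List<K> records, double freq) {
        List<K> kept = new ArrayList<>();
        long seen = 0;
        for (K key : records) {
            ++seen;
            // Emit only while the retained/total ratio is under the target frequency.
            if ((double) kept.size() / seen < freq) {
                kept.add(key);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<Integer> keys = new ArrayList<>();
        for (int i = 1; i <= 10; i++) {
            keys.add(i);
        }
        // With freq = 0.25 the keys at positions 1, 5 and 9 are retained.
        System.out.println(sample(keys, 0.25)); // prints [1, 5, 9]
    }
}
```

Under this rule the first record is always kept when freq is greater than zero, and later records are kept at roughly regular intervals, so the sample size stays close to freq times the number of records read.
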
public IntervalSampler(double freq)
Create a new IntervalSampler sampling all splits.
Parameters:
freq - The frequency with which records will be emitted.

public IntervalSampler(double freq, int maxSplitsSampled)
Create a new IntervalSampler.
Parameters:
freq - The frequency with which records will be emitted.
maxSplitsSampled - The maximum number of splits to examine.
See Also:
getSample(org.apache.hadoop.mapreduce.InputFormat<K, V>, org.apache.hadoop.mapreduce.Job)

public K[] getSample(InputFormat<K,V> inf, Job job) throws IOException, InterruptedException
For each split sampled, emit when the ratio of the number of records retained to the total record count is less than the specified frequency.
Specified by:
getSample in interface InputSampler.Sampler<K,V>
Throws:
IOException
InterruptedException

Copyright © 2008–2022 Apache Software Foundation. All rights reserved.