@InterfaceAudience.Public @InterfaceStability.Evolving public class TezMapReduceSplitsGrouper extends Object
TezGroupedSplit
 to wrap the each group of real InputSplits| Modifier and Type | Class and Description | 
|---|---|
static class  | 
TezMapReduceSplitsGrouper.TezMRSplitsGrouperConfigBuilder  | 
| Modifier and Type | Field and Description | 
|---|---|
static String | 
TEZ_GROUPING_RACK_SPLIT_SIZE_REDUCTION
This factor is used to decrease the per group desired (length and count) limits for groups
 created by combining splits within a rack. 
 | 
static float | 
TEZ_GROUPING_RACK_SPLIT_SIZE_REDUCTION_DEFAULT  | 
static String | 
TEZ_GROUPING_REPEATABLE
Repeated invocations of grouping on the same splits with the same parameters will produce the 
 same groups. 
 | 
static boolean | 
TEZ_GROUPING_REPEATABLE_DEFAULT  | 
static String | 
TEZ_GROUPING_SPLIT_BY_COUNT
Limit the number of splits in a group by the number of splits in the group 
 | 
static boolean | 
TEZ_GROUPING_SPLIT_BY_COUNT_DEFAULT  | 
static String | 
TEZ_GROUPING_SPLIT_BY_LENGTH
Limit the number of splits in a group by the total length of the splits in the group 
 | 
static boolean | 
TEZ_GROUPING_SPLIT_BY_LENGTH_DEFAULT  | 
static String | 
TEZ_GROUPING_SPLIT_COUNT
Specify the number of splits desired to be created 
 | 
static String | 
TEZ_GROUPING_SPLIT_MAX_SIZE
Upper bound on the size (in bytes) of a grouped split, to avoid generating excessively large splits. 
 | 
static long | 
TEZ_GROUPING_SPLIT_MAX_SIZE_DEFAULT  | 
static String | 
TEZ_GROUPING_SPLIT_MIN_SIZE
Lower bound on the size (in bytes) of a grouped split, to avoid generating too many small splits. 
 | 
static long | 
TEZ_GROUPING_SPLIT_MIN_SIZE_DEFAULT  | 
static String | 
TEZ_GROUPING_SPLIT_WAVES
The multiplier for available queue capacity when determining number of
 tasks for a Vertex. 
 | 
static float | 
TEZ_GROUPING_SPLIT_WAVES_DEFAULT  | 
| Constructor and Description | 
|---|
TezMapReduceSplitsGrouper()  | 
| Modifier and Type | Method and Description | 
|---|---|
static TezMapReduceSplitsGrouper.TezMRSplitsGrouperConfigBuilder | 
createConfigBuilder(org.apache.hadoop.conf.Configuration conf)
Builder that can be used to configure grouping in Tez 
 | 
List<org.apache.hadoop.mapreduce.InputSplit> | 
getGroupedSplits(org.apache.hadoop.conf.Configuration conf,
                List<org.apache.hadoop.mapreduce.InputSplit> originalSplits,
                int desiredNumSplits,
                String wrappedInputFormatName)  | 
List<org.apache.hadoop.mapreduce.InputSplit> | 
getGroupedSplits(org.apache.hadoop.conf.Configuration conf,
                List<org.apache.hadoop.mapreduce.InputSplit> originalSplits,
                int desiredNumSplits,
                String wrappedInputFormatName,
                SplitSizeEstimator estimator)  | 
public static final String TEZ_GROUPING_SPLIT_COUNT
public static final String TEZ_GROUPING_SPLIT_BY_LENGTH
public static final boolean TEZ_GROUPING_SPLIT_BY_LENGTH_DEFAULT
public static final String TEZ_GROUPING_SPLIT_BY_COUNT
public static final boolean TEZ_GROUPING_SPLIT_BY_COUNT_DEFAULT
public static final String TEZ_GROUPING_SPLIT_WAVES
public static final float TEZ_GROUPING_SPLIT_WAVES_DEFAULT
public static final String TEZ_GROUPING_SPLIT_MAX_SIZE
public static final long TEZ_GROUPING_SPLIT_MAX_SIZE_DEFAULT
public static final String TEZ_GROUPING_SPLIT_MIN_SIZE
public static final long TEZ_GROUPING_SPLIT_MIN_SIZE_DEFAULT
public static final String TEZ_GROUPING_RACK_SPLIT_SIZE_REDUCTION
public static final float TEZ_GROUPING_RACK_SPLIT_SIZE_REDUCTION_DEFAULT
public static final String TEZ_GROUPING_REPEATABLE
public static final boolean TEZ_GROUPING_REPEATABLE_DEFAULT
public List<org.apache.hadoop.mapreduce.InputSplit> getGroupedSplits(org.apache.hadoop.conf.Configuration conf, List<org.apache.hadoop.mapreduce.InputSplit> originalSplits, int desiredNumSplits, String wrappedInputFormatName) throws IOException, InterruptedException
IOExceptionInterruptedExceptionpublic List<org.apache.hadoop.mapreduce.InputSplit> getGroupedSplits(org.apache.hadoop.conf.Configuration conf, List<org.apache.hadoop.mapreduce.InputSplit> originalSplits, int desiredNumSplits, String wrappedInputFormatName, SplitSizeEstimator estimator) throws IOException, InterruptedException
IOExceptionInterruptedExceptionpublic static TezMapReduceSplitsGrouper.TezMRSplitsGrouperConfigBuilder createConfigBuilder(org.apache.hadoop.conf.Configuration conf)
conf - Configuration This will be modified in place. If
          configuration values may be changed at runtime via a config file
          then pass in a Configuration that is initialized from a
          config file. The parameters that are not overridden in code will
          be derived from the Configuration object.TezMapReduceSplitsGrouper.TezMRSplitsGrouperConfigBuilderCopyright © 2015 Apache Software Foundation. All rights reserved.