public abstract class TezSplitGrouper extends Object
| Modifier and Type | Class and Description |
|---|---|
| static class | TezSplitGrouper.TezMRSplitsGrouperConfigBuilder |
| Modifier and Type | Field and Description |
|---|---|
| static String | TEZ_GROUPING_NODE_LOCAL_ONLY: Generate node-local splits only. |
| static boolean | TEZ_GROUPING_NODE_LOCAL_ONLY_DEFAULT |
| static String | TEZ_GROUPING_RACK_SPLIT_SIZE_REDUCTION: This factor is used to decrease the per-group desired (length and count) limits for groups created by combining splits within a rack. |
| static float | TEZ_GROUPING_RACK_SPLIT_SIZE_REDUCTION_DEFAULT |
| static String | TEZ_GROUPING_REPEATABLE: When enabled, repeated invocations of grouping on the same splits with the same parameters will produce the same groups. |
| static boolean | TEZ_GROUPING_REPEATABLE_DEFAULT |
| static String | TEZ_GROUPING_SPLIT_BY_COUNT: Cap the size of a group by the count of splits it contains. |
| static boolean | TEZ_GROUPING_SPLIT_BY_COUNT_DEFAULT |
| static String | TEZ_GROUPING_SPLIT_BY_LENGTH: Cap the size of a group by the total length of the splits it contains. |
| static boolean | TEZ_GROUPING_SPLIT_BY_LENGTH_DEFAULT |
| static String | TEZ_GROUPING_SPLIT_COUNT: The desired number of grouped splits to create. |
| static String | TEZ_GROUPING_SPLIT_MAX_SIZE: Upper bound on the size (in bytes) of a grouped split, to avoid generating excessively large splits. |
| static long | TEZ_GROUPING_SPLIT_MAX_SIZE_DEFAULT |
| static String | TEZ_GROUPING_SPLIT_MIN_SIZE: Lower bound on the size (in bytes) of a grouped split, to avoid generating too many small splits. |
| static long | TEZ_GROUPING_SPLIT_MIN_SIZE_DEFAULT |
| static String | TEZ_GROUPING_SPLIT_WAVES: The multiplier for available queue capacity when determining the number of tasks for a Vertex. |
| static float | TEZ_GROUPING_SPLIT_WAVES_DEFAULT |
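
The waves multiplier and the minimum and maximum size bounds described in the field table interact arithmetically, which a small worked example may make easier to follow. The sketch below is an illustration only and is not the Tez implementation: the class `GroupSizeBoundsSketch`, its method names, and the byte values used in `main` are all hypothetical, and the rounding choices are assumptions. It derives a desired group count from available capacity and a waves multiplier, then adjusts that count so the average group length stays within the size bounds.

```java
/** Hypothetical illustration of the size-bound arithmetic; not Tez code. */
final class GroupSizeBoundsSketch {

    /** Desired group count = available task slots times the waves multiplier (assumed rounding: truncate). */
    static int desiredFromWaves(int availableSlots, float waves) {
        return Math.max(1, (int) (availableSlots * waves));
    }

    /**
     * Adjusts a desired group count so that the average group length
     * (totalLength / count) stays within [minSize, maxSize] bytes.
     */
    static int boundedGroupCount(long totalLength, int desiredGroups, long minSize, long maxSize) {
        if (totalLength <= 0 || desiredGroups <= 0) {
            return Math.max(desiredGroups, 1);
        }
        long lengthPerGroup = totalLength / desiredGroups;
        if (lengthPerGroup > maxSize) {
            // Groups would be too large: raise the count (ceiling division).
            return (int) ((totalLength + maxSize - 1) / maxSize);
        }
        if (lengthPerGroup < minSize) {
            // Groups would be too small: lower the count (floor division, at least 1).
            return (int) Math.max(1L, totalLength / minSize);
        }
        return desiredGroups;
    }

    public static void main(String[] args) {
        long mb = 1024L * 1024L;
        long total = 10_000 * mb;   // arbitrary example input size
        long min = 50 * mb;         // arbitrary example lower bound
        long max = 1024 * mb;       // arbitrary example upper bound

        System.out.println(desiredFromWaves(100, 1.5f));              // 150
        System.out.println(boundedGroupCount(total, 1_000, min, max)); // 200 (raised group size to min)
        System.out.println(boundedGroupCount(total, 5, min, max));     // 10  (lowered group size to max)
        System.out.println(boundedGroupCount(total, 100, min, max));   // 100 (already within bounds)
    }
}
```

In this sketch the size bounds take precedence over the requested count, which matches the stated purpose of the bounds (avoiding too many small splits or excessively large ones); whether the real grouper applies them in exactly this order is not stated on this page.
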
| Constructor and Description |
|---|
| TezSplitGrouper() |
| Modifier and Type | Method and Description |
|---|---|
| List<GroupedSplitContainer> | getGroupedSplits(org.apache.hadoop.conf.Configuration conf, List<SplitContainer> originalSplits, int desiredNumSplits, String wrappedInputFormatName, SplitSizeEstimatorWrapper estimator, SplitLocationProviderWrapper locationProvider) |
| static TezSplitGrouper.TezMRSplitsGrouperConfigBuilder | newConfigBuilder(org.apache.hadoop.conf.Configuration conf): Builder that can be used to configure grouping in Tez. |
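
The parameter description for newConfigBuilder says the supplied Configuration is modified in place and that values not overridden in code come from that same object. The sketch below illustrates that builder pattern in isolation, assuming nothing about the real classes: `InPlaceConfigBuilderSketch`, its keys, and its setter are hypothetical stand-ins, and `java.util.Properties` is used in place of the Hadoop Configuration so the example is self-contained.

```java
import java.util.Properties;

/** Hypothetical stand-in for an in-place config builder; not the Tez class. */
final class InPlaceConfigBuilderSketch {
    // Hypothetical keys, chosen only for this sketch.
    static final String SPLIT_COUNT = "sketch.grouping.split-count";
    static final String NODE_LOCAL_ONLY = "sketch.grouping.node-local-only";

    private final Properties conf;

    private InPlaceConfigBuilderSketch(Properties conf) {
        this.conf = conf;
    }

    /** Wraps the caller's config object; no copy is made. */
    static InPlaceConfigBuilderSketch newConfigBuilder(Properties conf) {
        return new InPlaceConfigBuilderSketch(conf);
    }

    /** Overrides one value in code, writing straight into the wrapped config. */
    InPlaceConfigBuilderSketch setSplitCount(int count) {
        conf.setProperty(SPLIT_COUNT, Integer.toString(count));
        return this;
    }

    /** Returns the same object that was passed in, now carrying the overrides. */
    Properties build() {
        return conf;
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        // Pretend this value was loaded from a config file.
        conf.setProperty(NODE_LOCAL_ONLY, "true");

        Properties built = newConfigBuilder(conf).setSplitCount(20).build();

        System.out.println(built == conf);                      // true: modified in place
        System.out.println(conf.getProperty(SPLIT_COUNT));      // 20: overridden in code
        System.out.println(conf.getProperty(NODE_LOCAL_ONLY));  // true: kept from the config
    }
}
```

A practical consequence of this pattern is that a caller who needs the original settings untouched would have to pass in a copy.
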
public static final String TEZ_GROUPING_SPLIT_COUNT
public static final String TEZ_GROUPING_SPLIT_BY_LENGTH
public static final boolean TEZ_GROUPING_SPLIT_BY_LENGTH_DEFAULT
public static final String TEZ_GROUPING_SPLIT_BY_COUNT
public static final boolean TEZ_GROUPING_SPLIT_BY_COUNT_DEFAULT
public static final String TEZ_GROUPING_SPLIT_WAVES
public static final float TEZ_GROUPING_SPLIT_WAVES_DEFAULT
public static final String TEZ_GROUPING_SPLIT_MAX_SIZE
public static final long TEZ_GROUPING_SPLIT_MAX_SIZE_DEFAULT
public static final String TEZ_GROUPING_SPLIT_MIN_SIZE
public static final long TEZ_GROUPING_SPLIT_MIN_SIZE_DEFAULT
public static final String TEZ_GROUPING_RACK_SPLIT_SIZE_REDUCTION
public static final float TEZ_GROUPING_RACK_SPLIT_SIZE_REDUCTION_DEFAULT
public static final String TEZ_GROUPING_REPEATABLE
public static final boolean TEZ_GROUPING_REPEATABLE_DEFAULT
public static final String TEZ_GROUPING_NODE_LOCAL_ONLY
public static final boolean TEZ_GROUPING_NODE_LOCAL_ONLY_DEFAULT
public List<GroupedSplitContainer> getGroupedSplits(org.apache.hadoop.conf.Configuration conf, List<SplitContainer> originalSplits, int desiredNumSplits, String wrappedInputFormatName, SplitSizeEstimatorWrapper estimator, SplitLocationProviderWrapper locationProvider) throws IOException, InterruptedException
Throws:
IOException
InterruptedException
public static TezSplitGrouper.TezMRSplitsGrouperConfigBuilder newConfigBuilder(org.apache.hadoop.conf.Configuration conf)
Parameters:
conf - Configuration. This will be modified in place. If configuration values may be changed at runtime via a config file, then pass in a Configuration that is initialized from a config file. The parameters that are not overridden in code will be derived from the Configuration object.
Returns:
TezSplitGrouper.TezMRSplitsGrouperConfigBuilder

Copyright © 2016 Apache Software Foundation. All rights reserved.