public abstract class TezSplitGrouper extends Object
Modifier and Type | Class and Description |
---|---|
static class |
TezSplitGrouper.TezMRSplitsGrouperConfigBuilder |
Modifier and Type | Field and Description |
---|---|
static String |
TEZ_GROUPING_NODE_LOCAL_ONLY
Generate node local splits only.
|
static boolean |
TEZ_GROUPING_NODE_LOCAL_ONLY_DEFAULT |
static String |
TEZ_GROUPING_RACK_SPLIT_SIZE_REDUCTION
This factor is used to decrease the per group desired (length and count) limits for groups
created by combining splits within a rack.
|
static float |
TEZ_GROUPING_RACK_SPLIT_SIZE_REDUCTION_DEFAULT |
static String |
TEZ_GROUPING_REPEATABLE
Repeated invocations of grouping on the same splits with the same parameters will produce the
same groups.
|
static boolean |
TEZ_GROUPING_REPEATABLE_DEFAULT |
static String |
TEZ_GROUPING_SPLIT_BY_COUNT
Limit the number of splits in a group by the number of splits in the group
|
static boolean |
TEZ_GROUPING_SPLIT_BY_COUNT_DEFAULT |
static String |
TEZ_GROUPING_SPLIT_BY_LENGTH
Limit the number of splits in a group by the total length of the splits in the group
|
static boolean |
TEZ_GROUPING_SPLIT_BY_LENGTH_DEFAULT |
static String |
TEZ_GROUPING_SPLIT_COUNT
Specify the number of splits desired to be created
|
static String |
TEZ_GROUPING_SPLIT_MAX_SIZE
Upper bound on the size (in bytes) of a grouped split, to avoid generating excessively large splits.
|
static long |
TEZ_GROUPING_SPLIT_MAX_SIZE_DEFAULT |
static String |
TEZ_GROUPING_SPLIT_MIN_SIZE
Lower bound on the size (in bytes) of a grouped split, to avoid generating too many small splits.
|
static long |
TEZ_GROUPING_SPLIT_MIN_SIZE_DEFAULT |
static String |
TEZ_GROUPING_SPLIT_WAVES
The multiplier for available queue capacity when determining number of
tasks for a Vertex.
|
static float |
TEZ_GROUPING_SPLIT_WAVES_DEFAULT |
Constructor and Description |
---|
TezSplitGrouper() |
Modifier and Type | Method and Description |
---|---|
List<GroupedSplitContainer> |
getGroupedSplits(org.apache.hadoop.conf.Configuration conf,
List<SplitContainer> originalSplits,
int desiredNumSplits,
String wrappedInputFormatName,
SplitSizeEstimatorWrapper estimator,
SplitLocationProviderWrapper locationProvider) |
static TezSplitGrouper.TezMRSplitsGrouperConfigBuilder |
newConfigBuilder(org.apache.hadoop.conf.Configuration conf)
Builder that can be used to configure grouping in Tez
|
public static final String TEZ_GROUPING_SPLIT_COUNT
public static final String TEZ_GROUPING_SPLIT_BY_LENGTH
public static final boolean TEZ_GROUPING_SPLIT_BY_LENGTH_DEFAULT
public static final String TEZ_GROUPING_SPLIT_BY_COUNT
public static final boolean TEZ_GROUPING_SPLIT_BY_COUNT_DEFAULT
public static final String TEZ_GROUPING_SPLIT_WAVES
public static final float TEZ_GROUPING_SPLIT_WAVES_DEFAULT
public static final String TEZ_GROUPING_SPLIT_MAX_SIZE
public static final long TEZ_GROUPING_SPLIT_MAX_SIZE_DEFAULT
public static final String TEZ_GROUPING_SPLIT_MIN_SIZE
public static final long TEZ_GROUPING_SPLIT_MIN_SIZE_DEFAULT
public static final String TEZ_GROUPING_RACK_SPLIT_SIZE_REDUCTION
public static final float TEZ_GROUPING_RACK_SPLIT_SIZE_REDUCTION_DEFAULT
public static final String TEZ_GROUPING_REPEATABLE
public static final boolean TEZ_GROUPING_REPEATABLE_DEFAULT
public static final String TEZ_GROUPING_NODE_LOCAL_ONLY
public static final boolean TEZ_GROUPING_NODE_LOCAL_ONLY_DEFAULT
public List<GroupedSplitContainer> getGroupedSplits(org.apache.hadoop.conf.Configuration conf, List<SplitContainer> originalSplits, int desiredNumSplits, String wrappedInputFormatName, SplitSizeEstimatorWrapper estimator, SplitLocationProviderWrapper locationProvider) throws IOException, InterruptedException
IOException
InterruptedException
public static TezSplitGrouper.TezMRSplitsGrouperConfigBuilder newConfigBuilder(org.apache.hadoop.conf.Configuration conf)
conf
- Configuration
This will be modified in place. If
configuration values may be changed at runtime via a config file
then pass in a Configuration
that is initialized from a
config file. The parameters that are not overridden in code will
be derived from the Configuration object.TezSplitGrouper.TezMRSplitsGrouperConfigBuilder
Copyright © 2016 Apache Software Foundation. All rights reserved.