Details
- Type: Bug
- Status: Patch Available
- Priority: Major
- Resolution: Unresolved
Description
Since MAPREDUCE-1820, SplitSampler samples only the first maxSplitsSampled splits. In general, it is preferable to sample splits spread across the whole input rather than only the first N splits.
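To make the difference concrete, here is a minimal sketch of the two index-selection strategies. It is not the attached patch and does not use the Hadoop API: the class `SplitIndexSketch` and the methods `firstN` and `strided` are hypothetical names, and splits are represented only by their indices. The strided variant assumes a fixed step of `totalSplits / n`, which is one simple way to spread the sample over the input.

```java
import java.util.Arrays;

// Hypothetical illustration only; not part of Hadoop or of the attached patch.
public class SplitIndexSketch {

    // Behavior described in this issue: take the first N split indices.
    public static int[] firstN(int totalSplits, int maxSplitsSampled) {
        int n = Math.max(0, Math.min(maxSplitsSampled, totalSplits));
        int[] picked = new int[n];
        for (int i = 0; i < n; i++) {
            picked[i] = i;
        }
        return picked;
    }

    // Alternative: step through all splits at a fixed stride so the
    // sampled splits are spread over the whole input.
    public static int[] strided(int totalSplits, int maxSplitsSampled) {
        int n = Math.max(0, Math.min(maxSplitsSampled, totalSplits));
        if (n == 0) {
            return new int[0];
        }
        int step = totalSplits / n; // at least 1, because n <= totalSplits
        int[] picked = new int[n];
        for (int i = 0; i < n; i++) {
            picked[i] = i * step; // largest index is (n - 1) * step < totalSplits
        }
        return picked;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(firstN(10, 3)));  // [0, 1, 2]
        System.out.println(Arrays.toString(strided(10, 3))); // [0, 3, 6]
    }
}
```

With 10 splits and maxSplitsSampled = 3, the first strategy never looks past the third split, while the strided one reaches into the second half of the input. If the input is ordered or clustered by key, the first strategy can yield a skewed sample.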
Attachments
Issue Links
- relates to
  - MAPREDUCE-5157 Sort in hadoop-1 examples is not binary compatible with hadoop-2 mapred.lib (Closed)
  - MAPREDUCE-1820 InputSampler does not create a deep copy of the key object when creating a sample, which causes problems with some formats like SequenceFile<Text,Text> (Closed)