Details
- Bug
- Status: Closed
- Critical
- Resolution: Fixed
- 2.0.4-alpha, 2.2.0
- None
- Reviewed
Description
CombineFileInputFormat can easily create splits whose data comes from many different locations, particularly during the last pass, which creates the "global" splits. We observe that such splits often run afoul of the mapreduce.job.max.split.locations check done by JobSplitWriter.
The default value of mapreduce.job.max.split.locations is 10, and on any decent-sized cluster CombineFileInputFormat creates splits whose location counts are well above this limit.
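To illustrate the arithmetic, here is a minimal, self-contained sketch. It is a simplified model, not Hadoop code: the class SplitLocationSketch and its methods are hypothetical, and it assumes a combined split reports every host that holds a replica of any of its blocks. The only value taken from the description is the default limit of 10.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Simplified model (hypothetical names, no Hadoop dependency):
// a "block" is just the list of hosts that hold its replicas.
class SplitLocationSketch {

    // Default of mapreduce.job.max.split.locations, per the description.
    static final int DEFAULT_MAX_SPLIT_LOCATIONS = 10;

    // Assumption: a combined split reports the union of its blocks' hosts.
    static Set<String> combinedLocations(List<List<String>> blocks) {
        Set<String> hosts = new LinkedHashSet<>();
        for (List<String> replicas : blocks) {
            hosts.addAll(replicas);
        }
        return hosts;
    }

    // True when the split carries more locations than the limit allows.
    static boolean exceedsLimit(Set<String> locations, int maxLocations) {
        return locations.size() > maxLocations;
    }

    // Builds blockCount blocks whose replicas all sit on distinct hosts,
    // the worst case for a "global" split on a large cluster.
    static List<List<String>> spreadBlocks(int blockCount, int replication) {
        List<List<String>> blocks = new ArrayList<>();
        for (int b = 0; b < blockCount; b++) {
            List<String> replicas = new ArrayList<>();
            for (int r = 0; r < replication; r++) {
                replicas.add("host" + (b * replication + r));
            }
            blocks.add(replicas);
        }
        return blocks;
    }

    public static void main(String[] args) {
        // 12 blocks with 3 replicas each, none sharing a host: 36 locations.
        Set<String> locations = combinedLocations(spreadBlocks(12, 3));
        System.out.println("locations=" + locations.size()
                + " exceedsDefault="
                + exceedsLimit(locations, DEFAULT_MAX_SPLIT_LOCATIONS));
    }
}
```

Under these assumptions, a split that combines only a dozen widely spread blocks already carries 36 locations, more than three times the default limit.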
Attachments
Issue Links
- is related to
  - MAPREDUCE-1943 Implement limits on per-job JobConf, Counters, StatusReport, Split-Sizes (Closed)
  - MAPREDUCE-4146 Support limits on task status string length and number of block locations in branch-2 (Closed)