Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
None
-
None
-
None
-
None
Description
During the investigation of a customer issue, I found that tez generated a dag plan containing >4k tasks. It failed for hive because of bucket number limitations (4k). It can be configured properly, e.g. bigger splits (tez.grouping.min-size), but maybe it would be more convenient for users to config a hard limit for the number of splits.