Details
- Type: Improvement
- Status: Closed
- Priority: Major
- Resolution: Fixed
Description
hive.exec.parallel lets independent stages of a query run in parallel, but it applies only to map-reduce tasks. Large multi-insert queries produce many MoveTasks, and these are all executed sequentially on the client. There is no real reason for that: the stage graph already captures the dependencies between tasks and therefore knows which ones can run concurrently, so the MoveTasks could be run in parallel as well.
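To make the idea concrete, here is a minimal sketch of dependency-driven parallel execution. It is not Hive's implementation: it is written in Python for brevity, and `run_stage_graph`, `run_task`, and the task names in the example are hypothetical. It only assumes that the stage graph can be expressed as a mapping from each task to the tasks it depends on.

```python
from concurrent.futures import FIRST_COMPLETED, ThreadPoolExecutor, wait


def run_stage_graph(deps, run_task, max_workers=8):
    """Run every task in `deps`, starting each one once its prerequisites finish.

    `deps` maps a task name to the set of task names it depends on.
    Returns the task names in the order they completed.
    """
    remaining = {task: set(parents) for task, parents in deps.items()}
    finished = []
    in_flight = {}  # future -> task name

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        while remaining or in_flight:
            # Launch every task whose prerequisites have all completed.
            done_so_far = set(finished)
            ready = [t for t, parents in remaining.items() if parents <= done_so_far]
            for task in ready:
                del remaining[task]
                in_flight[pool.submit(run_task, task)] = task

            if not in_flight:
                # Nothing is running and nothing can start.
                raise ValueError("dependency cycle or unknown prerequisite")

            # Block until at least one running task finishes, then record it.
            done, _ = wait(in_flight, return_when=FIRST_COMPLETED)
            for future in done:
                future.result()  # re-raise any task failure
                finished.append(in_flight.pop(future))
    return finished


if __name__ == "__main__":
    # Hypothetical multi-insert plan: one map-reduce stage feeding three moves.
    stage_graph = {
        "mapred-1": set(),
        "move-a": {"mapred-1"},
        "move-b": {"mapred-1"},
        "move-c": {"mapred-1"},
        "stats": {"move-a", "move-b", "move-c"},
    }
    print(run_stage_graph(stage_graph, lambda task: None))
```

In this sketch the three move tasks become ready at the same time and are submitted together, while `stats` waits for all of them. Tasks run this way must not share unsynchronized state; the linked HIVE-9665 reports a race condition that came out of this optimization.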
Attachments
Issue Links
- breaks
  - HIVE-9665 Parallel move task optimization causes race condition (Closed)
- relates to
  - HIVE-14204 Optimize loading dynamic partitions (Closed)