Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
Description
One pitfall of multithreaded execution is that, if implemented naively, the number of files generated by an unpartitioned insert will be multiplied by mt_dop.
We should provide a mechanism to limit the number of files generated, e.g. by limiting the number of insert fragment instances (note that there is a pre-existing problem with unpartitioned inserts generating too many files).
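To make the multiplication concrete, here is a rough sketch of the arithmetic, not Impala code. It assumes each insert fragment instance writes exactly one file and that mt_dop=0 means one instance per node. The function name and the `max_writers` cap are hypothetical, used only to illustrate the kind of limit proposed above.

```python
from typing import Optional


def estimated_insert_files(num_nodes: int, mt_dop: int,
                           max_writers: Optional[int] = None) -> int:
    """Estimate files written by an unpartitioned insert.

    Assumption: one output file per insert fragment instance.
    `max_writers` is a hypothetical cap on insert fragment instances.
    """
    # With multithreaded execution each node runs mt_dop instances;
    # mt_dop=0 is treated here as one instance per node.
    instances = num_nodes * max(mt_dop, 1)
    if max_writers is not None:
        # Capping the number of writer instances bounds the file count.
        instances = min(instances, max_writers)
    return instances


if __name__ == "__main__":
    print(estimated_insert_files(10, 0))                 # no multithreading
    print(estimated_insert_files(10, 4))                 # naive mt_dop
    print(estimated_insert_files(10, 4, max_writers=8))  # with a cap
```

Under these assumptions, a 10-node cluster goes from 10 files to 40 files at mt_dop=4, and a cap of 8 writer instances brings that back down to 8.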
Issue Links
- is related to
  - IMPALA-4746 num_nodes should take any value and use that many nodes (Open)
  - IMPALA-1108 Impala should check the number of opened files/partition during insert (Closed)
- relates to
  - IMPALA-1073 Consider extending the hints for insert (Closed)