Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
1.6.0
-
None
-
None
Description
For broadcast joins, currently Drill optimizes the data transfer across the network for broadcast table by sending a single copy to the receiving node which then distributes it to all minor fragments running on that particular node. However, each minor fragment builds its own hash table (for a hash join) using this broadcast table. We can substantially improve the memory footprint by having a shared copy of the hash table among multiple minor fragments on a node.