Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-4667

Improve memory footprint of broadcast joins

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.6.0
    • None
    • None

    Description

      For broadcast joins, currently Drill optimizes the data transfer across the network for broadcast table by sending a single copy to the receiving node which then distributes it to all minor fragments running on that particular node. However, each minor fragment builds its own hash table (for a hash join) using this broadcast table. We can substantially improve the memory footprint by having a shared copy of the hash table among multiple minor fragments on a node.

      Attachments

        Activity

          People

            ben-zvi Boaz Ben-Zvi
            amansinha100 Aman Sinha
            Votes:
            1 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated: