IMPALA-3701

Evaluate compressing Runtime filters to save coordinator network bandwidth

Details

    Description

      When running complex queries on large clusters with many runtime filters, the coordinator quickly becomes network bound due to the extra incoming and outgoing runtime filter traffic. Once the coordinator is network bound, all other fragments in the cluster are negatively affected, because they block while shuffling or broadcasting data to the coordinator node.
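
      To make the scaling concrete, here is a rough back-of-envelope sketch. It assumes a simplified model that this issue does not spell out: every node sends one partial copy of each filter to the coordinator, and the coordinator sends one aggregated copy of each filter back to every node. The function name, the filter count (10), and the filter size (1 MiB) are hypothetical; only the node count (32) comes from the test described here.

```python
def coordinator_filter_traffic(num_nodes, num_filters, filter_bytes):
    """Estimate (incoming, outgoing) bytes at the coordinator.

    Assumed model, not taken from the issue: each node ships one partial
    copy of every filter to the coordinator, and the coordinator ships
    one aggregated copy of every filter back to each node.
    """
    incoming = num_nodes * num_filters * filter_bytes
    outgoing = num_nodes * num_filters * filter_bytes
    return incoming, outgoing


# Hypothetical workload: 32 nodes, 10 filters, 1 MiB per filter.
incoming, outgoing = coordinator_filter_traffic(
    num_nodes=32, num_filters=10, filter_bytes=1 << 20
)
print(f"incoming: {incoming / 2**20:.0f} MiB, outgoing: {outgoing / 2**20:.0f} MiB")
```

      Under this model the traffic at the coordinator grows with nodes times filters, while each non-coordinator node only handles traffic proportional to the number of filters, which is consistent with the coordinator saturating first.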

      This bottleneck was identified while running large-scale tests on EC2 nodes with less-than-ideal network throughput.

      The attached PNG shows aggregate network throughput across the 32 nodes in the cluster, with the coordinator in red.

      Compression should alleviate this bottleneck, but we should also consider other solutions.
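
      As a minimal sketch of the compression idea, the snippet below builds a sparsely populated bitmap as a stand-in for a runtime filter, compresses it before it would go on the wire, and checks that it round-trips. Everything here is an assumption for illustration: the helper names are hypothetical, zlib is only a convenient stand-in codec, and the bitmap layout is not Impala's actual filter format.

```python
import random
import zlib


def make_sparse_bitmap(num_bytes, num_set_bits, seed=0):
    """Build a mostly-zero bitmap as a stand-in for a lightly filled filter."""
    rng = random.Random(seed)
    bitmap = bytearray(num_bytes)
    for _ in range(num_set_bits):
        bit = rng.randrange(num_bytes * 8)
        bitmap[bit // 8] |= 1 << (bit % 8)
    return bytes(bitmap)


def compress_filter(bitmap):
    """Compress a filter payload; level 1 favors speed over ratio."""
    return zlib.compress(bitmap, 1)


def decompress_filter(payload):
    """Restore the original filter bytes on the receiving side."""
    return zlib.decompress(payload)


# Hypothetical filter: 1 MiB bitmap with 10,000 randomly chosen bit positions.
bitmap = make_sparse_bitmap(1 << 20, 10_000)
payload = compress_filter(bitmap)
assert decompress_filter(payload) == bitmap
print(f"raw: {len(bitmap)} bytes, compressed: {len(payload)} bytes")
```

      The benefit depends on how full the filter is: a sparsely populated bitmap compresses well, while a densely populated one looks close to random data and may barely shrink, so any evaluation would need to measure real filters.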

      Attachments

        1. query17.sql.2.out
          1.26 MB
          Mostafa Mokhtar
        2. image-2016-06-08-22-55-36-966.png
          43 kB
          Mostafa Mokhtar

            People

              henryr Henry Robinson
              mmokhtar Mostafa Mokhtar
              Votes: 0
              Watchers: 4