Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-3902 Multi-threaded query execution
  3. IMPALA-9015

Use better algorithm for allocating scan ranges to finstances within a daemon in schedule

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • Impala 3.4.0
    • Distributed Exec
    • None
    • ghx-label-9

    Description

      Currently the scheduler uses a single-pass algorithm to allocate scan ranges to nodes. It has several deficiencies:

      • It doesn't guarantee that the desired number of instances are created. E.g. if mt_dop is 4, there are 3 impalads and 9 scan ranges, it should create 3 instances per impala but doesn't reliably.
      • It tends to over-allocate to the first instances it visits.
      • The result depends quite a bit on the input order of the scan ranges.

      Attachments

        Activity

          People

            tarmstrong Tim Armstrong
            tarmstrong Tim Armstrong
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: