Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-6822 Provide a query option to not shuffle on distinct exprs
  3. IMPALA-6867

Impala 2.12 & 3.0 Docs: Provide a query option to not shuffle on distinct exprs

    Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: Impala 3.0, Impala 2.12.0
    • Fix Version/s: Impala 3.0, Impala 2.12.0
    • Component/s: Docs
    • Labels:
      None
    • Epic Color:
      ghx-label-3

      Description

      https://gerrit.cloudera.org/#/c/9949/

      New query option:
      SHUFFLE_DISTINCT_EXPRS

      This options controls the shuffling behavior when a query has both grouping and distinct exprs. Impala can optionally include the distinct exprs in the hash exchange of the first aggregation phase to spread the data among more nodes. However, this plan requires another hash exchange on the grouping exprs in the second phase which is not required when omitting the distinct exprs in the first phase. Turning it off is recommended if the NDVs of the grouping exprs is high.

        Attachments

          Activity

            People

            • Assignee:
              arodoni_cloudera Alex Rodoni
              Reporter:
              arodoni_cloudera Alex Rodoni
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: