Hive / HIVE-21267

Extend HiveRelColumnsAlignment to reorder group-by and join keys on decreasing NDV automatically


Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: CBO

    Description

      HiveRelColumnsAlignment was introduced to align the order of columns in join, group-by, and order-by operators across the plan pipeline, with the goal of making ReduceDeduplication more effective and thus reducing data shuffle.

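      The optimization could be extended to also reorder group-by and join keys by decreasing number of distinct values (NDV). Comparing the highest-NDV key first would likely speed up key comparisons at runtime, since two unequal rows are more likely to differ on the first key compared.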
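
A minimal sketch of the proposed key ordering, for illustration only. It is not the HiveRelColumnsAlignment implementation: the class `NdvKeyOrdering`, the method `orderByDecreasingNdv`, the column names, and the NDV figures are all hypothetical, and the NDV estimates are assumed to come from column statistics supplied as a plain map.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Hive: orders key columns by decreasing NDV.
public class NdvKeyOrdering {

    // Returns a copy of `keys` sorted so that the column with the highest
    // estimated NDV comes first. Columns without an estimate are treated as
    // NDV 0 and end up last. List.sort is stable, so ties keep their
    // original relative order.
    static List<String> orderByDecreasingNdv(List<String> keys, Map<String, Long> ndv) {
        List<String> ordered = new ArrayList<>(keys);
        ordered.sort(
            Comparator.comparingLong((String k) -> ndv.getOrDefault(k, 0L)).reversed());
        return ordered;
    }

    public static void main(String[] args) {
        // Made-up statistics for three group-by keys.
        Map<String, Long> ndv = new HashMap<>();
        ndv.put("country", 200L);
        ndv.put("user_id", 1_000_000L);
        ndv.put("day", 365L);

        List<String> keys = Arrays.asList("country", "user_id", "day");
        System.out.println(orderByDecreasingNdv(keys, ndv)); // prints [user_id, day, country]
    }
}
```

For join keys, the same permutation would have to be applied to both inputs so that corresponding key columns stay paired. Any reordering would also need to remain compatible with the alignment across operators that the rule already performs.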

          People

            jfs John Sherman
            jcamacho Jesús Camacho Rodríguez
            Votes: 0
            Watchers: 3
