Hive / HIVE-14096

Extend RS dedup logic to merge GBy operators

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.2.0
    • Fix Version/s: None
    • Component/s: Physical Optimizer
    • Labels: None

    Description

      Since we always generate a map-side GBy at plan generation time, there are occasions when we could collapse the GBy operators after the RS dedup optimization. The GBy would then be executed in a single stage with mode = complete.

      Example in reduce_deduplicate_extended2.q.out:

      SELECT f.key, g.value
      FROM src f
      JOIN src g ON (f.key = g.key AND f.value = g.value)
      GROUP BY g.value, f.key
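
      To inspect the plan for this query, one could run it under EXPLAIN. This is only a minimal sketch: the three properties below are existing Hive settings, but the values shown are assumptions and may not match those used by the test file.

      -- Assumed settings: enable map-side aggregation and RS dedup.
      SET hive.map.aggr=true;
      SET hive.optimize.reducededuplication=true;
      -- Assumed value: lower the reducer threshold so dedup applies to small test data.
      SET hive.optimize.reducededuplication.min.reducer=1;

      EXPLAIN
      SELECT f.key, g.value
      FROM src f
      JOIN src g ON (f.key = g.key AND f.value = g.value)
      GROUP BY g.value, f.key;

      If RS dedup removes the RS between the two GBy operators, both should appear in the same stage of the plan. That pair is the candidate for merging into a single GBy with mode = complete.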
      

            People

              Assignee: Unassigned
              Reporter: Jesús Camacho Rodríguez (jcamacho)