Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-284

Column pruning after join+group-by query

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.3.0, 0.6.0
    • 0.4.0
    • Query Processor
    • None

    Description

      The query is:

      explain INSERT OVERWRITE TABLE t
      SELECT t1.r, t2.c, sum(t1.v * t2.v)
      FROM t1 join t2 on t1.c = t2.r
      GROUP BY t1.r, t2.c;

      The FileSinkOperator after the join is serializing all 6 columns from the 2 tables (both have 3 columns: r, c, v) instead of the 4 that is needed later.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              zshao Zheng Shao
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: