Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-8699 Enable support for common map join [Spark Branch]
  3. HIVE-8702

Extra MapTask created but not connected [Spark Branch]

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Invalid
    • None
    • None
    • Spark
    • None

    Description

      Based on Szehon's observation, there is a strange extra maptask generated but not connected. Here is the query to demonstrate:

      select * FROM
      (SELECT avg(key) as x1, value as x2 FROM src group by value) x
      JOIN
      (SELECT avg(key) as y1, value as y2 FROM src group by value) y ON (x1 = y1)
      JOIN
      (SELECT avg(key) as z1, value as z2 FROM src group by value) z ON (x1 = z1);
      

      We shouldn't generate it in the first place.

      Attachments

        Activity

          People

            szehon Szehon Ho
            xuefuz Xuefu Zhang
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: