Hive
  1. Hive
  2. HIVE-6551

group by after join with skew join optimization references invalid task sometimes

    Details

    • Type: Bug Bug
    • Status: Resolved
    • Priority: Trivial Trivial
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.13.0
    • Component/s: None
    • Labels:
      None

      Description

      For example,

      hive> set hive.auto.convert.join = true;
      hive> set hive.optimize.skewjoin = true;
      hive> set hive.skewjoin.key = 3;
      hive> 
          > EXPLAIN FROM 
          > (SELECT src.* FROM src) x
          > JOIN 
          > (SELECT src.* FROM src) Y
          > ON (x.key = Y.key)
          > SELECT sum(hash(Y.key)), sum(hash(Y.value));
      OK
      STAGE DEPENDENCIES:
        Stage-8 is a root stage
        Stage-6 depends on stages: Stage-8
        Stage-5 depends on stages: Stage-6 , consists of Stage-4, Stage-2
        Stage-4
        Stage-2 depends on stages: Stage-4, Stage-1
        Stage-0 is a root stage
      ...
      

      Stage-2 references not-existing Stage-1

        Activity

        Navis created issue -
        Navis made changes -
        Field Original Value New Value
        Attachment HIVE-6551.1.patch.txt [ 12632789 ]
        Navis made changes -
        Status Open [ 1 ] Patch Available [ 10002 ]
        Ashutosh Chauhan made changes -
        Status Patch Available [ 10002 ] Resolved [ 5 ]
        Fix Version/s 0.14.0 [ 12326450 ]
        Resolution Fixed [ 1 ]
        Harish Butani made changes -
        Fix Version/s 0.13.0 [ 12324986 ]
        Fix Version/s 0.14.0 [ 12326450 ]

          People

          • Assignee:
            Navis
            Reporter:
            Navis
          • Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development