Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-20393

Semijoin Reduction : markSemiJoinForDPP behaves inconsistently

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • None
    • None
    • None

    Description

      markSemiJoinForDPP has multiple issues,

       

      • Uses map tsOps which is wrong as it disallows going thru same TS which may have filters from more than 1 semijoin edges. This results in inconsistent plans for same query as semijoin edges may be processed in different order each time.
      • Uses getColumnExpr() which is not as robust as extractColumn() thus resulting in NPEs.
      • The logic to mark an edge useful when NPE is hit may end up having bad edge.

      cc gopalv

      Attachments

        1. HIVE-20393.2.patch
          5 kB
          Deepak Jaiswal
        2. HIVE-20393.1.patch
          3 kB
          Deepak Jaiswal

        Issue Links

          Activity

            People

              djaiswal Deepak Jaiswal
              djaiswal Deepak Jaiswal
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: