Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-26654 Test with the TPC-DS benchmark
  3. HIVE-26968

Wrong results when shared work optimizer merges TS operator with different DPP edges

    XMLWordPrintableJSON

Details

    Description

      SharedWorkOptimizer merges TableScan operators that have different DPP parents, which leads to the creation of semantically wrong query plan.

      In our environment, running TPC-DS query64 on 1TB Iceberg format table returns no rows  because of this problem. (The correct result has 7094 rows.)

      We use hive.optimize.shared.work=true, hive.optimize.shared.work.extended=true, and hive.optimize.shared.work.dppunion=false to reproduce the bug.

      Attachments

        1. TPC-DS Query64 OperatorGraph.pdf
          348 kB
          Seonggon Namgung

        Issue Links

          Activity

            People

              seonggon Seonggon Namgung
              seonggon Seonggon Namgung
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 50m
                  50m