Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-26654 Test with the TPC-DS benchmark
  3. HIVE-27269

VectorizedMapJoin returns wrong result for TPC-DS query 97

    XMLWordPrintableJSON

Details

    Description

      TPC-DS query 97 returns wrong results when hive.auto.convert.join and hive.vectorized.execution.enabled are set to true.

       

      Result of query 97 on 1TB text dataset:

      CommonMergeJoinOperator(hive.auto.convert.join=false): 534151529, 284185746, 84163

      MapJoinOperator(hive.auto.convert.join=true, hive.vectorized.execution.enabled=false): 534151529, 284185746, 84163

      VectorMapJoinOperator(hive.auto.convert.join=true, hive.vectorized.execution.enabled=true): 534151529, 284185388, 84163

       

      Also I observed that VectorizedMapJoin returns different results for 100GB dataset when I run query 97 twice, but I could not reproduce it since then.

      Attachments

        Issue Links

          Activity

            People

              seonggon Seonggon Namgung
              seonggon Seonggon Namgung
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: