Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-26447

Vectorization: wrong results when filter on repeating map key orc table

    XMLWordPrintableJSON

Details

    • Reviewed

    Description

      Example reproducible case:

       

      set hive.vectorized.execution.enabled=true;

      set hive.fetch.task.conversion=none;

      create temporary table foo (id int, x map<string,int>) stored as orc;
      insert into foo values(1, map('ABC', 9)), (2, map('ABC', 7)), (3, map('ABC', 8)), (4, map('ABC', 9));

      select id from foo where x['ABC']=9;

      this only gives 1, when correct result should be 1,4

      For every VectorizedRowBatch, only the first row is checked.  

      This seems to be a corner case of ORC table have repeating string type key for map field in the MapColumnVector.

      Attachments

        Issue Links

          Activity

            People

              yigress Yi Zhang
              yigress Yi Zhang
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 3.5h
                  3.5h