Hive / HIVE-15782

query on parquet table returns incorrect result when hive.optimize.index.filter is set to true

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.2.0
    • Fix Version/s: 2.3.0
    • Component/s: File Formats
    • Labels: None

      Description

      When hive.optimize.index.filter is set to true, the parquet table is filtered using the parquet column index.

      set hive.optimize.index.filter=true;
      CREATE TABLE t1 (
        name string,
        dec decimal(5,0)
      ) stored as parquet;
      
      insert into table t1 values('Jim', 3);
      insert into table t1 values('Tom', 5);
      
      select * from t1 where (name = 'Jim' or dec = 5);
      

      Only one row (Jim, 3) is returned, but both rows should be returned: (Tom, 5) also satisfies the predicate because dec = 5.
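
For comparison, the same query can be run with the setting turned off. This is a minimal sketch, assuming the table t1 and the two inserted rows from the reproduction above. The ticket only reports the wrong result with hive.optimize.index.filter=true, so this run is a way to see the result the predicate should give, not a confirmed fix.

```sql
-- Assumption: t1 already exists and holds ('Jim', 3) and ('Tom', 5).
-- Disable the index filter optimization for this session.
set hive.optimize.index.filter=false;

-- Same predicate as in the report; by the semantics of OR,
-- both ('Jim', 3) and ('Tom', 5) satisfy it.
select * from t1 where (name = 'Jim' or dec = 5);
```

If this run returns two rows while the run with the setting enabled returns one, the difference isolates the problem to the filter optimization on the Parquet read path.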

        Attachments

        1. HIVE-15782.2.patch
          12 kB
          Aihua Xu
        2. HIVE-15782.1.patch
          8 kB
          Aihua Xu

            People

            • Assignee: Aihua Xu (aihuaxu)
            • Reporter: Aihua Xu (aihuaxu)
            • Votes: 0
            • Watchers: 4