Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-15782

query on parquet table returns incorrect result when hive.optimize.index.filter is set to true

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.2.0
    • 2.3.0
    • File Formats
    • None

    Description

      When hive.optimize.index.filter is set to true, the parquet table is filtered using the parquet column index.

      set hive.optimize.index.filter=true;
      CREATE TABLE t1 (
        name string,
        dec decimal(5,0)
      ) stored as parquet;
      
      insert into table t1 values('Jim', 3);
      insert into table t1 values('Tom', 5);
      
      select * from t1 where (name = 'Jim' or dec = 5);
      

      Only one row Jim, 3 is returned, but both should be returned.

      Attachments

        1. HIVE-15782.1.patch
          8 kB
          Aihua Xu
        2. HIVE-15782.2.patch
          12 kB
          Aihua Xu

        Activity

          People

            aihuaxu Aihua Xu
            aihuaxu Aihua Xu
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: