Apache Drill / DRILL-4337

Drill fails to read INT96 fields from Hive-generated Parquet files

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Cannot Reproduce
    • Affects Version/s: None
    • Fix Version/s: 1.14.0
    • Component/s: None
    • Labels: None

    Description

      git.commit.id.abbrev=576271d
      Cluster : 2 nodes running MaprFS 4.1

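      The data file backing the table queried below was generated by Hive. The output below comes from running the same query several times in a row: three runs fail with three different errors, and the fourth returns rows but prints the non-null values as raw byte-array references instead of timestamps.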

      0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
      Error: SYSTEM ERROR: NegativeArraySizeException
      
      Fragment 0:0
      
      [Error Id: 5517e983-ccae-4c96-b09c-30f331919e56 on qa-node191.qa.lab:31010] (state=,code=0)
      
      0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
      Error: SYSTEM ERROR: IllegalArgumentException: Reading past RLE/BitPacking stream.
      
      Fragment 0:0
      
      [Error Id: 94ed5996-d2ac-438d-b460-c2d2e41bdcc3 on qa-node191.qa.lab:31010] (state=,code=0)
      
      0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
      Error: SYSTEM ERROR: ArrayIndexOutOfBoundsException: 0
      
      Fragment 0:0
      
      [Error Id: 41dca093-571e-49e5-a2ab-fd69210b143d on qa-node191.qa.lab:31010] (state=,code=0)
      
      0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
      +----------------+
      | timestamp_col  |
      +----------------+
      | null           |
      | [B@7c766115    |
      | [B@3fdfe989    |
      | null           |
      | [B@55d4222     |
      | [B@2da0c8ee    |
      | [B@16e798a9    |
      | [B@3ed78afe    |
      | [B@38e649ed    |
      | [B@16ff83ca    |
      | [B@61254e91    |
      | [B@5849436a    |
      | [B@31e9116e    |
      | [B@3c77665b    |
      | [B@42e0ff60    |
      | [B@419e19ed    |
      | [B@72b83842    |
      | [B@1c75afe5    |
      | [B@726ef1fb    |
      | [B@51d0d06e    |
      | [B@64240fb8    |
      +----------------+
      

      Attached are the log, the Hive DDL used to generate the Parquet file, and the Parquet file itself.
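For context on the `[B@7c766115`-style values in the last run: that is the default string Java prints for a byte array, which suggests the column is being surfaced as raw bytes rather than as timestamps. The sketch below shows how a 12-byte Parquet INT96 timestamp is conventionally laid out (8 little-endian bytes of nanoseconds within the day, followed by a 4-byte little-endian Julian day number) and how it could be decoded. It is not Drill's reader code; the function name is hypothetical, and treating the result as UTC is an assumption.

```python
import struct
from datetime import datetime, timedelta, timezone

# Julian day number of 1970-01-01, the Unix epoch.
JULIAN_DAY_OF_UNIX_EPOCH = 2440588


def decode_int96_timestamp(raw: bytes) -> datetime:
    """Decode a 12-byte Parquet INT96 timestamp (hypothetical helper).

    Assumes the conventional layout: 8 bytes of nanoseconds within the
    day, then 4 bytes of Julian day number, both little-endian. The
    result is treated as UTC, which is an assumption of this sketch.
    """
    if len(raw) != 12:
        raise ValueError(f"INT96 value must be 12 bytes, got {len(raw)}")
    nanos_of_day, julian_day = struct.unpack("<qi", raw)
    days_since_epoch = julian_day - JULIAN_DAY_OF_UNIX_EPOCH
    # datetime only has microsecond resolution, so sub-microsecond
    # digits are dropped here.
    return datetime(1970, 1, 1, tzinfo=timezone.utc) + timedelta(
        days=days_since_epoch, microseconds=nanos_of_day // 1000
    )


if __name__ == "__main__":
    # Build a sample value by hand: noon on Julian day 2457420.
    sample = struct.pack("<qi", 12 * 3600 * 10**9, 2457420)
    print(decode_int96_timestamp(sample).isoformat())
```

A reader that skips this decoding step and hands the 12 bytes through unchanged would show a binary value where a timestamp is expected, which matches the shape of the last result set above. It does not by itself explain the intermittent exceptions in the earlier runs.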

      Attachments

        1. hive1_fewtypes_null.parquet
          3 kB
          Rahul Kumar Challapalli

            People

              Assignee: Vitalii Diravka (vitalii)
              Reporter: Rahul Kumar Challapalli (rkins)