Apache Drill / DRILL-4337

Drill fails to read INT96 fields from Hive-generated Parquet files

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Cannot Reproduce
    • Affects Version/s: None
    • Fix Version/s: 1.14.0
    • Component/s: None
    • Labels: None

    Description

      git.commit.id.abbrev=576271d
      Cluster : 2 nodes running MaprFS 4.1

      The data file backing the table below was generated by Hive. Running the same query several times produces a different result each time:

      0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
      Error: SYSTEM ERROR: NegativeArraySizeException
      
      Fragment 0:0
      
      [Error Id: 5517e983-ccae-4c96-b09c-30f331919e56 on qa-node191.qa.lab:31010] (state=,code=0)
      
      0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
      Error: SYSTEM ERROR: IllegalArgumentException: Reading past RLE/BitPacking stream.
      
      Fragment 0:0
      
      [Error Id: 94ed5996-d2ac-438d-b460-c2d2e41bdcc3 on qa-node191.qa.lab:31010] (state=,code=0)
      
      0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
      Error: SYSTEM ERROR: ArrayIndexOutOfBoundsException: 0
      
      Fragment 0:0
      
      [Error Id: 41dca093-571e-49e5-a2ab-fd69210b143d on qa-node191.qa.lab:31010] (state=,code=0)
      
      0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
      +----------------+
      | timestamp_col  |
      +----------------+
      | null           |
      | [B@7c766115    |
      | [B@3fdfe989    |
      | null           |
      | [B@55d4222     |
      | [B@2da0c8ee    |
      | [B@16e798a9    |
      | [B@3ed78afe    |
      | [B@38e649ed    |
      | [B@16ff83ca    |
      | [B@61254e91    |
      | [B@5849436a    |
      | [B@31e9116e    |
      | [B@3c77665b    |
      | [B@42e0ff60    |
      | [B@419e19ed    |
      | [B@72b83842    |
      | [B@1c75afe5    |
      | [B@726ef1fb    |
      | [B@51d0d06e    |
      | [B@64240fb8    |
      +----------------+
      

      Attached are the log, the Hive DDL used to generate the Parquet file, and the Parquet file itself.
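
For context on the `[B@...` values in the last run: that is how Java prints a `byte[]` reference, which suggests the raw INT96 bytes reached the client undecoded. The sketch below is not Drill's reader code. It only illustrates the layout Hive and Impala conventionally use for INT96 timestamps (8 bytes of nanoseconds within the day, then 4 bytes of Julian day number, both little-endian). The class name `Int96Decoder` and the `encode` helper are hypothetical and exist only for this example.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.time.Instant;

public class Int96Decoder {
    // Julian day number of the Unix epoch, 1970-01-01.
    public static final long JULIAN_DAY_OF_EPOCH = 2_440_588L;
    public static final long SECONDS_PER_DAY = 86_400L;

    /** Decodes a 12-byte INT96 value into an Instant; null stays null. */
    public static Instant decode(byte[] int96) {
        if (int96 == null) {
            return null; // nullable column, as in the query output
        }
        if (int96.length != 12) {
            throw new IllegalArgumentException(
                "INT96 must be 12 bytes, got " + int96.length);
        }
        ByteBuffer buf = ByteBuffer.wrap(int96).order(ByteOrder.LITTLE_ENDIAN);
        long nanosOfDay = buf.getLong(); // first 8 bytes
        long julianDay = buf.getInt();   // last 4 bytes
        long epochSeconds = (julianDay - JULIAN_DAY_OF_EPOCH) * SECONDS_PER_DAY;
        // ofEpochSecond folds the nanosecond adjustment into seconds as needed.
        return Instant.ofEpochSecond(epochSeconds, nanosOfDay);
    }

    /** Hypothetical helper: builds an INT96 value for demonstration only. */
    public static byte[] encode(int julianDay, long nanosOfDay) {
        return ByteBuffer.allocate(12)
            .order(ByteOrder.LITTLE_ENDIAN)
            .putLong(nanosOfDay)
            .putInt(julianDay)
            .array();
    }

    public static void main(String[] args) {
        byte[] raw = encode(2_440_589, 1_500_000_000L);
        System.out.println(decode(raw));
    }
}
```

The decoded value is in whatever time zone the writer assumed (Hive normally normalizes to UTC before writing), so any zone adjustment is left to the caller.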

          People

            Assignee: Vitalii Diravka (vitalii)
            Reporter: Rahul Kumar Challapalli (rkins)
            Votes: 0
            Watchers: 4

            Dates

              Created:
              Updated:
              Resolved:
