[DRILL-4337] Drill fails to read INT96 fields from hive generated parquet files - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Blocker
Resolution: Cannot Reproduce
Affects Version/s: None
Fix Version/s: 1.14.0
Component/s: None
Labels:
None

Description

git.commit.id.abbrev=576271d
Cluster : 2 nodes running MaprFS 4.1

The data file used in the below table is generated from hive. Below is output from running the same query multiple times.

0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
Error: SYSTEM ERROR: NegativeArraySizeException

Fragment 0:0

[Error Id: 5517e983-ccae-4c96-b09c-30f331919e56 on qa-node191.qa.lab:31010] (state=,code=0)

0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
Error: SYSTEM ERROR: IllegalArgumentException: Reading past RLE/BitPacking stream.

Fragment 0:0

[Error Id: 94ed5996-d2ac-438d-b460-c2d2e41bdcc3 on qa-node191.qa.lab:31010] (state=,code=0)

0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
Error: SYSTEM ERROR: ArrayIndexOutOfBoundsException: 0

Fragment 0:0

[Error Id: 41dca093-571e-49e5-a2ab-fd69210b143d on qa-node191.qa.lab:31010] (state=,code=0)

0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
+----------------+
| timestamp_col  |
+----------------+
| null           |
| [B@7c766115    |
| [B@3fdfe989    |
| null           |
| [B@55d4222     |
| [B@2da0c8ee    |
| [B@16e798a9    |
| [B@3ed78afe    |
| [B@38e649ed    |
| [B@16ff83ca    |
| [B@61254e91    |
| [B@5849436a    |
| [B@31e9116e    |
| [B@3c77665b    |
| [B@42e0ff60    |
| [B@419e19ed    |
| [B@72b83842    |
| [B@1c75afe5    |
| [B@726ef1fb    |
| [B@51d0d06e    |
| [B@64240fb8    |
+----------------

Attached the log, hive ddl used to generate the parquet file and the parquet file itself

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

hive1_fewtypes_null.parquet
01/Feb/16 23:02
3 kB
Rahul Kumar Challapalli

Issue Links

is related to

DRILL-4742 Using convert_from timestamp_impala gives a random error

Resolved

DRILL-5495 convert_from function on top of int96 data results in ArrayIndexOutOfBoundsException

Resolved

Activity

People

Assignee:: Vitalii Diravka

Reporter:: Rahul Kumar Challapalli

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 01/Feb/16 22:58

Updated:: 13/Jul/18 17:27

Resolved:: 12/Jul/18 15:44