Details
Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Description
When Hive stores array data in Parquet format, it creates a schema like the following for such columns:
arr_n_0 ARRAY<INT>
optional group arr_n_0 (LIST) { repeated group bag { optional int32 array_element; } }
Before the changes, Drill returned a result like this for such a column:
{"bag":[{"array_element":1},{"array_element":2}]}
After the changes, Drill reads only the array element data, without the additional keys "bag" and "array_element":
[1,2]
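The before/after shapes can be illustrated with a small Python sketch. This is not Drill's reader code; it only mimics the transformation on already-decoded values. The function name `unwrap_hive_list` and its parameters are hypothetical, and the key names "bag" and "array_element" are taken from the sample schema above.

```python
def unwrap_hive_list(value, wrapper_key="bag", element_key="array_element"):
    """Collapse a Hive-style LIST wrapper into a plain list (illustrative only)."""
    # A wrapped list is a dict whose only key is the repeated group name.
    if isinstance(value, dict) and set(value) == {wrapper_key}:
        return [
            # Elements are optional in the schema, so a missing key maps to None.
            # Recursing handles nested arrays such as ARRAY<ARRAY<INT>>.
            unwrap_hive_list(item.get(element_key), wrapper_key, element_key)
            for item in value[wrapper_key]
        ]
    # Anything else (scalars, None, unrelated maps) is returned unchanged.
    return value


before = {"bag": [{"array_element": 1}, {"array_element": 2}]}
print(unwrap_hive_list(before))
```

Applied to the sample above, the wrapper and element keys disappear and only the element values remain.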
Please read the design doc linked to the parent task for more details.