[IMPALA-10974] Impala cannot resolve columns of converted Iceberg table - ASF JIRA

Agile Board

Attach files

Attach Screenshot

Voters

Watch issue

Watchers

Create sub-task

Link

Clone

Update Comment Author

Replace String in Comment

Update Comment Visibility

Delete Comments

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: Impala 4.1.0
Component/s: Backend
Labels:
- impala-iceberg

Epic Color:
ghx-label-4

Description

When a regular Parquet/ORC table is converted to Iceberg via Hive, only the Iceberg metadata files need to be created. The data files can stay in place.

This causes problems when the data files don't have field ids for the schema elements. Currently Impala resolves columns in data files based on Iceberg field ids, but since they are missing, Impala raises an error or returns NULLs.

We could fallback to the default column resolution strategy when the data files lack field ids.

Attachments

Activity

Comment

This comment will be Viewable by All Users Viewable by All Users

Cancel

People

Assignee:: Zoltán Borók-Nagy

Reporter:: Zoltán Borók-Nagy

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 19/Oct/21 12:21

Updated:: 08/Nov/21 10:11

Resolved:: 04/Nov/21 17:59

Agile

View on Board

Impala cannot resolve columns of converted Iceberg table

Details

Description

Attachments

Attachments

Activity

People

Dates

Agile

Slack

Issue deployment