Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-10569

Impala should determine Iceberg data file format from Iceberg metadata

    XMLWordPrintableJSON

Details

    • ghx-label-7

    Description

      When Impala creates an Iceberg table it sets HMS table property 'iceberg.file_format' to indicate the underlying data file format.

      However, when the table was created by Hive or Spark, we don't have this property and Impala assumes that the data file format is PARQUET. This assumption is just a wild guess, and when it's wrong Impala raises an error during query execution.

      Instead of only checking the table property, Impala could also try to determine the file format based on Iceberg metadata.

      Attachments

        Issue Links

          Activity

            People

              noemi Noemi Pap-Takacs
              boroknagyz Zoltán Borók-Nagy
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: