Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-10358

Correct Iceberg type mappings

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • Impala 4.0.0
    • None

    Description

      The Iceberg format spec defines what types to use for different file formats, e.g.:

      https://iceberg.apache.org/spec/#parquet

      Impala should follow the specification:

      • Strings in Iceberg tables should be annotated with UTF8 (when file format is Parquet)
      • Revise fixed(L) <-> CHAR(L) mapping
        • CHAR(L) in Impala is annotated with UTF8
        • fixed(L) is for fixed-length binary data
        • We should probably remove this mapping

      UPDATE: A recent pull request added a new Iceberg class, HiveSchemaConverter. We should use this class to convert between Iceberg and Hive schemas.

      Attachments

        Issue Links

          Activity

            People

              boroknagyz Zoltán Borók-Nagy
              boroknagyz Zoltán Borók-Nagy
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: