Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-10254

Load data files via Iceberg for Iceberg Tables

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • None
    • Catalog

    Description

      Currently we still load the file descriptors of an Iceberg table via recursive file listing.

      This lists too many files, e.g. metadata files, files that are being written (can later throw checksum errors), files from aborted INSERTs, removed files, etc.

      We should use the Iceberg API to load the file descriptors corresponding to the table snapshot. Iceberg DataFiles might also already contain the split offsets.

      Attachments

        Activity

          People

            tmate Tamas Mate
            boroknagyz Zoltán Borók-Nagy
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: