Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-13501

Conflicting commits to Iceberg tables leave uncommitted orphan files

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • Impala 4.5.0
    • Catalog
    • ghx-label-14

    Description

      Iceberg supports multiple writers with optimistic concurrency. Each writer can write new files which are then added to the table after a validation check to ensure that the commit does not conflict with other modifications made during the execution.

      When there was a conflicting change and the newly written files cannot be committed, there are 2 ways to proceed: the commit can be retried and rebased on top of the latest snapshot. If this cannot resolve the conflict, the change cannot be committed and the files become orphan files in the file system.

      It would be nice to remove the remaining files from an unsuccessful commit in one step. Deleting orphan files later as a table maintenance step is also a possible resolution.

      Attachments

        Activity

          People

            noemi Noemi Pap-Takacs
            noemi Noemi Pap-Takacs
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: