Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-11971

Simple syntax to Compact, RePartition, Clean Orphans etc for Iceberg Tables

    XMLWordPrintableJSON

Details

    • Epic
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Frontend
    • Iceberg table maintenance tasks
    • ghx-label-9

    Description

      Impala supports overwriting iceberg tables. Overwriting iceberg tables / partitions can be used as an cheap way to implement the below without the user having to run spark jobs

      • compact small files
      • rewrite partitions according to latest spec
      • merge deltas due to deletes and updates (if any) due to Merge-on-read strategy
      • delete orphan files in table/partition

      Doing all of this as part of an overwrite partition is not intuitive. We should support a syntax verb like `compact` or `consolidate` or `tune` to do these operations 

      alter table compact table;
      alter table compact table partition <>;
      alter table compact table partition <>;

      Attachments

        Activity

          People

            Unassigned Unassigned
            myloginid@gmail.com Manish Maheshwari
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated: