Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-1932

Hive Sync should not always update last_commit_time_sync

    XMLWordPrintableJSON

Details

    Description

      In Hive Sync 

      org.apache.hudi.hive.HiveSyncTool#syncHoodieTable()

      it includes schema update and partition update.

      Regardless of having update or not, the logic is to always update the table with a new timestamp for the last_commit_time_sync property.

      This will result in having new versions in AWS Glue tables for example, which will eventually lead to exceeding table version limit.

      Attachments

        Issue Links

          Activity

            People

              xushiyan Raymond Xu
              xushiyan Raymond Xu
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated: