Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-2475

Rolling Upgrade downgrade story for 0.10 & enabling metadata

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Task
    • Status: Closed
    • Blocker
    • Resolution: Fixed
    • None
    • 0.10.0
    • None

    Description

      Upgrade downgrade infra for enabling metadata.

       

      If user is having a writer process and clustering/compaction running async.

       

      • New synchronous metadata design, has a constraint that once metadata table is bootstrapped, all commits will happen synchronously. In other words, there is no catch up business wrt datatable.  

      So, it may not be feasible to do rolling upgrade (i.e. upgrade writer first while async compaction is running) and then upgrade async compaction. 

      Bootstrap has to be done by stopping all processes and then we can restart all other processes one by one (by using the upgraded hudi library) w/ metadata enabled.  

      This is the only viable option I can think of. 

      1. Stop all processes. Upgrade to hudi to a version w/ synchronous metadata. bring up one writer process w/ metadata config enabled. this will bootstrap the metadata table. and from there on, any new commits by the writer will do synchronous updates to metadata.  

      Note: users can choose to upgrade via hudi-cli if need be. but easier would be to just start the writer. Expect some delay for first commit since bootstrap will be happening. 

      2. Once first commit in previous writer process completes successfully, we can restart all other processes. Upgrade the async table service (to hudi version w/ metadata enabled) and restart it. Ensure metadata table is enabled across all processes.  Even if missed on one, could result in data loss.

       

      By this, once metadata table is bootstrapped, any new commits from all processes will be synced to metadata. 

       

       

       

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            manojg Manoj Govindassamy
            shivnarayan sivabalan narayanan
            Prashant Wason
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment