  1. Hadoop Common
  2. HADOOP-18072 Über-JIRA: abfs phase III: Hadoop 3.4.0 features & fixes
  3. HADOOP-18012

ABFS: Enable config controlled ETag check for Rename idempotency

Details

    Description

      To support recovery from a communications failure during a rename, the abfs client fetches the etag of the source file before the rename. When recovering from a failure, it uses this etag to determine whether the rename succeeded before the failure happened.

      The relevant configuration option is fs.azure.enable.rename.resilience; its default value is true.

      1. This works for files, but not for directories.
      2. This adds the overhead of a HEAD request before each rename.
      3. The option can be disabled by setting "fs.azure.enable.rename.resilience" to false.

      Note: the manifest committer collects etags during task commit and supplies them to the abfs client for the rename, which avoids the need for a HEAD call.
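
      The recovery flow described above can be sketched with a small in-memory model. This is only an illustration, not the abfs client code: FakeStore, RenameRecoverySketch and the rename() helper are hypothetical names, the "lost response" is simulated with a flag, and the model assumes the store keeps a file's etag unchanged across a rename. The boolean parameter stands in for fs.azure.enable.rename.resilience, and the optional knownEtag parameter stands in for an etag supplied by a caller such as the manifest committer.

```java
import java.io.FileNotFoundException;
import java.io.IOException;
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

/** Toy model of etag-based rename recovery; all names here are hypothetical. */
public class RenameRecoverySketch {

    /** In-memory stand-in for the remote store: path -> etag. */
    public static final class FakeStore {
        public final Map<String, String> etags = new HashMap<>();
        public int headCalls = 0;
        /** When true, the next rename is applied but its response is "lost". */
        public boolean dropNextRenameResponse = false;

        /** Models a HEAD request returning the etag of a path, if it exists. */
        public Optional<String> head(String path) {
            headCalls++;
            return Optional.ofNullable(etags.get(path));
        }

        /** Moves src to dst, keeping the etag (an assumption of this model). */
        public void rename(String src, String dst) throws IOException {
            String tag = etags.remove(src);
            if (tag == null) {
                throw new FileNotFoundException(src);
            }
            etags.put(dst, tag);
            if (dropNextRenameResponse) {
                dropNextRenameResponse = false;
                throw new IOException("simulated connection failure after rename");
            }
        }
    }

    /**
     * Renames src to dst. With resilience enabled, the source etag is obtained
     * up front (via HEAD unless the caller supplies knownEtag) and, after a
     * failure, compared with the destination's etag to decide whether the
     * rename had in fact already succeeded.
     */
    public static boolean rename(FakeStore store, String src, String dst,
                                 boolean resilience, String knownEtag) {
        String etag = knownEtag;
        if (resilience && etag == null) {
            etag = store.head(src).orElse(null); // the extra HEAD before rename
        }
        try {
            store.rename(src, dst);
            return true;
        } catch (IOException e) {
            if (!resilience || etag == null) {
                return false; // nothing to compare against: report the failure
            }
            // Recovery: same etag at the destination means the rename happened.
            return etag.equals(store.head(dst).orElse(null));
        }
    }

    public static void main(String[] args) {
        FakeStore store = new FakeStore();
        store.etags.put("/job/task-1", "0x8D9");
        store.dropNextRenameResponse = true;
        boolean ok = rename(store, "/job/task-1", "/out/part-1", true, null);
        System.out.println("recovered=" + ok + " headCalls=" + store.headCalls);
    }
}
```

      In this model, passing a known etag skips the HEAD request before the rename, and disabling resilience skips both the HEAD request and the recovery check, so a lost response is reported as a failure. Directories are not modelled.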

            People

              sreeb Sree Bhattacharyya
              snvijaya Sneha Vijayarajan