Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
3.3.2
Description
To support recovery of comms failure during rename, the abfs client fetches the etag of the source file, and when recovering from a failure uses this tag to determine whether the rename succeeded before the failure happened
The relevant configuration option is fs.azure.enable.rename.resilience; default value is: true
- This works for files, but not directories
- this adds the overhead of a HEAD request before each rename.
- the option can be disabled by setting "fs.azure.enable.rename.resilience" to false
Note: the manifest committer collects etags during task commit and supplies them to the abfs client for the rename, which avoids the need for a HEAD call.
Attachments
Issue Links
- is duplicated by
-
HADOOP-18667 ABFS: eTag Check to mitigate Rename Failures
- Resolved
-
HADOOP-18425 [ABFS]: RenameFilePath Source File Not Found (404) error in retry loop
- Resolved
- relates to
-
HADOOP-17934 NullPointerException when no HTTP response set on AbfsRestOperation
- Resolved
-
HADOOP-18002 abfs rename idempotency broken -remove recovery
- Resolved
-
HADOOP-17979 Interface EtagSource to allow FileStatus subclasses to provide etags
- Resolved
-
MAPREDUCE-7341 Add a task-manifest output committer for Azure and GCS
- Resolved
- links to