Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
3.4.0
-
Reviewed
Description
As mentioned in https://issues.apache.org/jira/browse/HADOOP-15211
If -update or -overwrite is being passed when distcp used, the root directory will be skipped in two occasions (CopyListing#doBuildListing & CopyCommitter#preserveFileAttributesForDirectories), which will ignore root directory's attributes.
We face the same issue when distcp huge data between clusters and it takes too much effort to update root directories attributes manually.
From the earlier ticket it's obvious why this behaviour is there, but sometime we need to enforce root directory update hence I will add a new option for distcp to enable someone (who understands the need of this and know what they are doing) to enforce the update of root directory's attributes (permissions, ownership, ...)
It should be simple one, something like this
$ hadoop distcp -p -update -updateRootDirectoryAttributes /a/b/c /a/b/d
This behaviour is optional and will be false by default. (it should not affect existing distcp users).
Attachments
Issue Links
- fixes
-
HADOOP-15211 Distcp update not preserving root directory permissions
- Resolved
- links to