Details
-
New Feature
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
2.9.0
-
None
-
None
Description
There are opportunities to improve distcp delete performance and scalability with object stores, but you need to test with production datasets to determine if the optimizations work, don't run out of memory, etc.
By adding the option to save the sequence files of source, dest listings, people (myself included) can experiment with different strategies before trying to commit one which doesn't scale
Attachments
Attachments
Issue Links
- Is contained by
-
HADOOP-15209 DistCp to eliminate needless deletion of files under already-deleted directories
- Resolved
- is duplicated by
-
HADOOP-15209 DistCp to eliminate needless deletion of files under already-deleted directories
- Resolved
- relates to
-
HADOOP-15191 Add Private/Unstable BulkDelete operations to supporting object stores for DistCP
- Resolved