Details
-
Sub-task
-
Status: Resolved
-
Blocker
-
Resolution: Fixed
-
3.0.0-alpha1
Description
The current erasure coding implementation uses replication factor field to store erasure coding policy.
Distcp copies the source file's replication factor to the destination if -pr is specified. However, if the source file is EC, the replication factor (which is EC policy) should not be replicated to the destination file. When a HdfsFileStatus is converted to FileStatus, the replication factor is set to 0 if it's an EC file.
In fact, I will attach a test case that shows trying to replicate the replication factor of an EC file results in an IOException: "Requested replication factor of 0 is less than the required minimum of 1 for /tmp/dst/dest2"
Attachments
Attachments
Issue Links
- is blocked by
-
HADOOP-13715 Add isErasureCoded() API to FileStatus class
- Resolved
- is related to
-
HADOOP-14254 Add a Distcp option to preserve Erasure Coding attributes
- Resolved