Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
0.23.3
-
None
-
None
-
distcp skips CRC on 0 byte files.
Description
DistcpV2 (hadoop-tools/hadoop-distcp/..) can fail from checksum failure, sometimes when copying a 0 byte file. Root cause of this may have to do with an inconsistent nature of HDFS when creating 0 byte files, however distcp can avoid this issue by not checking CRC when size is zero.
This issue was reported as part of HADOOP-8233, though it seems like a better idea to treat this particular aspect on it's own.