Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
-
Reviewed
Description
This would suggest and refactor ErasureCodingWorker into smaller constructs to be reused in other places like block group checksum computing in datanode side. As discussed in HDFS-8430 and implemented in HDFS-9694 patch, checksum computing for striped block groups would be distributed to datanode in the group, where data block data should be able to be reconstructed when missed/corrupted to recompute the block checksum. The most needed codes are in the current ErasureCodingWorker and could be reused in order to avoid duplication. Fortunately, we have very good and complete tests, which would make the refactoring much easier. The refactoring will also help a lot for subsequent tasks in phase II for non-striping erasure coded files and blocks.