Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
Many existing names should be revised to avoid confusion when blocks can be both replicated and erasure coded. This JIRA aims to solicit opinions on making those names more consistent and intuitive.
- In current HDFS block recovery refers to the process of finalizing the last block of a file, triggered by lease recovery. It is different from the intuitive meaning of recovering a lost block. To avoid confusion, I can think of 2 options:
- Rename this process as block finalization or block completion. I prefer this option because this is literally not a recovery.
- If we want to keep existing terms unchanged we can name all EC recovery and re-replication logics as reconstruction.
- As Kai suggested under
HDFS-7369, several replication-based names should be made more generic:- UnderReplicatedBlocks and neededReplications. E.g. we can use LowRedundancyBlocks/AtRiskBlocks, and neededRecovery/neededReconstruction.
- PendingReplicationBlocks
- ReplicationMonitor
I'm sure the above list is incomplete; discussions and comments are very welcome.