Details
Description
Block recovery will fail indefinitely if the time to recover a block is always longer than the heartbeat interval. Scenario:
1. DN sends heartbeat
2. NN sends a recovery command to DN, recoveryID=X
3. DN starts recovery
4. DN sends another heartbeat
5. NN sends a recovery command to DN, recoveryID=X+1
6. After the first recovery succeeds, DN calls commitBlockSynchronization on NN with recoveryID=X, which fails because X < X+1
...
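
The scenario above can be reproduced with a toy model. This is only a sketch, not the actual NameNode or DataNode code: `ToyNameNode`, `simulate`, and the millisecond values are hypothetical names and numbers chosen for illustration. It assumes the NN issues a new, larger recovery ID on every heartbeat while the block is under recovery, and accepts a commit only if it carries the latest ID.

```java
import java.util.ArrayList;
import java.util.List;

public class RecoveryRaceSketch {

    /** Hypothetical stand-in for the NN: tracks only the latest recovery ID issued. */
    static class ToyNameNode {
        private long latestRecoveryId = 0;

        /** Assumption: every heartbeat during recovery hands out a fresh, larger ID. */
        long heartbeat() {
            return ++latestRecoveryId;
        }

        /** Assumption: a commit is accepted only if it carries the latest ID. */
        boolean commitBlockSynchronization(long recoveryId) {
            return recoveryId == latestRecoveryId;
        }
    }

    /**
     * Runs `attempts` recovery attempts one after another and returns whether
     * each commit was accepted. Times are in milliseconds.
     */
    static List<Boolean> simulate(int heartbeatIntervalMs, int recoveryTimeMs, int attempts) {
        ToyNameNode nn = new ToyNameNode();
        List<Boolean> results = new ArrayList<>();
        for (int i = 0; i < attempts; i++) {
            // Steps 1-3: heartbeat, receive recoveryID=X, start recovery.
            long recoveryId = nn.heartbeat();
            // Steps 4-5: heartbeats that fire before recovery finishes bump the ID.
            for (int t = heartbeatIntervalMs; t < recoveryTimeMs; t += heartbeatIntervalMs) {
                nn.heartbeat();
            }
            // Step 6: commit with the ID the recovery was started with.
            results.add(nn.commitBlockSynchronization(recoveryId));
        }
        return results;
    }

    public static void main(String[] args) {
        // Recovery slower than the heartbeat interval: every commit is stale.
        System.out.println(simulate(3000, 10000, 3));
        // Recovery faster than the heartbeat interval: commits are accepted.
        System.out.println(simulate(3000, 1000, 3));
    }
}
```

In this model no number of retries helps while the recovery time stays above the heartbeat interval, because every attempt is invalidated by the next heartbeat before it can commit.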
Attachments
Issue Links
- is related to
  - HDFS-15787 Remove unnecessary Lease Renew in FSNamesystem#internalReleaseLease (Resolved)
- relates to
  - HDFS-5012 replica.getGenerationStamp() may be >= recoveryId (Resolved)