Details
- Bug
- Status: Closed
- Major
- Resolution: Duplicate
- 0.1.0
- None
- None
- None
Description
A random number generator is used to allocate block ids in DFS. Occasionally it produces a block id that is already in use in the filesystem, and allocating that id a second time corrupts the filesystem.
A short-term fix is to check, at allocation time, whether any file already uses the newly generated id and, if so, to generate another one. Collisions can still occur under some rare conditions, but those are harder to fix and will wait, since this simple check handles the vast majority of collisions.
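The check-and-retry idea described above could look roughly like the following. This is a minimal sketch, not the actual DFS code: the class `BlockIdAllocator`, its methods, and the in-memory `idsInUse` set are hypothetical stand-ins for whatever structure records which block ids files currently reference.

```java
import java.util.HashSet;
import java.util.Random;
import java.util.Set;

/**
 * Hypothetical sketch of collision-checked block id allocation.
 * Names and structure are assumptions made for illustration only.
 */
class BlockIdAllocator {
    private final Random random;

    // Assumed stand-in for the filesystem's record of ids that files
    // currently reference.
    private final Set<Long> idsInUse = new HashSet<>();

    BlockIdAllocator(Random random) {
        this.random = random;
    }

    /** Draws random ids until one is found that no file is using. */
    synchronized long allocate() {
        long id = random.nextLong();
        while (idsInUse.contains(id)) {
            id = random.nextLong(); // collision: draw again
        }
        idsInUse.add(id);
        return id;
    }

    /** Forgets an id, for example when the owning file is deleted. */
    synchronized void release(long id) {
        idsInUse.remove(id);
    }

    synchronized boolean isInUse(long id) {
        return idsInUse.contains(id);
    }
}
```

The check only sees ids that the allocator currently knows to be in use. An id that was released but still exists somewhere the allocator cannot see would pass the check, which is one way the rarer collisions could survive this fix.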
Issue Links
- duplicates
  - HADOOP-146 potential conflict in block id's, leading to data corruption (Closed)
- is duplicated by
  - HADOOP-1497 Possibility of duplicate blockids if dead-datanodes come back up after corresponding files were deleted (Closed)