Hadoop HDFS
HDFS-988

saveNamespace race can corrupt the edits log

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 0.20-append, 0.21.0, 0.22.0
    • Fix Version/s: 0.20-append, 0.20.205.0, 0.22.0
    • Component/s: namenode
    • Labels:
      None
    • Hadoop Flags:
      Reviewed
    • Tags:
      hbase

      Description

      The administrator puts the namenode in safemode and then issues the saveNamespace command. This can corrupt the edits log. The problem is that when the NN enters safemode, there could still be pending logSyncs occurring from other threads. Now, the saveNamespace command, when executed, would save an edits log with partial writes. I have seen this happen on 0.20.

      https://issues.apache.org/jira/browse/HDFS-909?focusedCommentId=12828853&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12828853
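
      As an illustration only (not the committed fix), the hazard is that edit-log writers and saveNamespace are not serialized. The sketch below is a hypothetical simplification in plain Java; the class, field, and method names are illustrative and are not the actual NameNode code.

      import java.util.concurrent.locks.ReentrantReadWriteLock;

      // Hypothetical sketch: serialize saveNamespace() against in-flight edit writers.
      class SaveNamespaceSketch {
        private final ReentrantReadWriteLock fsLock = new ReentrantReadWriteLock(true);

        void logEditAndSync() {
          fsLock.readLock().lock();      // handler threads may write edits concurrently
          try {
            // logEdit(...); logSync();  // write and flush the transaction
          } finally {
            fsLock.readLock().unlock();
          }
        }

        void saveNamespace() {
          fsLock.writeLock().lock();     // excludes concurrent edit writers
          try {
            // roll the edits file and write the image only when no write is in flight
          } finally {
            fsLock.writeLock().unlock();
          }
        }
      }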

      Attachments

      1. 988-fixups.txt
        6 kB
        Todd Lipcon
      2. HDFS-988_fix_synchs.patch
        3 kB
        Matt Foley
      3. HDFS-988.20-security.patch
        10 kB
        Jitendra Nath Pandey
      4. hdfs-988.txt
        25 kB
        Todd Lipcon
      5. hdfs-988-2.patch
        34 kB
        Eli Collins
      6. hdfs-988-3.patch
        36 kB
        Eli Collins
      7. hdfs-988-4.patch
        50 kB
        Eli Collins
      8. hdfs-988-5.patch
        141 kB
        Eli Collins
      9. hdfs-988-6.patch
        177 kB
        Eli Collins
      10. hdfs-988-7.patch
        183 kB
        Eli Collins
      11. hdfs-988-b22-1.patch
        20 kB
        Eli Collins
      12. saveNamespace_20-append.patch
        9 kB
        Nicolas Spiegelberg
      13. saveNamespace.txt
        9 kB
        dhruba borthakur

        Issue Links

          Activity

          Matt Foley added a comment -

          Closed upon release of 0.20.205.0

          Suresh Srinivas added a comment -

          +1 for the patch.

          Suresh Srinivas added a comment -

          I committed the patch to 0.20-security branch.

          Suresh Srinivas added a comment -

          +1 for the 20-security patch.

          Jitendra Nath Pandey added a comment -

          Uploaded patch for 20-security branch.

          Tsz Wo Nicholas Sze added a comment -

          Hi Eli and Todd, it seems that this issue has introduced a deadlock; see HDFS-2229.

          Hudson added a comment -

          Integrated in Hadoop-Hdfs-trunk-Commit #746 (See https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/746/)

          Eli Collins added a comment -

          Marking issue as fixed; we now have patches checked into branch-20-append, branch-22 and trunk.

          Eli Collins added a comment -

          The jcarder run of the whole test suite went cleanly. I've committed this to trunk. Thanks for the reviews Todd!

          Hudson added a comment -

          Integrated in Hadoop-Hdfs-22-branch #66 (See https://builds.apache.org/job/Hadoop-Hdfs-22-branch/66/)
          HDFS-988. saveNamespace race can corrupt the edits log. Contributed by Eli Collins

          eli : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1134860
          Files :

          • /hadoop/hdfs/branches/branch-0.22/src/java/org/apache/hadoop/hdfs/server/namenode/FSDirectory.java
          • /hadoop/hdfs/branches/branch-0.22/src/test/hdfs/org/apache/hadoop/hdfs/server/namenode/TestSafeMode.java
          • /hadoop/hdfs/branches/branch-0.22/src/java/org/apache/hadoop/hdfs/server/namenode/FSNamesystem.java
          • /hadoop/hdfs/branches/branch-0.22/src/test/hdfs/org/apache/hadoop/hdfs/TestSafeMode.java
          • /hadoop/hdfs/branches/branch-0.22/CHANGES.txt
          • /hadoop/hdfs/branches/branch-0.22/src/test/hdfs/org/apache/hadoop/hdfs/DFSTestUtil.java
          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12482272/hdfs-988-7.patch
          against trunk revision 1134869.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 36 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these core unit tests:
          org.apache.hadoop.cli.TestHDFSCLI

          +1 contrib tests. The patch passed contrib unit tests.

          +1 system test framework. The patch passed system test framework compile.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/775//testReport/
          Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/775//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/775//console

          This message is automatically generated.

          Eli Collins added a comment -

          Thanks Todd. I've committed the patch to branch-22.

          The attached patch for trunk is like the last one; it just needed some minor rebasing. I'll do a full run of the test suite with the lockless branch of jcarder before committing.

          Todd Lipcon added a comment -

          +1 on the trunk patch, once you've run the full test suite through jcarder (with the "lockclasses" branch that detects rwlock issues). Also looks like it needs a rebase

          Todd Lipcon added a comment -

          +1 on the 0.22 patch

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12481892/hdfs-988-6.patch
          against trunk revision 1133476.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 33 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these core unit tests:
          org.apache.hadoop.cli.TestHDFSCLI
          org.apache.hadoop.hdfs.TestHDFSTrash

          +1 contrib tests. The patch passed contrib unit tests.

          +1 system test framework. The patch passed system test framework compile.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/747//testReport/
          Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/747//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/747//console

          This message is automatically generated.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12481892/hdfs-988-6.patch
          against trunk revision 1133476.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 33 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these core unit tests:
          org.apache.hadoop.cli.TestHDFSCLI

          +1 contrib tests. The patch passed contrib unit tests.

          +1 system test framework. The patch passed system test framework compile.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/746//testReport/
          Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/746//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/746//console

          This message is automatically generated.

          Eli Collins added a comment -

          Btw test-patch is +1 for hdfs-988-b22-1.patch on branch 22 and all the tests pass. I think it's good to go, mind reviewing it? Its scope is limited to just the issue in the description (vs the latest patch for trunk).

          Eli Collins added a comment -

          Thanks for the thorough review Todd!

          computeDatanodeWork calls blockManager.computeReplicationWork and blockManager.computeInvalidateWork...

          Agree. Added a comment in computeDatanodeWork with the rationale.

          In heartbeatCheck, I think we can simply put another "if (isInSafeMode()) return" in right after it takes the writeLock if it finds a dead node...

          Agree, added the additional check, and a comment above the first lock acquisition.

          isLockedReadOrWrite should be checking this.fsLock.getReadHoldCount() rather than getReadLockCount()

          Fixed.
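
          For reference, both methods exist on java.util.concurrent.locks.ReentrantReadWriteLock: getReadLockCount() counts read holds across all threads, while getReadHoldCount() counts only the calling thread's holds, which is what a "does this thread hold the lock" check needs. A minimal sketch (the real FSNamesystem method may differ):

          boolean isLockedReadOrWrite(ReentrantReadWriteLock fsLock) {
            return fsLock.getReadHoldCount() > 0            // current thread holds the read lock
                || fsLock.isWriteLockedByCurrentThread();   // or the write lock
          }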

          FSDirectory#bLock says it protects the block map, but it also protects the directory, right? we should update the comment and perhaps the name.

          Correct. I renamed it dirLock to be consistent with FSNameSystem and to reflect that it protects the entire directory, not just the block map. Updated the comment too.

          various functions don't take the read lock because they call functions in FSDirectory that take FSDirectory.bLock. This seems incorrect..

          I think the FSN methods do need to get the read lock before calling into FSD methods that get the read lock. In the latest patch I added this to the places where it was missing (getFileInfo, getContentSummary, getPreferredBlockSize, etc); hopefully the new asserts make it clear that we have the FSN lock when calling into FSD (where necessary). For HADOOP-7362 we should assert the lock hierarchy is obeyed.

          handleHeartbeat calls getDatanode() while only holding locks on heartbeats and datanodeMap, but registerDatanode mutates datanodeMap without locking either.

          I modified handleHeartbeat to take the read lock so it's synchronized with registerDatanode. I also added a missing synchronization on datanodeMap to wipeDatanode. I think the locking in FSN (eg the way heartbeats and datanodeMap are locked) needs to be reconsidered in light of this change. I'll file a jira for that; it's probably out of scope for this jira.

          getDataNodeInfo seems like an unused function with no locking - can we remove it?

          Yes. Done.

          several other places access datanodeMap with synchronization on that object itself. unprotectedAddDatanode should assert it holds that monitor lock

          Made unprotectedAddDatanode synchronize on datanodeMap; per the above, I think we need to revisit how this data structure is accessed.

          when loading the edit log, why doesn't loadFSEdits take a write lock on the namesystem before it starts? then we could add all of the asserts and not worry about it.

          Done. It now takes the namesystem and dir locks, and I modified all the unprotected methods to assert the lock is held.

          it looks like saving the image no longer works, since saveFilesUnderConstruction now takes the readLock, but it's being called by a different thread than took the write lock in saveNamespace.

          Yup, TestEditLogRace quickly found this deadlock. I removed the new locking from saveFilesUnderConstruction.

          When create() is called with the overwrite flag true, that calls delete() which will logSync() while holding the lock. We can hold off on fixing it since it's a performance problem, not correctness, and the operation is fairly rare.

          Filed HDFS-2052.

          getAdditionalBlock doesn't logSync() - I think there's another issue pending about that since it will affect HA. Let's address later.

          Since getAdditionalBlock doesn't do anything that writes to the log the sync is not currently needed. I updated HDFS-978 to indicate it will be needed when block allocations are logged.

          checkFileProgress doesn't really need the write lock

          Yup, modified it to use a read-lock.

          seems like saveNamespace could safely just take the read lock to allow other readers to keep working

          Yup, modified to take the read lock.

          Fixed the nits.

          The tests should be passing; I'm going to do some testing with jcarder enabled. I've also run TestNNThroughputBenchmark; the numbers are noisy, mostly better than w/o the patch (not sure I believe that).

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12481197/988-fixups.txt
          against trunk revision 1130381.

          +1 @author. The patch does not contain any @author tags.

          -1 tests included. The patch doesn't appear to include any new or modified tests.
          Please justify why no new tests are needed for this patch.
          Also please list what manual steps were performed to verify this patch.

          -1 patch. The patch command could not apply the patch.

          Console output: https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/684//console

          This message is automatically generated.

          Todd Lipcon added a comment -

          Attaching a few fixups on top of hdfs-988-5.patch, related to the below comments:

          Regarding the question about computeDatanodeWork/heartbeatCheck:

          computeDatanodeWork calls blockManager.computeReplicationWork and blockManager.computeInvalidateWork. In the case of computeReplicationWork, it might schedule some replications. This seems OK - worst case we get some extra replicas which will get fixed up later. In the case of computeInvalidateWork, it calls invalidateWorkForOneNode which takes the write lock and then checks safe mode before scheduling any deletions.

          In heartbeatCheck, I think we can simply put another "if (isInSafeMode()) return" in right after it takes the writeLock if it finds a dead node. That way if it races, it still doesn't take any actions based on it. Either way, I don't think this could corrupt anything since it won't write to the edit log.
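
          (For illustration, the re-check described above would look roughly like the following; names are simplified and this is not the actual patch.)

          void heartbeatCheck() {
            if (isInSafeMode()) return;      // cheap pre-check without the write lock
            // ... scan heartbeats and find a dead datanode ...
            writeLock();
            try {
              if (isInSafeMode()) return;    // re-check after taking the lock, in case we raced
              // remove the dead node and schedule any follow-up work
            } finally {
              writeUnlock();
            }
          }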

          Some other notes:

          • isLockedReadOrWrite should be checking this.fsLock.getReadHoldCount() rather than getReadLockCount()
          • FSDirectory#bLock says it protects the block map, but it also protects the directory, right? we should update the comment and perhaps the name.
          • various functions don't take the read lock because they call functions in FSDirectory that take FSDirectory.bLock. This seems incorrect, since, for example, getListing() racing against open() with overwrite=true could return the directory with the file deleted but the new one not there yet. I guess what's confusing me is that it's not clear why some functions don't need readLock when they perform read operations. When is just the FSDirectory lock sufficient? It looks like a lot of the test failures above are due to this.
          • handleHeartbeat calls getDatanode() while only holding locks on heartbeats and datanodeMap, but registerDatanode mutates datanodeMap without locking either.
          • getDataNodeInfo seems like an unused function with no locking - can we remove it?
          • several other places access datanodeMap with synchronization on that object itself. unprotectedAddDatanode should assert it holds that monitor lock
          • when loading the edit log, why doesn't loadFSEdits take a write lock on the namesystem before it starts? then we could add all of the asserts and not worry about it.
          • it looks like saving the image no longer works, since saveFilesUnderConstruction now takes the readLock, but it's being called by a different thread than took the write lock in saveNamespace. So, it deadlocks. At first I thought this could be solved by just making saveNamespace take a read lock instead of write lock, but that actually doesn't work due to fairness – what can happen is that saveNamespace takes readLock, then some other thread comes along and queues up for the write lock. At that point, no further readers are allowed to take the read lock, because it's a fair lock. So, the image-writer thread locks up.
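
          (A minimal illustration of the fairness hazard in the last note above, using a fair java.util.concurrent ReentrantReadWriteLock; this is generic JDK behavior, not NameNode code.)

          ReentrantReadWriteLock lock = new ReentrantReadWriteLock(true); // fair mode
          // Thread A (saveNamespace):   lock.readLock().lock();   // acquired
          // Thread B (any writer):      lock.writeLock().lock();  // queues behind A
          // Thread C (image writer):    lock.readLock().lock();   // blocks: in fair mode new
          //   readers wait behind the queued writer, but A is waiting for C to finish writing
          //   the image, so no thread can make progress.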

          Optimizations to address later:

          • When create() is called with the overwrite flag true, that calls delete() which will logSync() while holding the lock. We can hold off on fixing it since it's a performance problem, not correctness, and the operation is fairly rare.
          • getAdditionalBlock doesn't logSync() - I think there's another issue pending about that since it will affect HA. Let's address later.
          • checkFileProgress doesn't really need the write lock
          • seems like saveNamespace could safely just take the read lock to allow other readers to keep working

          Nits:

          • Typo: "Cnnot concat"
          • rollEditLog has comment saying "Checkpoint not created"
          • rollFSImage has the same issue, but at least has to do with checkpoints, so could be correct
          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12481173/hdfs-988-5.patch
          against trunk revision 1130339.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 18 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these core unit tests:
          org.apache.hadoop.cli.TestHDFSCLI
          org.apache.hadoop.hdfs.server.namenode.TestCheckPointForSecurityTokens
          org.apache.hadoop.hdfs.server.namenode.TestCheckpoint
          org.apache.hadoop.hdfs.server.namenode.TestEditLogRace
          org.apache.hadoop.hdfs.server.namenode.TestParallelImageWrite
          org.apache.hadoop.hdfs.server.namenode.TestSaveNamespace
          org.apache.hadoop.hdfs.server.namenode.TestStartup
          org.apache.hadoop.hdfs.TestDFSFinalize
          org.apache.hadoop.hdfs.TestDFSRollback
          org.apache.hadoop.hdfs.TestDFSStartupVersions
          org.apache.hadoop.hdfs.TestDFSStorageStateRecovery
          org.apache.hadoop.hdfs.TestDFSUpgradeFromImage
          org.apache.hadoop.hdfs.TestDFSUpgrade
          org.apache.hadoop.hdfs.TestListFilesInDFS
          org.apache.hadoop.hdfs.TestListFilesInFileContext
          org.apache.hadoop.hdfs.tools.offlineImageViewer.TestOfflineImageViewer

          +1 contrib tests. The patch passed contrib unit tests.

          +1 system test framework. The patch passed system test framework compile.

          Test results: https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/679//testReport/
          Findbugs warnings: https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/679//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Console output: https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/679//console

          This message is automatically generated.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12481191/hdfs-988-b22-1.patch
          against trunk revision 1130381.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 15 new or modified tests.

          -1 patch. The patch command could not apply the patch.

          Console output: https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/683//console

          This message is automatically generated.

          Eli Collins added a comment -

          Minimal patch for branch 22 with tests attached.

          Eli Collins added a comment -

          Thanks for taking a look Todd. Updated patch attached.

          checks for if (auditLog.isInfoEnabled()) should probably now be (auditLog.isInfoEnabled() && isExternalInvocation()) – otherwise we're doing a needless directory traversal for fsck

          Fixed.

          The following methods currently do logSync() while holding the writeLock, which is expensive:

          Fixed. (Only one needed to conditionally call logSync)
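
          (The pattern being applied here is roughly the following; the names are illustrative, not the exact patch.)

          void setPermissionLike(String src, String permission) {
            boolean logged = false;
            writeLock();
            try {
              // check safemode, mutate the namespace, editLog.logEdit(...)
              logged = true;
            } finally {
              writeUnlock();
            }
            if (logged) {
              editLog.logSync();   // the expensive disk force happens outside the write lock
            }
          }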

          seems strange that some of the xInternal() methods take the write lock themselves (eg setReplicationInternal) whereas others assume the caller takes the write lock (eg createSymlinkInternal). We should be consistent.

          Latest patch makes them more consistent; I also refactored out a couple new xInternal methods. In a couple places (eg deleteInternal and getListing) I didn't hoist up the locking because it would make the locking too coarse-grained (eg would result in syncing the log w/ the lock held).

          for those methods that don't explicitly take the write lock, we should either add an assert hasWriteLock() or a comment explaining why the lock is not necessary (eg internalReleaseLease, reassignLease, finalizeINodeFileUnderConstruction)

          Done. For FSDirectory I made the unprotectedX methods actually unprotected and moved the locking to the caller (except for FSEditLogLoader which calls the unprotected methods directly on purpose - I doubt this really saves us that much). These methods (per their name) are now intentionally unprotected.
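
          (The assert-instead-of-lock convention looks roughly like this; illustrative names only.)

          // Caller is responsible for holding the lock; the method only asserts it.
          void unprotectedSetTimesLike(String src, long mtime, long atime) {
            assert hasWriteLock() : "caller must hold the write lock";
            // ... mutate the inode without taking the lock again ...
          }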

          comment for endCheckpoint says "not started" but should say "not ended". same with updatePipeline.

          Both fixed.

          why doesn't getListing need the read lock?

          Because its callees (check*, getListing) take the lock.

          I noticed that nextGenerationStamp() doesn't logSync() – that seems dangerous, since after a restart we might hand out a duplicate genstamp.

          Good catch. I made sure all callers sync the log (this was only missing from the updateBlockForPipeline path). nextGenerationStamp is always called with the lock held, so I asserted that and removed the lock acquisition from this method.

          Eli Collins added a comment -

          ELOS#flush (EditLogOutputStream) calls ELFOS#flushAndSync (EditLogFileOutputStream), which does a force on the underlying file channel.

          Bharath Mundlapudi added a comment -

          I am just wondering if we are calling OS sync at all on this code path. All I see is a flush call, which flushes from EditLogOutputStream (Java buffers) to kernel buffers.

          Shouldn't we be doing the following?

          eStream.flush();
          eStream.getFileOutputStream().getFD().sync();

          This will make sure the edits are actually written to disk. Is there any reason for not doing this?
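
          (For context, a generic sketch of the distinction being discussed, in plain java.io/java.nio rather than the EditLogFileOutputStream code: flush() only moves data from user-space buffers to the OS, while FileDescriptor.sync() or FileChannel.force() asks the OS to push it to the device. pendingEdits() is a hypothetical helper.)

          byte[] edits = pendingEdits();                       // hypothetical helper
          FileOutputStream fos = new FileOutputStream("edits");
          fos.write(edits);
          fos.flush();                   // user-space buffers -> kernel buffers only
          fos.getFD().sync();            // kernel buffers -> disk, as suggested above
          // or, equivalently via NIO (what flushAndSync reportedly uses):
          fos.getChannel().force(true);  // force contents and metadata to the device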

          Eli Collins added a comment -

          It looks like most of the unprotected* methods take the rwlock, but don't need to, either because their caller takes the lock or because they are called from loading the edit log (which is why we originally had unprotected versions). Do people mind if I fix that up (remove the locking from these methods, make sure the unprotected versions are only called when loading the log) in this change, or do people want that done in a separate change?

          dhruba borthakur added a comment -

          we should not logSync after a nextGenerationStamp() because it is invoked for every file creation, isn't it? And before the file-create call returns from the NN, we do invoke logSync(), so we should be safe.

          The other time we invoke nextGenerationStamp() is for pipeline error recovery, and that code path should also be ok, let me think.

          Todd Lipcon added a comment -

          Didn't go through the new tests yet, but here are some comments from a first pass through FSN:

          • checks for if (auditLog.isInfoEnabled()) should probably now be (auditLog.isInfoEnabled() && isExternalInvocation()) – otherwise we're doing a needless directory traversal for fsck
          • The following methods currently do logSync() while holding the writeLock, which is expensive:
            • setPermission
            • setOwner
            • commitBlockSynchronization (in some exit paths)
            • updatePipeline
          • These methods should probably just set a local boolean within the synchronized section, then logSync() in the finally clause if it's flagged
          • seems strange that some of the xInternal() methods take the write lock themselves (eg setReplicationInternal) whereas others assume the caller takes the write lock (eg createSymlinkInternal). We should be consistent
          • for those methods that don't explicitly take the write lock, we should either add an assert hasWriteLock() or a comment explaining why the lock is not necessary (eg internalReleaseLease, reassignLease, finalizeINodeFileUnderConstruction)
          • why doesn't getListing need the read lock?
          • comment for endCheckpoint says "not started" but should say "not ended"
          • same with updatePipeline
          • I noticed that nextGenerationStamp() doesn't logSync() – that seems dangerous, since after a restart we might hand out a duplicate genstamp.
          Eli Collins added a comment -

          Updated patch attached.

          • Added TestSafeMode#testOperationsWhileInSafeMode
          • Removed duplicate copy of TestSafeMode#testManualSafeMode
          • Modified FSNamesystem#getNamespaceInfo to take the lock for reading instead of writing (don't see why the original patch took it for writing as it doesn't mutate any state).
          • TestEditLogRace#testSaveNamespace actually covers the scenario addressed by this patch. I modified it to call setQuota instead of mkdirs/delete and w/o this patch it asserts/NPEs, with this patch it loops w/o error. We could modify testSaveNamespace to run with all the fs operations though I'm not sure the additional coverage would be worth the additional execution time (perhaps it could randomly select from a pre-canned set of test methods or fuzz the API).

          Do people concur with my previous comment about computeDatanodeWork and heartbeatCheck?

          Eli Collins added a comment -

          Updated patch attached. Merges Matt's HDFS-988_fix_synchs.patch with my earlier hdfs-988-2.patch (based on Todd's patch).

          • No longer throw a SafeModeException in setTimes
          • Uses rw lock in getNamespaceInfo, setQuota, renewLease, nextGenerationStamp.
          • Use the same technique (get a strong ref to this.safeMode) from isInSafeMode, isInStartupSafeMode, and isPopulatingReplQueues in checkSafeMode.
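
          (The 'strong ref' technique amounts to reading the field once into a local, so a concurrent transition out of safemode cannot null it between the check and the use; a minimal sketch with simplified names:)

          private volatile SafeModeInfo safeMode;

          boolean isInSafeMode() {
            SafeModeInfo sm = this.safeMode;   // single read of the (possibly volatile) field
            return sm != null && sm.isOn();    // no second read that could observe null
          }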

          Also, I noticed computeDatanodeWork and heartbeatCheck check isInSafeMode w/o holding the lock; that's not kosher as well, right? Not great to hold the lock here since both of these methods can be time-intensive, but it seems racy not to.

          I'm working on a test.

          Todd Lipcon added a comment -

          removing patch available status since this still needs to be finished up.

          Matt Foley added a comment -

          Out of the seven test failures, the only one that might have to do with this patch is

          • org.apache.hadoop.hdfs.TestInjectionForSimulatedStorage.testInjection
            But I think it's unlikely.

          In case it wasn't clear, I'm offering this patch file as a possibly useful portion of the solution for this bug, not as a solution in its own right. Feel free to incorporate all or parts of it. Or not.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12479629/HDFS-988_fix_synchs.patch
          against trunk revision 1124364.

          +1 @author. The patch does not contain any @author tags.

          -1 tests included. The patch doesn't appear to include any new or modified tests.
          Please justify why no new tests are needed for this patch.
          Also please list what manual steps were performed to verify this patch.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these core unit tests:
          org.apache.hadoop.cli.TestHDFSCLI
          org.apache.hadoop.hdfs.server.datanode.TestBlockRecovery
          org.apache.hadoop.hdfs.TestDFSStorageStateRecovery
          org.apache.hadoop.hdfs.TestFileConcurrentReader
          org.apache.hadoop.hdfs.TestInjectionForSimulatedStorage
          org.apache.hadoop.tools.TestJMXGet

          +1 contrib tests. The patch passed contrib unit tests.

          +1 system test framework. The patch passed system test framework compile.

          Test results: https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/559//testReport/
          Findbugs warnings: https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/559//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Console output: https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/559//console

          This message is automatically generated.

          Matt Foley added a comment -

Here is an extract from some other work I have been doing that addresses the sync issue for calls into SafeMode methods. It avoids taking the r/w lock for fast read-only operations, which it seems to me can be made safe with a lighter-weight mechanism (a sketch of such a check follows at the end of this comment).

This patch does not address the need Todd observed to add the r/w lock to:

          • getNamespaceInfo
          • setQuota
          • renewLease
          • nextGenerationStamp
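For illustration only, a minimal sketch of the kind of lighter-weight check described above, assuming a simple atomic flag rather than the actual SafeModeInfo plumbing (all names below are placeholders, not the patch's code):

  import java.util.concurrent.atomic.AtomicBoolean;

  /** Illustrative only: a lock-free safemode flag that fast read-only
   *  operations could consult without taking the namesystem r/w lock. */
  class SafeModeFlag {
    private final AtomicBoolean on = new AtomicBoolean(false);

    /** Cheap check usable from read-only queries such as isInSafeMode(). */
    boolean isInSafeMode() {
      return on.get();
    }

    /** Transitions would still run under the heavier namesystem lock,
     *  so they stay atomic with respect to in-flight edits. */
    void enter() { on.set(true); }
    void leave() { on.set(false); }
  }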
          dhruba borthakur added a comment -

          yes, they should all be using RWLock, absolutely.

          Todd Lipcon added a comment -

Looking at trunk I see a lot of other strange synchronization issues. The following methods are all synchronized on the FSNamesystem instance:

          • getNamespaceInfo
          • setQuota
          • renewLease
          • isInSafeMode
          • isInStartupSafeMode
          • isPopulatingReplQueues
          • nextGenerationStamp

          I think all of these should probably be using the new rwlock... Dhruba, what do you think?

          Maybe we need something more like a stress/fuzz test against FSNamesystem rather than trying to target the specific cases mentioned above?
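For reference, a generic sketch of what such a conversion looks like using java.util.concurrent's ReentrantReadWriteLock; the class, fields, and method bodies below are placeholders rather than the actual FSNamesystem code:

  import java.util.concurrent.locks.ReentrantReadWriteLock;

  // Placeholder class: shows the synchronized -> rwlock conversion pattern.
  class NamesystemSketch {
    private final ReentrantReadWriteLock fsLock = new ReentrantReadWriteLock();
    private boolean safeModeOn;
    private long quota;

    // Read-only query: takes the shared read lock instead of the monitor.
    boolean isInSafeMode() {
      fsLock.readLock().lock();
      try {
        return safeModeOn;
      } finally {
        fsLock.readLock().unlock();
      }
    }

    // Mutating operation: takes the exclusive write lock.
    void setQuota(long newQuota) {
      fsLock.writeLock().lock();
      try {
        quota = newQuota;   // the real method would also log an edit here
      } finally {
        fsLock.writeLock().unlock();
      }
    }
  }

Queries like isInSafeMode then scale with concurrent readers, while mutating operations still serialize behind the write lock.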

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12478366/hdfs-988-2.patch
          against trunk revision 1100054.

          +1 @author. The patch does not contain any @author tags.

          -1 tests included. The patch doesn't appear to include any new or modified tests.
          Please justify why no new tests are needed for this patch.
          Also please list what manual steps were performed to verify this patch.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these core unit tests:
          org.apache.hadoop.cli.TestHDFSCLI
          org.apache.hadoop.hdfs.TestDFSStorageStateRecovery
          org.apache.hadoop.hdfs.TestFileConcurrentReader

          +1 contrib tests. The patch passed contrib unit tests.

          +1 system test framework. The patch passed system test framework compile.

          Test results: https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/462//testReport/
          Findbugs warnings: https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/462//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Console output: https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/462//console

          This message is automatically generated.

          dhruba borthakur added a comment -

1. We should not throw an exception if we are unable to update the access time because the NN is in safemode. That would prevent an administrator from running read-only operations (dfs -cat) on a cluster that is just starting up and is still in safemode. It should just log a warning message and continue.

2. Unit test: Create a cluster, open a file for write, and put the cluster in safemode. Verify that getAdditionalBlock, close, commitBlockSync, etc. fail when the NN is in safemode.
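A rough sketch of such a test, assuming the 0.20-era MiniDFSCluster constructor and the DistributedFileSystem safemode API (class names, the exact exception type, and which operations are rejected should be verified against the branch the patch targets):

  import java.io.IOException;
  import org.apache.hadoop.conf.Configuration;
  import org.apache.hadoop.fs.FSDataOutputStream;
  import org.apache.hadoop.fs.Path;
  import org.apache.hadoop.hdfs.DistributedFileSystem;
  import org.apache.hadoop.hdfs.MiniDFSCluster;
  import org.apache.hadoop.hdfs.protocol.FSConstants.SafeModeAction;

  public class TestSafeModeRejectsEdits extends junit.framework.TestCase {
    public void testCloseRejectedInSafeMode() throws IOException {
      Configuration conf = new Configuration();
      MiniDFSCluster cluster = new MiniDFSCluster(conf, 1, true, null);
      try {
        DistributedFileSystem fs =
            (DistributedFileSystem) cluster.getFileSystem();
        // Open a file for write, then put the NN into safemode.
        FSDataOutputStream out = fs.create(new Path("/safemode-test"));
        fs.setSafeMode(SafeModeAction.SAFEMODE_ENTER);
        try {
          out.write(1);
          out.close();   // completing the file needs an edit, so it should fail
          fail("close() should be rejected while the NN is in safemode");
        } catch (IOException expected) {
          // expected: SafeModeException, possibly wrapped in a RemoteException
        }
      } finally {
        cluster.shutdown();
      }
    }
  }

getAdditionalBlock could be exercised the same way by writing more than one block's worth of data before entering safemode.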

          Eli Collins added a comment -

Updated patch against trunk attached, as trunk has moved on a bit, e.g. it has introduced read/write locking.

          @Todd, would you mind reviewing this latest version?

          @Dhruba and co, any specific suggestions for a unit test?

          Nigel Daley added a comment -

          Todd, any update on this for 0.22?

          Nigel Daley added a comment -

This is committed to 0.20-append but needs a unit test for trunk.

          dhruba borthakur added a comment -

It is not an easily repeatable scenario: it is a race condition.

          Brahma Reddy Battula added a comment -

Hi dhruba borthakur,

I am using the 20.1 version. I put the NameNode in safemode and then executed saveNamespace, but the edit logs are not corrupted. Can you please give the exact scenario to reproduce this?

          dhruba borthakur added a comment -

Hi Todd, I would appreciate it if you could write some sort of unit test for this one; that will help get this into trunk.

          Todd Lipcon added a comment -

          Marking this as fixed for the append branch (it's committed there, but not resolved for trunk yet)

          Todd Lipcon added a comment -

Nicolas: by seqnum you're referring to the generation stamp on the block being written, as the client continues to initiate recovery during shutdown? Do we also need to pull in HDFS-1145 then? (I have this patch applied on my append branch, but not 1145, and have not seen errors; it just seems to make sense.)

          Nicolas Spiegelberg added a comment -

Version of the saveNamespace patch for the 0.20-append branch. This patch is needed for many of the unit tests to pass. Without it, as DNs die via shutdown(), the remaining DNs get a +1 seqnum, so only the last DN has the highest seqnum on restart, which messes up some tests. This actually causes a real-world problem if the last DN has corrupt data.

          Nicolas Spiegelberg added a comment -

This should be pulled into the branch-0.20-append branch. Besides being a stability fix, its absence causes append tests to fail.

          dhruba borthakur added a comment -

          hi todd, can we get a unit test for this one (as described in the earlier comment)? Thanks.

          dhruba borthakur added a comment -

Code looks excellent. Can we add a unit test that does the following:

• Create a cluster, open a file for write, and put the NN in safemode. Verify that getAdditionalBlock, close, etc. fail when the NN is in safemode.
          Todd Lipcon added a comment -

          dhruba, can you take a look at this blocker please?

          Todd Lipcon added a comment -

The failed test is unrelated (it has been failing all patch builds lately).

Can anyone suggest a unit test for this issue? I can't think of any to add since no functionality was changed; we just cleaned up some potential races.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12443047/hdfs-988.txt
          against trunk revision 938791.

          +1 @author. The patch does not contain any @author tags.

          -1 tests included. The patch doesn't appear to include any new or modified tests.
          Please justify why no new tests are needed for this patch.
          Also please list what manual steps were performed to verify this patch.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed core unit tests.

          -1 contrib tests. The patch failed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/331/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/331/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/331/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/331/console

          This message is automatically generated.

          Todd Lipcon added a comment -

Marking this as a blocker for the upcoming release. If anyone disagrees, please shout.

          Todd Lipcon added a comment -

          Attaching an updated patch for trunk. Additions:

          • commitBlockSynchronization no longer allowed in safemode. Konstantin, do you still prefer that we open a new issue for this? Dhruba seems to agree that it should be disallowed.
          • startCheckpoint, endCheckpoint, and updatePipeline also check safemode now
          • the new delegation token logging methods check safemode as well.

          Some questions for review:

          • Will logUpdateMasterKey be OK with the SafeModeException?
• Are there some asserts we could add to make it easier to catch these bugs in the future? For example, we could assert !namesystem.isInSafeMode() in FSEditLog.logSync(). Then, if we ran with assertions enabled in unit tests, we would probably notice if we were accidentally making edits while in safe mode (a sketch follows below).
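A sketch of what that assertion might look like; the surrounding class and field are stand-ins, not the actual FSEditLog code:

  // Stand-in for FSEditLog; the real class holds much more state.
  class EditLogSketch {
    interface Namesystem { boolean isInSafeMode(); }

    private final Namesystem namesystem;
    EditLogSketch(Namesystem ns) { this.namesystem = ns; }

    void logSync() {
      // With -ea enabled (as in unit tests), any thread that reaches
      // logSync while the NN is in safe mode trips this immediately.
      assert !namesystem.isInSafeMode()
          : "logSync called while the namenode is in safe mode";
      // ... flush buffered edits to the output streams here ...
    }
  }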

          Some responses to review above:

          FSNamesystem.getAdditionalBlock() checking isInSafeMode() should be before calling chooseTargets(). I would not change getAdditionalBlock() at all.

          Right now, getAdditionalBlock is split into two synchronized blocks. The safe mode status could switch between the two. Are you suggesting that we check safemode in both, or we combine the blocks into one? I assumed the intent was to avoid doing the potentially CPU-heavy chooseTarget work while synchronized.

renewLease() shouldn't be under the FSNamesystem lock? The LeaseManager has its own lock

          This is to prevent safemode from switching while calling renewLease. If we decided that renewing a lease under safemode is not allowed, then we need to synchronize here. Otherwise the check is prone to races.

          Regarding deadlock potential, I think we're safe since LeaseManager.Monitor synchronizes on FSNamesystem before synchronizing on the lease manager.
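A minimal illustration of that lock-ordering argument, with placeholder classes: as long as both paths acquire the FSNamesystem monitor before the lease-manager lock, neither can hold one lock while waiting for the other.

  // Placeholder class: both paths take the namesystem monitor first,
  // then the lease-manager lock, so the ordering is consistent.
  class NamesystemWithLeases {
    private final Object leaseManagerLock = new Object();

    // renewLease(): namesystem monitor, then lease-manager lock.
    synchronized void renewLease(String holder) {
      synchronized (leaseManagerLock) {
        // update the holder's lease expiry
      }
    }

    // LeaseManager.Monitor equivalent: same order, so no deadlock.
    void leaseMonitorTick() {
      synchronized (this) {
        synchronized (leaseManagerLock) {
          // expire stale leases
        }
      }
    }
  }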

          Your changes to permission methods incorporate HDFS-133.

          Resolved that one as dup, thanks.

          Todd Lipcon added a comment -

          Hey Dhruba, yep, I am planning on doing this this week so it gets in the release. Also, I agree re commitBlockSynchronization, though I think we should probably add a couple more tests in this area.

          dhruba borthakur added a comment -

Hi Todd, is it possible to get this one into the re-release of 0.21?

I do not think we should allow FSNamesystem.commitBlockSynchronization() in safe mode. Do you visualize a case where this could cause a problem? I feel safer when I can say that no new transactions can make it into the edits log while the namenode is in safemode.

          Todd Lipcon added a comment -

          Once HDFS-909 is committed I'll rebase this on trunk and take care of Konstantin's review comments

          Konstantin Shvachko added a comment -
          1. enterSafeMode() looks good.
          2. FSNamesystem.getAdditionalBlock() checking isInSafeMode() should be before calling chooseTargets(). I would not change getAdditionalBlock() at all.
3. I think it is fine to allow FSNamesystem.commitBlockSynchronization() in safe mode. In any case it probably deserves a separate issue and discussion.
4. renewLease() shouldn't be under the FSNamesystem lock? The LeaseManager has its own lock.
          5. Your changes to permission methods incorporate HDFS-133.
          6. Dhruba, are you going to promote changes to trunk and 0.21?
          Todd Lipcon added a comment -

          Hi Dhruba,

          I still think we should fix this in the other issues and then backport to 20. But I'll do a review of this patch here since you've already uploaded it:

          • in setPermission, the audit logging has moved outside the synchronized block. Thus dir.getFileInfo may actually return incorrect info (or even fail if it races with someone deleting the file)
          • same goes for setOwner
• I think it's OK, but can you verify that the top synchronized block in getAdditionalBlock can never have side effects? I don't know the lease management code well enough: is checkLease guaranteed to be side-effect free?
          dhruba borthakur added a comment -

This patch is for Hadoop 0.20. It fixes the race between saveNamespace and setting safemode.

          Todd Lipcon added a comment -

HDFS-909's fixes cover two issues. One of the two is in 20, and the other doesn't appear to be. But we may as well just have one JIRA number. The patch for 20 will just be a subset of the changes from trunk; no sense in diverging fix implementations.

          dhruba borthakur added a comment -

I agree that HDFS-956 could be fixed. But HDFS-909 does not apply to 0.20, whereas this one does, doesn't it?

          Todd Lipcon added a comment -

          Does this need to be a separate JIRA from HDFS-909? This code is covered by that patch.

          Also I think HDFS-956 needs to be fixed for this to truly catch all cases.

          dhruba borthakur added a comment -

I think the wait time is theoretically unbounded, but waiting for 5 minutes seems like a pretty foolproof idea to me.

          dhruba borthakur added a comment -

          My proposal is to make the enterSafeMode method wait for all pending transactions to get flushed.

  // In FSNamesystem:
  synchronized void enterSafeMode() throws IOException {
    if (!isInSafeMode()) {
      safeMode = new SafeModeInfo();
      return;
    }
    safeMode.setManual();
    getEditLog().logSyncAll();   // <======= new code here
    NameNode.stateChangeLog.info("STATE* Safe mode is ON. "
                                 + safeMode.getTurnOffTip());
  }

  // In FSEditLog: sync every transaction logged so far, so no pending
  // edit can be flushed after the namespace image is saved.
  synchronized void logSyncAll() throws IOException {
    TransactionId id = myTransactionId.get();
    id.txid = txid;    // advance this thread's sync target to the latest txid
    logSync();
  }
          Brian Bockelman added a comment -

          How long can logSyncs be pending for? Is this corruption still possible if the sysadmin waits, say, 5 minutes?


            People

            • Assignee:
              Eli Collins
              Reporter:
              dhruba borthakur
• Votes:
  0
  Watchers:
  20
