[HDFS-1981] When the namenode goes down while checkpointing and is then restarted, subsequent checkpointing always fails - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Blocker
Resolution: Fixed
Affects Version/s: 0.22.0
Fix Version/s: 0.22.0
Component/s: namenode
Labels: None
Environment: Linux
Hadoop Flags: Reviewed

Description

This scenario applies to both the NN and BNN cases.

When the namenode goes down after creating edits.new, divertFileStreams does not divert to edits.new on the subsequent restart, because the edits.new file is already present and its size is zero.

As a result, an exception occurs on trying to saveCheckPoint:
2011-05-23 16:38:57,476 WARN org.mortbay.log: /getimage: java.io.IOException: GetImage failed. java.io.IOException: Namenode has an edit log with timestamp of 2011-05-23 16:38:56 but new checkpoint was created using editlog with timestamp 2011-05-23 16:37:30. Checkpoint Aborted.

Is this a bug, or is this the expected behaviour?
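
The exception message above suggests that the namenode compares the timestamp of its current edit log with the timestamp of the edit log the checkpoint was built from, and aborts when they differ. A minimal sketch of such a check follows. The class `CheckpointTimestampSketch` and the method `validateCheckpoint` are hypothetical names used for illustration only and are not taken from the Hadoop source; the two timestamps are the ones from the log line.

```java
import java.io.IOException;
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

public class CheckpointTimestampSketch {
    private static final DateTimeFormatter FMT =
        DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss");

    // Hypothetical helper, not Hadoop's actual method: rejects a checkpoint
    // whose edit-log timestamp differs from the namenode's current one.
    public static void validateCheckpoint(LocalDateTime nameNodeEditsTime,
                                          LocalDateTime checkpointEditsTime)
            throws IOException {
        if (!nameNodeEditsTime.equals(checkpointEditsTime)) {
            throw new IOException("Namenode has an edit log with timestamp of "
                + FMT.format(nameNodeEditsTime)
                + " but new checkpoint was created using editlog with timestamp "
                + FMT.format(checkpointEditsTime) + ". Checkpoint Aborted.");
        }
    }

    public static void main(String[] args) {
        // Timestamps copied from the reported log line.
        LocalDateTime nameNodeTime = LocalDateTime.parse("2011-05-23 16:38:56", FMT);
        LocalDateTime checkpointTime = LocalDateTime.parse("2011-05-23 16:37:30", FMT);
        try {
            validateCheckpoint(nameNodeTime, checkpointTime);
            System.out.println("Checkpoint accepted");
        } catch (IOException e) {
            System.out.println("GetImage failed. " + e.getMessage());
        }
    }
}
```

Under this assumption, any restart that leaves the two timestamps out of step would make every later checkpoint upload fail the same way, which matches the reported symptom.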

Attachments

- HDFS-1981_0.22.patch (27/Jul/11 12:33, 4 kB, Uma Maheswara Rao G)
- HDFS-1981_0.23.patch (26/Jul/11 17:38, 4 kB, Uma Maheswara Rao G)
- HDFS-1981.patch (15/Jun/11 14:26, 5 kB, ramkrishna.s.vasudevan)
- HDFS-1981-1.patch (28/Jun/11 14:27, 5 kB, ramkrishna.s.vasudevan)
- HDFS-1981-2.patch (08/Jul/11 15:15, 4 kB, ramkrishna.s.vasudevan)

Issue Links

relates to: HADOOP-5314 needToSave incorrectly calculated in loadFSImage() (Closed)

People

Assignee: Uma Maheswara Rao G

Reporter: ramkrishna.s.vasudevan

Votes: 0

Watchers: 8

Dates

Created: 23/May/11 11:13

Updated: 12/Dec/11 06:19

Resolved: 27/Jul/11 23:35