[HDFS-2093] 1073: Handle case where an entirely empty log is left during NN crash - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: Edit log branch (HDFS-1073)
Fix Version/s: Edit log branch (HDFS-1073)
Component/s: namenode
Labels:
None

Hadoop Flags:

Reviewed

Description

In fault-testing the ~~HDFS-1073~~ branch, I saw the following situation:

NN has two storage directories, but one is in failed state
NN starts to roll edits logs to edits_inprogress_5160285
NN then crashes
on restart, it detects the truncated log, but since it has 0 txns, it finalizes it to the nonsense log name edits_5160285-5160284.
It then starts logs again at edits_inprogress_5160285.
After this point, no checkpoints or future NN startups succeed since there are two logs starting with the same txid

Attachments

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

hdfs-2093.txt
21/Jun/11 05:26
5 kB
Todd Lipcon
hdfs-2093.txt
21/Jun/11 06:40
7 kB
Todd Lipcon
hdfs-2093.txt
21/Jun/11 22:27
12 kB
Todd Lipcon
hdfs-2093.txt
22/Jun/11 00:44
12 kB
Todd Lipcon

Activity

People

Assignee:: Todd Lipcon

Reporter:: Todd Lipcon

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 21/Jun/11 05:00

Updated:: 24/Jun/11 23:32

Resolved:: 24/Jun/11 23:32