[HDFS-3049] During the normal loading NN startup process, fall back on a different EditLog if we see one that is corrupt - ASF JIRA

Details

Type: New Feature
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: 0.23.0
Fix Version/s: 2.0.3-alpha
Component/s: namenode
Labels: None
Target Version/s: 2.0.3-alpha
Hadoop Flags: Reviewed

Description

During the NameNode startup process, we load an image, and then apply edit logs to it until we believe that we have all the latest changes. Unfortunately, if there is an I/O error while reading any of these files, in most cases, we simply abort the startup process. We should try harder to locate a readable edit log and/or image file.

There are three main use cases for this feature:
1. If the operating system does not honor fsync (usually due to a misconfiguration), a file may end up in an inconsistent state.
2. In certain older releases where we did not use fallocate() or similar to pre-reserve blocks, a disk full condition may cause a truncated log in one edit directory.
3. There may be a bug in HDFS which results in some of the data directories receiving corrupt data, but not all. This is the least likely use case.

Proposed changes to normal NN startup:

We should try a different FSImage if we can't load the first one we try.
We should examine other FSEditLogs if we can't load the first one(s) we try.
We should fail if we can't find EditLogs that would bring us up to what we believe is the latest transaction ID.
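The edit log rules above can be sketched roughly as follows. This is only an illustration and not the code from the attached patches: `EditLogCandidate`, `EditLogFallback`, and `selectSegments` are hypothetical names, corruption is modeled as a simple `readable` flag instead of being detected while reading, and preferring the copy that covers the most transactions is an assumption.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical model of one edit log segment found in one storage directory.
final class EditLogCandidate {
    final String dir;
    final long firstTxId;
    final long lastTxId;
    final boolean readable; // stands in for "no I/O error or corruption while reading"

    EditLogCandidate(String dir, long firstTxId, long lastTxId, boolean readable) {
        this.dir = dir;
        this.firstTxId = firstTxId;
        this.lastTxId = lastTxId;
        this.readable = readable;
    }
}

final class EditLogFallback {
    /**
     * Chooses readable segments that cover imageTxId + 1 .. expectedLastTxId.
     * When one copy of a segment is unreadable, another directory's copy is used.
     * Throws if no readable copy exists for some part of the range, which
     * corresponds to failing normal startup.
     */
    static List<EditLogCandidate> selectSegments(
            List<EditLogCandidate> candidates, long imageTxId, long expectedLastTxId) {
        List<EditLogCandidate> chosen = new ArrayList<>();
        long nextTxId = imageTxId + 1;
        while (nextTxId <= expectedLastTxId) {
            EditLogCandidate pick = null;
            for (EditLogCandidate c : candidates) {
                boolean usable = c.readable
                        && c.firstTxId == nextTxId
                        && c.lastTxId >= c.firstTxId;
                // Assumption: among usable copies, prefer the longest one.
                if (usable && (pick == null || c.lastTxId > pick.lastTxId)) {
                    pick = c;
                }
            }
            if (pick == null) {
                throw new IllegalStateException(
                        "No readable edit log starting at txid " + nextTxId
                        + "; expected to reach txid " + expectedLastTxId);
            }
            chosen.add(pick);
            nextTxId = pick.lastTxId + 1;
        }
        return chosen;
    }
}
```

In this sketch, a corrupt copy of a segment in `/foo/bar` is skipped as long as `/foo/baz` holds a readable copy starting at the same transaction ID. Startup is refused only when no directory can supply the next range.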

Proposed changes to recovery mode NN startup:
We should list all the available storage directories and allow the operator to select which one to use.
Something like this:

Multiple storage directories found.
1. /foo/bar
    edits__current__XYZ    size:213421345    md5:2345345
    image                  size:213421345    md5:2345345
2. /foo/baz
    edits__current__XYZ    size:213421345    md5:2345345345
    image                  size:213421345    md5:2345345
Which one would you like to use? (1/2)

As usual in recovery mode, we want to be flexible about error handling. In this case, this means that we should NOT fail if we can't find EditLogs that would bring us up to what we believe is the latest transaction ID.
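The recovery-mode prompt could be driven by something like the sketch below. It is a rough illustration under stated assumptions: `StorageDirPrompt`, `formatMenu`, and `parseChoice` are hypothetical names that do not come from the patches, the per-file size and md5 lines are omitted for brevity, and at least one directory is assumed to be present.

```java
import java.util.List;

// Hypothetical helper that renders the directory menu and validates the answer.
final class StorageDirPrompt {
    /** Builds the menu text; assumes dirs contains at least one entry. */
    static String formatMenu(List<String> dirs) {
        StringBuilder sb = new StringBuilder("Multiple storage directories found.\n");
        for (int i = 0; i < dirs.size(); i++) {
            sb.append(i + 1).append(". ").append(dirs.get(i)).append('\n');
        }
        sb.append("Which one would you like to use? (1");
        for (int i = 2; i <= dirs.size(); i++) {
            sb.append('/').append(i);
        }
        return sb.append(')').toString();
    }

    /**
     * Converts the operator's answer into a zero-based index,
     * or returns -1 if the answer is not a valid choice.
     */
    static int parseChoice(String answer, int count) {
        if (answer == null) {
            return -1;
        }
        try {
            int n = Integer.parseInt(answer.trim());
            return (n >= 1 && n <= count) ? n - 1 : -1;
        } catch (NumberFormatException e) {
            return -1;
        }
    }
}
```

A caller would print `formatMenu(dirs)`, read a line from the console, and repeat the prompt while `parseChoice` returns -1. Because invalid input produces a re-prompt instead of an abort, this fits the flexible error handling expected in recovery mode.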

Not addressed by this feature
This feature will not address the case where an attempt to access the NameNode name directory or directories hangs because of an I/O error. This may happen, for example, when trying to load an image from a hard-mounted NFS directory, when the NFS server has gone away. Just as now, the operator will have to notice this problem and take steps to correct it.

Attachments

HDFS-3049.001.patch               13/Apr/12 23:27    22 kB   Colin McCabe
HDFS-3049.002.patch               16/Apr/12 22:41    23 kB   Colin McCabe
HDFS-3049.003.patch               17/Apr/12 19:24    23 kB   Colin McCabe
HDFS-3049.005.against3335.patch   10/May/12 22:09    53 kB   Colin McCabe
HDFS-3049.006.against3335.patch   11/May/12 19:20    69 kB   Colin McCabe
HDFS-3049.007.against3335.patch   14/May/12 17:58    72 kB   Colin McCabe
HDFS-3049.010.patch               15/May/12 17:25    79 kB   Colin McCabe
HDFS-3049.011.patch               15/May/12 19:39    79 kB   Colin McCabe
HDFS-3049.012.patch               15/May/12 21:38    88 kB   Colin McCabe
HDFS-3049.013.patch               16/May/12 00:04    88 kB   Colin McCabe
HDFS-3049.015.patch               16/May/12 22:52   114 kB   Colin McCabe
HDFS-3049.017.patch               17/May/12 07:14   117 kB   Colin McCabe
HDFS-3049.018.patch               17/May/12 18:37   118 kB   Colin McCabe
HDFS-3049.021.patch               18/May/12 03:17    87 kB   Colin McCabe
HDFS-3049.023.patch               24/May/12 20:34    37 kB   Colin McCabe
HDFS-3049.025.patch               25/May/12 03:50    39 kB   Colin McCabe
HDFS-3049.026.patch               29/May/12 18:09    39 kB   Colin McCabe
HDFS-3049.027.patch               04/Jun/12 21:45    41 kB   Colin McCabe
HDFS-3049.028.patch               07/Jun/12 01:25    40 kB   Colin McCabe
HDFS-3049.028.patch               07/Jun/12 18:20    40 kB   Colin McCabe
HDFS-3049.028.patch               08/Jun/12 00:34    40 kB   Colin McCabe
hdfs-3049-branch-2.txt            04/Dec/12 00:49    28 kB   Todd Lipcon

Issue Links

breaks
    HDFS-3614 Revert unused MiniDFSCluster constructor from HDFS-3049 (Resolved)

is related to
    HDFS-3277 fail over to loading a different FSImage if the first one we try to load is corrupt (Closed)
    HDFS-3440 should more effectively limit stream memory consumption when reading corrupt edit logs (Closed)
    HDFS-3853 Port MiniDFSCluster enableManagedDfsDirsRedundancy option to branch-2 (Closed)
    HDFS-3004 Implement Recovery Mode (Closed)

relates to
    HDFS-2797 Fix misuses of InputStream#skip in the edit log code (Closed)

supercedes
    HDFS-2982 Startup performance suffers when there are many edit log segments (Closed)
