Hadoop HDFS / HDFS-457

better handling of volume failure in Data Node storage

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.20.203.0, 0.21.0
    • Component/s: datanode
    • Labels: None
    • Hadoop Flags: Reviewed
    • Release Note:
      Datanode can continue if a volume for replica storage fails. Previously a datanode shut down if any volume failed.

    Description

      The current implementation shuts the DataNode down completely when one of the configured storage volumes fails.
      This is wasteful behavior: it decreases utilization (good storage becomes unavailable) and imposes extra load on the system (re-replication of the blocks from the good volumes). These problems will become even more prominent as we move to mixed (heterogeneous) clusters with many more volumes per Data Node.

      Attachments

      1. HDFS_457.patch (2 kB, Jeff Zhang)
      2. HDFS-457_20-append.patch (27 kB, Nicolas Spiegelberg)
      3. HDFS-457.patch (29 kB, Boris Shkolnik)
      4. HDFS-457-1.patch (29 kB, Boris Shkolnik)
      5. HDFS-457-2.patch (29 kB, Boris Shkolnik)
      6. HDFS-457-2.patch (29 kB, Boris Shkolnik)
      7. HDFS-457-2.patch (28 kB, Boris Shkolnik)
      8. HDFS-457-3.patch (29 kB, Boris Shkolnik)
      9. HDFS-457-y20.patch (15 kB, Konstantin Shvachko)
      10. jira.HDFS-457.branch-0.20-internal.patch (16 kB, Erik Steffl)
      11. TestFsck.zip (689 kB, Tsz Wo Nicholas Sze)

        Activity

          Boris Shkolnik added a comment -

          We should try to keep the DataNode alive even if one of its volumes is not accessible. When the DataNode handles the error, we can check what percentage of the volumes is down. If it is more than some predefined threshold, we should shut the node down; if not, we can keep it alive. In the latter case we need to do the following (see the sketch at the end of this comment):
          • remove the volume from the list of valid volumes
          • go over all the blocks and remove those that reside on this volume
          • immediately schedule a block report to update the namenode and start replication
          • optionally, monitor removed volumes (or periodically compare valid ones against the configured ones), and if one of them comes back to life (or on an operator command) try to restore it. (I don't know if this is possible and plausible, but it can be designed/done as a next step.)

          Affected classes and methods:

          • BlockReceiver constructor
          • BlockReceiver run()
          • BlockReceiver lastDataNodeRun()
          • FSDataset, FSVolume, FSVolumeSet, FSDir
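
          A minimal standalone sketch of these steps, assuming a simplified FSVolume/volumeMap model; the class names, threshold value, and method bodies are illustrative stand-ins, not the actual DataNode code:

            import java.util.ArrayList;
            import java.util.HashMap;
            import java.util.Iterator;
            import java.util.List;
            import java.util.Map;

            class VolumeFailureSketch {
              static class FSVolume { final String dir; FSVolume(String d) { dir = d; } }
              static class Block { final FSVolume volume; Block(FSVolume v) { volume = v; } }

              private final List<FSVolume> volumes = new ArrayList<>();
              private final Map<Long, Block> volumeMap = new HashMap<>();
              private final int configuredVolumes;
              // shut the node down if more than half the volumes are gone (illustrative value)
              private static final double FAILED_VOLUME_THRESHOLD = 0.5;

              VolumeFailureSketch(int configuredVolumes) { this.configuredVolumes = configuredVolumes; }

              /** Returns false if too many volumes failed and the node should shut down. */
              boolean handleVolumeFailure(FSVolume failed) {
                // 1. remove the volume from the list of valid volumes
                volumes.remove(failed);
                double failedFraction = 1.0 - (double) volumes.size() / configuredVolumes;
                if (failedFraction > FAILED_VOLUME_THRESHOLD) {
                  return false; // fatal: the caller shuts the DataNode down
                }
                // 2. go over all blocks and drop those residing on the failed volume
                for (Iterator<Block> it = volumeMap.values().iterator(); it.hasNext(); ) {
                  if (it.next().volume == failed) {
                    it.remove();
                  }
                }
                // 3. schedule an immediate block report so the NN starts re-replication
                scheduleBlockReport();
                return true;
              }

              void scheduleBlockReport() { /* placeholder for the real scheduling call */ }
            }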
          dhruba borthakur added a comment -

          Another big issue is "reporting". In the current code, an administrator sees the dead DataNode in the web UI and knows that he/she has to fix it.

          In your proposal, a datanode could stop using a portion of its disks, and it would be very worthwhile to report this to the administrator. Otherwise, a certain percentage of disk space could leak out of HDFS usage without anybody knowing about it.

          Boris Shkolnik added a comment -

          Good point.
          I will report to the NN using the errorReport interface. Currently, if the NN receives a DatanodeProtocol.DiskError message it removes the datanode.
          I will introduce a DatanodeProtocol.FatalDiskError, which should remove the datanode, while DiskError will just cause the NN to log a WARN message about the failed volume (see the sketch below).
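
          A sketch of that split on the NameNode side, with simplified stand-in constants and method signature (not the exact DatanodeProtocol definitions):

            import java.util.logging.Logger;

            class ErrorReportSketch {
              static final int DISK_ERROR = 1;       // some volumes failed, DN keeps running
              static final int FATAL_DISK_ERROR = 2; // no usable volumes left on the DN
              private static final Logger LOG = Logger.getLogger("NameNode");

              /** NameNode-side handling of a datanode error report. */
              void errorReport(String datanode, int errorCode, String msg) {
                if (errorCode == FATAL_DISK_ERROR) {
                  removeDatanode(datanode);          // node can no longer store replicas
                } else if (errorCode == DISK_ERROR) {
                  LOG.warning("Volume failure on " + datanode + ": " + msg);
                }
              }

              void removeDatanode(String datanode) { /* placeholder */ }
            }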

          Boris Shkolnik added a comment -

          We should also handle failed reads from the failed volumes.
          We can take care of it in FSDataset.validateBlockFile(), because it is called on every "read" operation (see the sketch below).
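
          A standalone sketch of guarding the read path this way; the method names mirror the discussion and the bodies are stubs, not the actual FSDataset code:

            import java.io.File;
            import java.io.IOException;

            class ReadPathSketch {
              File getBlockFile(long blockId) {
                File f = null;
                try {
                  f = validateBlockFile(blockId);  // called on every read
                } catch (IOException e) {
                  // a failed read may mean the whole volume is gone: re-check the disks
                  checkDataDirs();
                }
                return f;                          // null => block unavailable
              }

              File validateBlockFile(long blockId) throws IOException {
                throw new IOException("stub");     // placeholder for the real validation
              }

              void checkDataDirs() { /* placeholder for the volume re-scan */ }
            }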

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12413868/HDFS-457.patch
          against trunk revision 794953.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 7 new or modified tests.

          -1 javadoc. The javadoc tool appears to have generated 1 warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          -1 findbugs. The patch appears to introduce 1 new Findbugs warnings.

          -1 release audit. The applied patch generated 285 release audit warnings (more than the trunk's current 282 warnings).

          -1 core tests. The patch failed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/23/testReport/
          Release audit warnings: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/23/artifact/trunk/current/releaseAuditDiffWarnings.txt
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/23/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/23/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/23/console

          This message is automatically generated.

          Tsz Wo Nicholas Sze added a comment -
          • There is a second version of LOG.warn(..) which accepts an Exception object as a parameter. You may consider changing
            +      DataNode.LOG.warn("IOException in BlockReceiver constructor. Cause is " + 
            +          cause);
            

            to

            +      DataNode.LOG.warn("IOException in BlockReceiver constructor.", cause);
            

            and the other LOG.warn/info/err calls.

          • DataNode.checkDiskError() should not be public.
          • The return type of checkDirs() should be List<FSVolume>.
          • Looks like removed_vols.size() is equal to removed, or removed_vols == null when removed == 0. So we may drop removed.
          • We may omit the toString() calls in the following:
            +          DataNode.LOG.warn("Removing failed volume " + fsv.toString() + " - " + e.getLocalizedMessage());
            
            +          "volumes. List of current volumes: " +   toString());
            
          • It is hard to relate the name "keepRunning" to "checks how many valid storage volumes are there in the DataNode". How about changing it to something like "hasEnoughVolumes", "hasEnoughResource" or "isSystemHealthy"?
          • The second "f = null" is redundant since f == null if validateBlockFile throws an exception.
            +    File f = null;;
            +    try {
            +      f = validateBlockFile(b);
            +    } catch(IOException e) {
            +      f = null;
            +    }
            

            I suggest printing some messages in the catch block.

          • checkDataDir() changes volumeMap but is not synchronized on it (see the sketch after this list).
          • Also, would the new checkDataDir() implementation take a long time to execute? We may need some performance tests on this.
          • It is better not to change the existing constant values.
            -  final static int DISK_ERROR = 1;
            -  final static int INVALID_BLOCK = 2;
            +  final static int DISK_ERROR = 1; // there are still valid volumes on DN
            +  final static int FATAL_DISK_ERROR = 2; // no valid volumes left on DN
            +  final static int INVALID_BLOCK = 3;
            

            How about setting FATAL_DISK_ERROR to 3?
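
          A minimal sketch of the synchronization point raised above, with simplified stand-in types (not the actual FSDataset code):

            import java.util.HashMap;
            import java.util.Iterator;
            import java.util.Map;

            class CheckDataDirSketch {
              static class FSVolume { boolean failed; }
              static class Block { FSVolume volume; }

              private final Map<Long, Block> volumeMap = new HashMap<>();

              void removeBlocksOfFailedVolumes() {
                synchronized (volumeMap) {   // guard against concurrent readers/writers
                  for (Iterator<Block> it = volumeMap.values().iterator(); it.hasNext(); ) {
                    if (it.next().volume.failed) {
                      it.remove();
                    }
                  }
                }
              }
            }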

          Boris Shkolnik added a comment -

          Thanks for the comments.

          >> Also, would the new checkDataDir() implementation take a long time to execute? We may need some performance test on this.
          This function makes one loop over ALL the blocks. I believe this is what causes the concern. I've looked at it and ran some tests.
          1. This loop will happen only when there is an actual failure. I think some extra work is expected in this case.
          2. This extra work shouldn't be so bad. In my tests it took about 600 ms for 1500 blocks and about 800 ms for 5000. For huge nodes with hundreds of thousands of blocks it should still be tolerable.

          Boris Shkolnik added a comment -

          Fixed the Hudson errors and implemented the review suggestions.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12415899/HDFS-457-1.patch
          against trunk revision 802264.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 7 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/50/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/50/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/50/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/50/console

          This message is automatically generated.

          Edward Capriolo added a comment -

          Also, I just ran into this. One of my disks failed, however it was still available on the system as read-only; some of the directories produce I/O errors when read. Will this patch be able to deal with that condition?

          Boris Shkolnik added a comment -

          The disk is checked by DiskChecker.checkDir(), which checks mkdir, read and write. So in this case it should consider the volume as failed, because it cannot write to it (see the sketch below).
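
          A minimal standalone equivalent of the checks described (not Hadoop's actual DiskChecker implementation): the directory must exist or be creatable, and be readable and writable, so a disk remounted read-only fails the check:

            import java.io.File;
            import java.io.IOException;

            class DirCheckSketch {
              static void checkDir(File dir) throws IOException {
                if (!dir.mkdirs() && !dir.isDirectory()) {
                  throw new IOException("Cannot create directory: " + dir);
                }
                if (!dir.canRead()) {
                  throw new IOException("Directory is not readable: " + dir);
                }
                if (!dir.canWrite()) {
                  throw new IOException("Directory is not writable: " + dir);
                }
              }
            }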

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12416492/HDFS-457-2.patch
          against trunk revision 803973.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 7 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/66/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/66/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/66/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/66/console

          This message is automatically generated.

          Boris Shkolnik added a comment -

          Fixed two issues:
          1. Make sure the block report is sent before verification starts.
          2. Delete the files in the directory instead of the whole directory, because deleting a directory with locked files in it may fail on Windows or on NFS (see the sketch below).
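
          A standalone sketch of the second fix, assuming a simple recursive helper (not the actual patch code):

            import java.io.File;

            class DeleteFilesSketch {
              /** Deletes files one by one; returns how many could not be deleted yet. */
              static int clearDirectory(File dir) {
                int remaining = 0;
                File[] files = dir.listFiles();
                if (files == null) {
                  return 0;                         // not a directory, or an I/O error
                }
                for (File f : files) {
                  if (f.isDirectory()) {
                    remaining += clearDirectory(f); // recurse into subdirectories
                  } else if (!f.delete()) {
                    remaining++;                    // locked file: leave it for later
                  }
                }
                return remaining;
              }
            }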

          Boris Shkolnik added a comment -

          The failed test, testFsckMove, timed out.

          Tsz Wo Nicholas Sze added a comment -

          TestFsck.zip: downloaded the log file from the build and extracted the TestFsck log.

          The log is unexpectedly long (34 MB). Could you take a look, Boris?

          Boris Shkolnik added a comment -

          Re-uploading the patch to get it run by Hudson.

          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12416633/HDFS-457-2.patch
          against trunk revision 804127.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 7 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/69/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/69/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/69/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-vesta.apache.org/69/console

          This message is automatically generated.

          Tsz Wo Nicholas Sze added a comment -

          A few nits:

          • change numberOfVolumes() to private
          • keep handleDiskError(String errMsgr) private
          • "shutdonw" should be "shutdown"
            +    // to shutdonw DN completely and don't want NN to remove it.
            
          Mahadev konar added a comment -

          I would suggest fixing the spelling of "shutdown" as well.

          Mahadev konar added a comment -

          Sorry, ignore my comment; I didn't read Nicholas's comments.

          Boris Shkolnik added a comment -

          Implemented Nicholas's comments.

          Tsz Wo Nicholas Sze added a comment -

          +1, the patch looks good.

          The diff between HDFS-457-3.patch and HDFS-457-2.patch is minor, so I think we don't have to submit it to Hudson again; that saves some cycles.

          Tsz Wo Nicholas Sze added a comment -

          I have committed this. Thanks, Boris!

          Hudson added a comment -

          Integrated in Hadoop-Hdfs-trunk #53 (see http://hudson.zones.apache.org/hudson/job/Hadoop-Hdfs-trunk/53/)
          HDFS-457. Do not shut down the datanode if some, but not all, volumes fail. Contributed by Boris Shkolnik.

          Tsz Wo Nicholas Sze added a comment -

          The committed patch incorrectly uses org.mortbay.log.Log. Created HDFS-612.

          Robert Chansler added a comment -

          Editorial pass over all release notes prior to publication of 0.21.

          Nicolas Spiegelberg added a comment -

          We should add this to 0.20-append for durability.

          Nicolas Spiegelberg added a comment -

          The new patch should apply cleanly to 0.20-append. Also imported the VolumeFailure unit test.

          Todd Lipcon added a comment -

          Removing 0.20-append tag - this isn't append-specific.

          Jeff Zhang added a comment -

          I noticed that only the BlockSender does checkDisk, so only a read operation will make the namenode aware that a disk has failed. Has anyone tried this patch? I suspect there are still some problems with it.

          In the test case TestDataNodeVolumeFailure, I made a small change and replaced the triggerFailure with a write operation. In this case the datanode does not know that one volume has failed, so only one replica is written successfully, and the block then waits a long time for another replica to be copied from another datanode.

          Jeff Zhang added a comment -

          Attaching a patch: do checkDiskError in BlockReceiver before allocating a volume for the new block. After checkDiskError it is guaranteed that the remaining volumes are all healthy, since the failed volumes have been removed (see the sketch below).
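
          A standalone sketch of that write-path ordering, with simplified stand-in types (the names follow the discussion, not the actual patch):

            import java.io.IOException;
            import java.util.ArrayList;
            import java.util.List;

            class WritePathSketch {
              static class FSVolume { boolean healthy = true; }

              private final List<FSVolume> volumes = new ArrayList<>();

              FSVolume allocateVolumeForNewBlock() throws IOException {
                checkDiskError();                  // prune failed volumes first
                if (volumes.isEmpty()) {
                  throw new IOException("No healthy volumes left");
                }
                return volumes.get(0);             // the real code picks volumes round-robin
              }

              void checkDiskError() {
                volumes.removeIf(v -> !v.healthy); // simplified re-scan of the volumes
              }
            }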

          Jeff Zhang added a comment -

          This is my first patch on HDFS; I'm not sure whether it is right to attach the patch here, or whether I need to create a new JIRA for this issue.

          Eli Collins added a comment -

          Hey Jeff,

          Nice catch. Please file a new jira.

          Thanks,
          Eli

          Jeff Zhang added a comment -

          Created JIRA HDFS-1273 for this issue and put the patch there.

          Uma Mahesh added a comment -

          Hi,
          checkDirs will throw a NullPointerException:

            ArrayList<FSVolume> removed_vols = null;
            ...
            } catch (...) {
              removed_vols = new ArrayList<FSVolume>();
            }
            ...
            removed_vols.size();

          If there is no exception, the last line will throw a NullPointerException.

          Eli Collins added a comment -

          Hey Uma,

          I think you're looking at an old patch; all the accesses of removedVols in checkDirs are first checked against null in trunk.

          Thanks,
          Eli

          Todd Lipcon added a comment -

          Marking this fixed in 0.20.203 as well, since it was committed to branch-0.20-security long ago, in time for that release.


            People

            • Assignee: Boris Shkolnik
            • Reporter: Boris Shkolnik
            • Votes: 0
            • Watchers: 16
