[HDFS-9826] Erasure Coding: Postpone the recovery work for a configurable time period - ASF JIRA

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Not A Problem
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels: None

Description

Currently, the NameNode prepares recovery work as soon as it finds an under-replicated block group. This is inefficient and takes resources away from other operations. It would be better to postpone the recovery work for a period of time when only one internal block is corrupted, considering the following points made by papers such as [1][2]:
1. Transient errors in which no data are lost account for more than 90% of data center failures, owing to network partitions, software problems, or non-disk hardware faults.
2. Although erasure codes tolerate multiple simultaneous failures, single failures represent 99.75% of recoveries.

Different clusters may behave differently, so we should allow users to configure how long recoveries are postponed. A proper setting will avoid a large proportion of unnecessary recoveries. When multiple internal blocks in a block group are found to be corrupted, we prepare the recovery work immediately, because this case is very rare and we don't want to increase the risk of losing data.
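
The decision rule described above can be sketched roughly as follows. This is only an illustration of the proposal, not the code in the attached patches: the class `PostponedRecoveryPolicy`, its method `shouldRecoverNow`, and the `postponeMillis` parameter are hypothetical names, and the sketch assumes the caller already knows how many internal blocks of a block group are missing or corrupted.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Hypothetical sketch of the postponement rule; not taken from the patches.
 * A block group with exactly one bad internal block waits postponeMillis
 * before recovery; a group with several bad internal blocks recovers at once.
 */
public class PostponedRecoveryPolicy {
    // Assumed to come from a user-configurable setting (name not specified here).
    private final long postponeMillis;

    // Block group id -> time at which a single failure was first observed.
    private final Map<Long, Long> firstSeenMillis = new HashMap<>();

    public PostponedRecoveryPolicy(long postponeMillis) {
        if (postponeMillis < 0) {
            throw new IllegalArgumentException("postponeMillis must be >= 0");
        }
        this.postponeMillis = postponeMillis;
    }

    /** Returns true if recovery for this block group should be prepared now. */
    public boolean shouldRecoverNow(long blockGroupId, int badInternalBlocks, long nowMillis) {
        if (badInternalBlocks <= 0) {
            // The group is healthy again (for example, a transient error cleared),
            // so forget any pending timer.
            firstSeenMillis.remove(blockGroupId);
            return false;
        }
        if (badInternalBlocks > 1) {
            // Multiple failures are rare and risky: never postpone.
            return true;
        }
        // Exactly one bad internal block: start the timer on first sight,
        // then recover only once the postponement period has elapsed.
        Long previous = firstSeenMillis.putIfAbsent(blockGroupId, nowMillis);
        long start = (previous == null) ? nowMillis : previous;
        return nowMillis - start >= postponeMillis;
    }
}
```

With a postponement of zero, the sketch falls back to recovering immediately, which matches the current behavior described at the top of this issue. If the single failure disappears before the period elapses, no recovery is scheduled at all, which is the saving the proposal is after.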

[1] Availability in globally distributed storage systems
http://static.usenix.org/events/osdi10/tech/full_papers/Ford.pdf
[2] Rethinking erasure codes for cloud file systems: minimizing I/O for recovery and degraded reads
http://static.usenix.org/events/fast/tech/full_papers/Khan.pdf

Attachments

HDFS-9826-001.patch, 7 kB, uploaded 01/Mar/16 02:33 by Li Bo
HDFS-9826-002.patch, 8 kB, uploaded 09/Mar/16 03:59 by Li Bo

People

Assignee: Li Bo

Reporter: Li Bo

Votes: 0

Watchers: 4

Dates

Created: 18/Feb/16 08:32

Updated: 11/Jan/17 01:56

Resolved: 11/Jan/17 01:56