Hadoop HDFS / HDFS-10301

BlockReport retransmissions may lead to storages falsely being declared zombie if storage report processing happens out of order

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 2.6.1
    • Fix Version/s: 2.8.0, 2.7.4, 3.0.0-alpha2
    • Component/s: namenode
    • Labels:
      None

      Description

      When the NameNode is busy, a DataNode can time out while sending a block report, and then it sends the block report again. The NameNode, processing these two reports at the same time, can interleave the processing of storages from different reports. This corrupts the blockReportId bookkeeping, which makes the NameNode think that some storages are zombies. Replicas on zombie storages are immediately removed, causing missing blocks.
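      For illustration only, here is a minimal, self-contained sketch of the race (hypothetical class and field names; the real bookkeeping lives in DatanodeDescriptor and BlockManager). A retransmitted report overwrites the per-DataNode "current" report ID while storages from the earlier report are still waiting to be processed, so those storages fail the zombie check:

          import java.util.ArrayList;
          import java.util.HashMap;
          import java.util.List;
          import java.util.Map;

          // Simplified model of the per-DataNode state shared by all storages.
          public class ZombieRaceSketch {
            private long curBlockReportId;                       // overwritten by each report
            private final Map<String, Long> lastReportIdPerStorage = new HashMap<>();

            // Called once per storage report; storages of BR1 and BR2 can interleave here.
            void processStorageReport(String storageId, long blockReportId) {
              curBlockReportId = blockReportId;
              lastReportIdPerStorage.put(storageId, blockReportId);
            }

            // Zombie check: any storage not tagged with the current report ID. If BR2
            // interleaves with BR1, storages still carrying BR1's ID are falsely declared
            // zombie and their replicas are removed.
            List<String> findZombies() {
              List<String> zombies = new ArrayList<>();
              for (Map.Entry<String, Long> e : lastReportIdPerStorage.entrySet()) {
                if (e.getValue() != curBlockReportId) {
                  zombies.add(e.getKey());
                }
              }
              return zombies;
            }

            public static void main(String[] args) {
              ZombieRaceSketch dn = new ZombieRaceSketch();
              dn.processStorageReport("s1", 1L);                 // storage s1 processed under BR1
              dn.processStorageReport("s2", 2L);                 // retransmitted BR2 interleaves
              System.out.println(dn.findZombies());              // prints [s1]: a false zombie
            }
          }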

      Attachments

      1. HDFS-10301.002.patch
        11 kB
        Colin P. McCabe
      2. HDFS-10301.003.patch
        16 kB
        Colin P. McCabe
      3. HDFS-10301.004.patch
        29 kB
        Vinitha Reddy Gankidi
      4. HDFS-10301.005.patch
        16 kB
        Colin P. McCabe
      5. HDFS-10301.006.patch
        31 kB
        Vinitha Reddy Gankidi
      6. HDFS-10301.007.patch
        32 kB
        Vinitha Reddy Gankidi
      7. HDFS-10301.008.patch
        32 kB
        Vinitha Reddy Gankidi
      8. HDFS-10301.009.patch
        32 kB
        Vinitha Reddy Gankidi
      9. HDFS-10301.01.patch
        16 kB
        Walter Su
      10. HDFS-10301.010.patch
        32 kB
        Vinitha Reddy Gankidi
      11. HDFS-10301.011.patch
        32 kB
        Vinitha Reddy Gankidi
      12. HDFS-10301.012.patch
        32 kB
        Vinitha Reddy Gankidi
      13. HDFS-10301.013.patch
        38 kB
        Vinitha Reddy Gankidi
      14. HDFS-10301.014.patch
        25 kB
        Vinitha Reddy Gankidi
      15. HDFS-10301.015.patch
        26 kB
        Vinitha Reddy Gankidi
      16. HDFS-10301.branch-2.015.patch
        24 kB
        Konstantin Shvachko
      17. HDFS-10301.branch-2.7.015.patch
        20 kB
        Vinitha Reddy Gankidi
      18. HDFS-10301.branch-2.7.patch
        28 kB
        Vinitha Reddy Gankidi
      19. HDFS-10301.branch-2.patch
        30 kB
        Konstantin Shvachko
      20. HDFS-10301.sample.patch
        4 kB
        Daryn Sharp
      21. zombieStorageLogs.rtf
        32 kB
        Konstantin Shvachko

        Issue Links

          Activity

          shv Konstantin Shvachko added a comment -

          More details.

          1. My DataNode has 6 storages. It sends a block report and times out, then it sends the same block report five more times with different blockReportIds.
          2. The NameNode starts executing all six reports around the same time and interleaves them, that is, it processes the first storage of BR2 before it processes the last storage of BR1. (Color-coded logs are coming.)
          3. While processing storages from BR2 the NameNode changes the lastBlockReportId field to the ID of BR2. This interferes with the processing of storages from BR1 that have not been processed yet: those storages are considered zombies, and all replicas are removed from those storages along with the storage itself.
          4. The storage is then reconstructed by the NameNode when it receives a heartbeat from the DataNode and is marked as "stale", but the replicas will not be restored until the next block report, which in my case comes a few hours later.
          5. I noticed missing blocks because several DataNodes exhibited the same behavior and all replicas of the same block were lost.
          6. The replicas eventually reappeared (several hours later), because DataNodes do not physically remove the replicas and report them in the next block report.

          The behavior was introduced by HDFS-7960 as part of the hot-swap feature. I did not do a hot-swap, and I did not fail over the NameNode.

          shv Konstantin Shvachko added a comment -

          My DN has the following six storages:

          DS-019298c0-aab9-45b4-8b62-95d6809380ff:NORMAL:kkk.sss.22.105
          DS-0ea95238-d9ba-4f62-ae18-fdb9333465ce:NORMAL:kkk.sss.22.105
          DS-191fc04b-90be-42c9-b6fb-fdd1517bf4c7:NORMAL:kkk.sss.22.105
          DS-4a2e91c7-cdf0-408b-83a6-286c3534d673:NORMAL:kkk.sss.22.105
          DS-5b2941f7-2b52-45a8-b135-dcbe488cc65b:NORMAL:kkk.sss.22.105
          DS-6849f605-fd83-462d-97c3-cb6949383f7e:NORMAL:kkk.sss.22.105
          

          Here are the logs for its block reports. All throw the same exception, but I pasted it only once.

          2016-04-12 22:31:58,931 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: Unsuccessfully sent block report 0x283d25423fb64d,  containing 6 storage report(s), of which we sent 0. The reports had 81565 total blocks and used 0 RPC(s). This took 19 msec to generate and 60078 msecs for RPC and NN processing. Got back no commands.
          2016-04-12 22:31:58,931 WARN org.apache.hadoop.hdfs.server.datanode.DataNode: IOException in offerService
          java.net.SocketTimeoutException: Call From dn-hcl1264.my.cluster.com/kkk.sss.22.105 to namenode-ha1.my.cluster.com:9000 failed on socket timeout exception: java.net.SocketTimeoutException: 60000 millis timeout while waiting for channel to be ready for read. ch : java.nio.channels.SocketChannel[connected local=/kkk.sss.22.105:10101 remote=namenode-ha1.my.cluster.com/10.150.1.56:9000]; For more details see:  http://wiki.apache.org/hadoop/SocketTimeout
                  at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
                  at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
                  at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
                  at java.lang.reflect.Constructor.newInstance(Constructor.java:408)
                  at org.apache.hadoop.net.NetUtils.wrapWithMessage(NetUtils.java:791)
                  at org.apache.hadoop.net.NetUtils.wrapException(NetUtils.java:750)
                  at org.apache.hadoop.ipc.Client.call(Client.java:1473)
                  at org.apache.hadoop.ipc.Client.call(Client.java:1400)
                  at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:232)
                  at com.sun.proxy.$Proxy12.blockReport(Unknown Source)
                  at org.apache.hadoop.hdfs.protocolPB.DatanodeProtocolClientSideTranslatorPB.blockReport(DatanodeProtocolClientSideTranslatorPB.java:178)
                  at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.blockReport(BPServiceActor.java:494)
                  at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.offerService(BPServiceActor.java:732)
                  at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.run(BPServiceActor.java:872)
                  at java.lang.Thread.run(Thread.java:745)
          Caused by: java.net.SocketTimeoutException: 60000 millis timeout while waiting for channel to be ready for read. ch : java.nio.channels.SocketChannel[connected local=/kkk.sss.22.105:10101 remote=namenode-ha1.my.cluster.com/10.150.1.56:9000]
                  at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:164)
                  at org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:161)
                  at org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:131)
                  at java.io.FilterInputStream.read(FilterInputStream.java:133)
                  at java.io.FilterInputStream.read(FilterInputStream.java:133)
                  at org.apache.hadoop.ipc.Client$Connection$PingInputStream.read(Client.java:514)
                  at java.io.BufferedInputStream.fill(BufferedInputStream.java:246)
                  at java.io.BufferedInputStream.read(BufferedInputStream.java:265)
                  at java.io.DataInputStream.readInt(DataInputStream.java:387)
                  at org.apache.hadoop.ipc.Client$Connection.receiveRpcResponse(Client.java:1072)
                  at org.apache.hadoop.ipc.Client$Connection.run(Client.java:967)
          
          2016-04-12 22:32:59,179 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: Unsuccessfully sent block report 0x283d334a100bde,  containing 6 storage report(s), of which we sent 0. The reports had 81565 total blocks and used 0 RPC(s). This took 17 msec to generate and 60066 msecs for RPC and NN processing. Got back no commands.
          2016-04-12 22:33:59,311 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: Unsuccessfully sent block report 0x283d414ae386b2,  containing 6 storage report(s), of which we sent 0. The reports had 81565 total blocks and used 0 RPC(s). This took 16 msec to generate and 60055 msecs for RPC and NN processing. Got back no commands.
          2016-04-12 22:34:59,409 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: Unsuccessfully sent block report 0x283d4f4a605732,  containing 6 storage report(s), of which we sent 0. The reports had 81565 total blocks and used 0 RPC(s). This took 16 msec to generate and 60032 msecs for RPC and NN processing. Got back no commands.
          2016-04-12 22:35:59,585 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: Unsuccessfully sent block report 0x283d5d4ca9bf5c,  containing 6 storage report(s), of which we sent 0. The reports had 81565 total blocks and used 0 RPC(s). This took 15 msec to generate and 60040 msecs for RPC and NN processing. Got back no commands.
          2016-04-12 22:36:47,307 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: Successfully sent block report 0x283d6b4ac1b50a,  containing 6 storage report(s), of which we sent 6. The reports had 81565 total blocks and used 1 RPC(s). This took 17 msec to generate and 47664 msecs for RPC and NN processing. Got back one command: FinalizeCommand/5.
          

          I'll attach the logs for processing these six block reports on the NameNode. Each color represents a single report. You can see how the colors interleave, and the zombie storage messages in the middle.

          daryn Daryn Sharp added a comment -

          Enabling HDFS-9198 will fifo process BRs. It doesn't solve this implementation bug but virtually eliminates it from occurring.

          shv Konstantin Shvachko added a comment -

          Hey Daryn, I'm not sure how HDFS-9198 eliminates it from occurring. DataNodes are still waiting for the NN to process each BR, so they can time out and send the same block report multiple times. On the NN side, BR ops processing is multi-threaded, so it can still interleave processing of storages from different reports. Could you please clarify what I am missing?

          walter.k.su Walter Su added a comment -

          1. The IPC reader is single-threaded by default. If it's multi-threaded, the order of putting RPC requests into the callQueue is unspecified.
          2. The IPC callQueue is FIFO.
          3. The IPC handler is multi-threaded. If two handlers are both waiting on the fsn lock, the entry order depends on the fairness of the lock.

          When constructed as fair, threads contend for entry using an approximately arrival-order policy. When the currently held lock is released either the longest-waiting single writer thread will be assigned the write lock... (quoted from https://docs.oracle.com/javase/7/docs/api/java/util/concurrent/locks/ReentrantReadWriteLock.html)

          I think if the DN can't get an ack from the NN, it shouldn't assume the arrival/processing order (especially when re-establishing a connection). Still, I'm curious how the interleaving happened. Any thoughts?
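          For reference, a fair ReentrantReadWriteLock is constructed as below (this is just the standard JDK API, not HDFS code); fairness only gives approximately arrival-order entry per lock acquisition, it does not make a whole block report atomic:

              import java.util.concurrent.locks.ReentrantReadWriteLock;

              public class FairLockExample {
                public static void main(String[] args) {
                  // 'true' selects fair mode: waiting writers acquire the lock in roughly
                  // arrival order, which governs the order in which handler threads enter
                  // the fsn write lock for each storage report.
                  ReentrantReadWriteLock fsnLock = new ReentrantReadWriteLock(true);
                  fsnLock.writeLock().lock();
                  try {
                    // ... process one storage report under the write lock ...
                  } finally {
                    fsnLock.writeLock().unlock();
                  }
                }
              }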

          walter.k.su Walter Su added a comment -

          Oh, I see. In this case, the reports are not split across RPCs. And because the for-loop is outside the lock, the two for-loops interleave.

          // NN-side loop over the storage reports of one block report RPC; each iteration
          // takes and releases the namesystem lock, so two concurrent reports can interleave.
          for (int r = 0; r < reports.length; r++) {
          
          walter.k.su Walter Su added a comment -

          Uploaded a patch. Kindly review.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 11s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 3 new or modified test files.
          +1 mvninstall 6m 29s trunk passed
          +1 compile 0m 38s trunk passed with JDK v1.8.0_77
          +1 compile 0m 41s trunk passed with JDK v1.7.0_95
          +1 checkstyle 0m 23s trunk passed
          +1 mvnsite 0m 50s trunk passed
          +1 mvneclipse 0m 13s trunk passed
          +1 findbugs 1m 51s trunk passed
          +1 javadoc 1m 5s trunk passed with JDK v1.8.0_77
          +1 javadoc 1m 43s trunk passed with JDK v1.7.0_95
          +1 mvninstall 0m 46s the patch passed
          +1 compile 0m 38s the patch passed with JDK v1.8.0_77
          -1 javac 6m 15s hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77 with JDK v1.8.0_77 generated 1 new + 32 unchanged - 1 fixed = 33 total (was 33)
          +1 javac 0m 38s the patch passed
          +1 compile 0m 39s the patch passed with JDK v1.7.0_95
          -1 javac 6m 54s hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95 with JDK v1.7.0_95 generated 1 new + 34 unchanged - 1 fixed = 35 total (was 35)
          +1 javac 0m 39s the patch passed
          +1 checkstyle 0m 22s the patch passed
          +1 mvnsite 0m 49s the patch passed
          +1 mvneclipse 0m 11s the patch passed
          +1 whitespace 0m 0s Patch has no whitespace issues.
          -1 findbugs 2m 15s hadoop-hdfs-project/hadoop-hdfs generated 1 new + 0 unchanged - 0 fixed = 1 total (was 0)
          +1 javadoc 1m 4s the patch passed with JDK v1.8.0_77
          +1 javadoc 1m 46s the patch passed with JDK v1.7.0_95
          -1 unit 55m 36s hadoop-hdfs in the patch failed with JDK v1.8.0_77.
          -1 unit 52m 52s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          +1 asflicense 0m 21s Patch does not generate ASF License warnings.
          133m 26s



          Reason Tests
          FindBugs module:hadoop-hdfs-project/hadoop-hdfs
            Synchronization performed on java.util.concurrent.ArrayBlockingQueue in org.apache.hadoop.hdfs.server.blockmanagement.BlockManager$BlockReportProcessingThread.enqueue(List) At BlockManager.java:org.apache.hadoop.hdfs.server.blockmanagement.BlockManager$BlockReportProcessingThread.enqueue(List) At BlockManager.java:[line 4495]
          JDK v1.8.0_77 Failed junit tests hadoop.hdfs.shortcircuit.TestShortCircuitLocalRead
            hadoop.hdfs.server.namenode.ha.TestRetryCacheWithHA
            hadoop.hdfs.TestSafeMode
            hadoop.hdfs.server.namenode.TestNamenodeRetryCache
            hadoop.hdfs.server.datanode.fsdataset.impl.TestFsDatasetImpl
          JDK v1.7.0_95 Failed junit tests hadoop.hdfs.shortcircuit.TestShortCircuitLocalRead
            hadoop.hdfs.server.namenode.ha.TestRetryCacheWithHA
            hadoop.hdfs.server.namenode.TestNamenodeRetryCache
            hadoop.hdfs.TestHFlush
            hadoop.hdfs.server.namenode.ha.TestPendingCorruptDnMessages



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:fbe3e86
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12799446/HDFS-10301.01.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 7a855cb48a3c 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / be0bce1
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_77 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          javac hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77: https://builds.apache.org/job/PreCommit-HDFS-Build/15194/artifact/patchprocess/diff-compile-javac-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77.txt
          javac hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95: https://builds.apache.org/job/PreCommit-HDFS-Build/15194/artifact/patchprocess/diff-compile-javac-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          findbugs https://builds.apache.org/job/PreCommit-HDFS-Build/15194/artifact/patchprocess/new-findbugs-hadoop-hdfs-project_hadoop-hdfs.html
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15194/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15194/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HDFS-Build/15194/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77.txt https://builds.apache.org/job/PreCommit-HDFS-Build/15194/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/15194/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/15194/console
          Powered by Apache Yetus 0.2.0 http://yetus.apache.org

          This message was automatically generated.

          cmccabe Colin P. McCabe added a comment - edited

          Thanks for the bug report. This is a tricky one.

          One small correction-- HDFS-7960 was not introduced as part of DataNode hotswap. It was originally introduced to solve issues caused by HDFS-7575, although it fixed issues with hotswap as well.

          It seems like we should be able to remove existing DataNode storage report RPCs with the old ID from the queue when we receive one with a new block report ID. This would also avoid a possible congestion collapse scenario caused by repeated retransmissions after the timeout.
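          A rough sketch of that idea (not from any attached patch; the queued element type, field names, and the assumption that block report IDs from a single DataNode increase monotonically are all hypothetical):

              import java.util.concurrent.ArrayBlockingQueue;
              import java.util.concurrent.BlockingQueue;

              public class QueueDedupSketch {
                // Hypothetical wrapper for a queued storage-report task.
                static class QueuedReport {
                  final String datanodeUuid;
                  final long blockReportId;
                  QueuedReport(String dn, long id) { datanodeUuid = dn; blockReportId = id; }
                }

                private final BlockingQueue<QueuedReport> queue = new ArrayBlockingQueue<>(1024);

                // On arrival of a new report, drop any still-queued reports from the same
                // DataNode that carry an older block report ID, then enqueue the new one.
                // Sketch only: concurrent enqueues for the same DataNode would need extra
                // coordination in a real implementation.
                void enqueueLatest(QueuedReport incoming) throws InterruptedException {
                  queue.removeIf(q -> q.datanodeUuid.equals(incoming.datanodeUuid)
                      && q.blockReportId < incoming.blockReportId);
                  queue.put(incoming);
                }
              }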

          shv Konstantin Shvachko added a comment -

          Hey Walter, your patch looks good by itself, but it does not address the bug in the zombie storage recognition.
          It took me some time to review your patch; it would have been easier if you had explained your approach.
          So your patch is reordering block reports for different storages in such a way that storages from the same report are placed as a contiguous segment in the block report queue, so that processing of different BRs is not interleaved. This addresses Daryn's comment rather than solving the reported bug, as BTW Daryn correctly stated.
          If you want to go forward with reordering of BRs you should probably do it in another issue. I personally am not a supporter because

          1. It introduces an unnecessary restriction on the order of execution of block reports, and
          2. adds even more complexity to BR processing logic.

          The main problem I see here is that block reports used to be idempotent per storage, but HDFS-7960 made the execution for a subsequent storage dependent on the state produced during the execution of the previous ones. I think idempotence is good, and we should keep it. I think we can mitigate the problem by one of the following:

          1. Changing the criteria of zombie storage recognition. Why should it depend on block report IDs?
          2. Eliminating the notion of zombie storage altogether. E.g., the NN can ask the DN to run the DirectoryScanner if the NN thinks the DN's state is outdated.
          3. Trying to move curBlockReportId from DatanodeDescriptor to StorageInfo, which would eliminate global state shared between storages (a rough sketch follows this list).
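          A rough sketch of what option 3 could look like (hypothetical names, not from any attached patch): the report ID is tracked per storage, so processing one report can never invalidate storages that belong to a different, still-in-flight report.

              // Per-storage bookkeeping instead of a DataNode-wide curBlockReportId.
              class StorageInfoSketch {
                private long lastBlockReportId;

                void receivedBlockReport(long blockReportId) {
                  this.lastBlockReportId = blockReportId;   // updated only for THIS storage
                }

                long getLastBlockReportId() {
                  return lastBlockReportId;
                }
              }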

          Also, if we cannot come up with a quick solution, then we should probably roll back HDFS-7960 for now and revisit it later, because this is a critical bug affecting all of our latest releases. And that is a lot of clusters and PBs out there.

          cmccabe Colin P. McCabe added a comment -

          Hmm. This is a challenging one. Walter Su, I think I agree that the queue added in HDFS-9198 might be part of the problem here. In CDH, we haven't yet backported the deferred queuing stuff implemented in HDFS-9198, which might explain why we never saw this. Since we don't have a queue, and since NN RPCs are almost always handled in the order they arrive, CDH5 doesn't implement "reordering" of resent storage reports.

          Independently of this bug, I do think it's concerning that the DN keeps piling on retransmissions of FBRs even before the old ones were processed and acknowledged. This kind of behavior will obviously lead to congestion collapse if congestion is what caused the original FBRs to be processed but not acknowledged.

              // BlockReportProcessingThread.enqueue(List) as proposed in the 01 patch:
              void enqueue(List<Runnable> actions) throws InterruptedException {
                synchronized (queue) {
                  for (Runnable action : actions) {
                    if (!queue.offer(action)) {
                      if (!isAlive() && namesystem.isRunning()) {
                        ExitUtil.terminate(1, getName() + " is not running");
                      }
                      long now = Time.monotonicNow();
                      if (now - lastFull > 4000) {
                        lastFull = now;
                        LOG.info("Block report queue is full");
                      }
                      queue.put(action);
                    }
                  }
                }
              }
          

          This is going to be problematic when contention gets high, because threads will spend a long time waiting to enter the synchronized (queue) section. And this will not be logged or reflected back to the admin in any way. Unfortunately, the operation that you want here, the ability to atomically add a bunch of items to the BlockingQueue, simply is not provided by BlockingQueue. The solution also seems somewhat brittle since reordering could happen because of network issues in a multi-RPC BlockReport.
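          For what it's worth, one generic way to get an atomic "put a whole batch" operation is to guard a plain deque with an explicit lock instead of relying on BlockingQueue. This is only a sketch under that assumption (it also assumes every batch fits within the capacity), not code from any patch:

              import java.util.ArrayDeque;
              import java.util.Deque;
              import java.util.List;
              import java.util.concurrent.locks.Condition;
              import java.util.concurrent.locks.ReentrantLock;

              public class AtomicBatchQueueSketch<T> {
                private static final int CAPACITY = 1024;
                private final Deque<T> queue = new ArrayDeque<>();
                private final ReentrantLock lock = new ReentrantLock();
                private final Condition notFull = lock.newCondition();
                private final Condition notEmpty = lock.newCondition();

                // Blocks until the entire batch fits, then inserts it contiguously.
                public void putAll(List<T> batch) throws InterruptedException {
                  lock.lockInterruptibly();
                  try {
                    while (queue.size() + batch.size() > CAPACITY) {
                      notFull.await();
                    }
                    queue.addAll(batch);
                    notEmpty.signalAll();
                  } finally {
                    lock.unlock();
                  }
                }

                public T take() throws InterruptedException {
                  lock.lockInterruptibly();
                  try {
                    while (queue.isEmpty()) {
                      notEmpty.await();
                    }
                    T item = queue.poll();
                    notFull.signalAll();
                    return item;
                  } finally {
                    lock.unlock();
                  }
                }
              }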

          I'm thinking about this a little more, and it seems like the root of the problem is that in the single-RPC case, we're throwing away the information about how many storages were in the original report. We need to find a way to include that information in there...

          cmccabe Colin P. McCabe added a comment -

          I have posted a new patch as HDFS-10301.002.patch. The idea here is that we know the number of storage reports we expect to see in the block report. We should not be removing any storages as zombies unless we have seen this number of storages and marked these storages with the ID of the latest block report.

          I feel that this approach is better than the one used in 001.patch, since it correctly handles the "interleaved" case. It is very difficult to prove that we can never get interleaved storage reports for the DataNode. This is because of issues like queuing inside the RPC system, packets getting reordered or delayed by the network, and queuing inside the deferred work mechanism added by HDFS-9198. So we should handle this case correctly.
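          A rough, single-threaded sketch of that approach (hypothetical names; the actual patch keeps this bookkeeping inside the NameNode's per-DataNode structures): zombie pruning only runs once every expected storage has been seen under the latest block report ID.

              import java.util.HashMap;
              import java.util.Map;

              public class ZombiePruneGuardSketch {
                private long curBlockReportId;
                private int expectedStorages;   // number of storage reports in the block report
                private int storagesSeen;
                private final Map<String, Long> lastReportIdPerStorage = new HashMap<>();

                void startReport(long brId, int totalStorages) {
                  curBlockReportId = brId;
                  expectedStorages = totalStorages;
                  storagesSeen = 0;
                }

                void processStorage(String storageId, long brId) {
                  if (brId != curBlockReportId) {
                    return;                     // interleaved/stale report: skip zombie accounting
                  }
                  lastReportIdPerStorage.put(storageId, brId);
                  storagesSeen++;
                  if (storagesSeen == expectedStorages) {
                    pruneZombies();             // safe: every live storage carries brId by now
                  }
                }

                private void pruneZombies() {
                  lastReportIdPerStorage.values().removeIf(id -> id != curBlockReportId);
                }
              }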

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 12s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 1 new or modified test files.
          +1 mvninstall 6m 39s trunk passed
          +1 compile 0m 43s trunk passed with JDK v1.8.0_77
          +1 compile 0m 41s trunk passed with JDK v1.7.0_95
          +1 checkstyle 0m 23s trunk passed
          +1 mvnsite 0m 52s trunk passed
          +1 mvneclipse 0m 13s trunk passed
          +1 findbugs 1m 55s trunk passed
          +1 javadoc 1m 7s trunk passed with JDK v1.8.0_77
          +1 javadoc 1m 51s trunk passed with JDK v1.7.0_95
          +1 mvninstall 0m 45s the patch passed
          +1 compile 0m 37s the patch passed with JDK v1.8.0_77
          +1 javac 0m 37s the patch passed
          +1 compile 0m 38s the patch passed with JDK v1.7.0_95
          +1 javac 0m 38s the patch passed
          -1 checkstyle 0m 21s hadoop-hdfs-project/hadoop-hdfs: patch generated 3 new + 217 unchanged - 0 fixed = 220 total (was 217)
          +1 mvnsite 0m 48s the patch passed
          +1 mvneclipse 0m 11s the patch passed
          +1 whitespace 0m 0s Patch has no whitespace issues.
          +1 findbugs 2m 10s the patch passed
          +1 javadoc 1m 4s the patch passed with JDK v1.8.0_77
          +1 javadoc 1m 45s the patch passed with JDK v1.7.0_95
          -1 unit 60m 10s hadoop-hdfs in the patch failed with JDK v1.8.0_77.
          -1 unit 59m 18s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          +1 asflicense 0m 23s Patch does not generate ASF License warnings.
          144m 44s



          Reason Tests
          JDK v1.8.0_77 Failed junit tests hadoop.hdfs.server.datanode.TestFsDatasetCache
          JDK v1.7.0_95 Failed junit tests hadoop.hdfs.shortcircuit.TestShortCircuitCache
            hadoop.hdfs.server.namenode.TestEditLog
            hadoop.hdfs.server.datanode.TestFsDatasetCache



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:fbe3e86
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12800061/HDFS-10301.002.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 92e06b70f89b 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / a749ba0
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_77 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/15245/artifact/patchprocess/diff-checkstyle-hadoop-hdfs-project_hadoop-hdfs.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15245/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15245/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HDFS-Build/15245/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77.txt https://builds.apache.org/job/PreCommit-HDFS-Build/15245/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/15245/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/15245/console
          Powered by Apache Yetus 0.2.0 http://yetus.apache.org

          This message was automatically generated.

          cmccabe Colin P. McCabe added a comment -

          Added a unit test.

          walter.k.su Walter Su added a comment -

          Enabling HDFS-9198 will fifo process BRs. It doesn't solve this implementation bug but virtually eliminates it from occurring.

          This addresses Daryn's comment rather than solving the reported bug, as BTW Daryn correctly stated.

          That's incorrect. Please run the test in the 001 patch with and without the fix; you'll see the difference. It does solve the issue, because:

          The bug only exists when the reports are contained in one RPC. If they are split into multiple RPCs, it's not a problem, because the rpcsSeen guard prevents it from happening. So my approach is to process the reports contained in one RPC contiguously, by putting them into the queue atomically.

          walter.k.su Walter Su added a comment -

          The handler threads will wait anyway, either on the queue monitor or on the fsn writeLock. The queue processing thread will contend for the fsn writeLock. In the end, there's no difference.

          walter.k.su Walter Su added a comment -

          I like your idea of counting storages with the same reportId, and not purging if there's any interleaving. I guess rpcsSeen can be removed or replaced by storagesSeen?

          Processing the retransmitted reports is a waste of resources. I think the best approach is, as Colin said, "to remove existing DataNode storage report RPCs with the old ID from the queue when we receive one with a new block report ID." Let's consider that as an optimization in another JIRA.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 15s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 2 new or modified test files.
          +1 mvninstall 8m 52s trunk passed
          +1 compile 1m 9s trunk passed with JDK v1.8.0_77
          +1 compile 0m 54s trunk passed with JDK v1.7.0_95
          +1 checkstyle 0m 25s trunk passed
          +1 mvnsite 1m 10s trunk passed
          +1 mvneclipse 0m 17s trunk passed
          +1 findbugs 2m 22s trunk passed
          +1 javadoc 1m 27s trunk passed with JDK v1.8.0_77
          +1 javadoc 2m 26s trunk passed with JDK v1.7.0_95
          +1 mvninstall 1m 1s the patch passed
          +1 compile 1m 0s the patch passed with JDK v1.8.0_77
          +1 javac 1m 0s the patch passed
          +1 compile 0m 50s the patch passed with JDK v1.7.0_95
          +1 javac 0m 50s the patch passed
          -1 checkstyle 0m 23s hadoop-hdfs-project/hadoop-hdfs: patch generated 3 new + 217 unchanged - 0 fixed = 220 total (was 217)
          +1 mvnsite 1m 3s the patch passed
          +1 mvneclipse 0m 13s the patch passed
          +1 whitespace 0m 0s Patch has no whitespace issues.
          +1 findbugs 2m 27s the patch passed
          +1 javadoc 1m 26s the patch passed with JDK v1.8.0_77
          +1 javadoc 2m 3s the patch passed with JDK v1.7.0_95
          -1 unit 94m 38s hadoop-hdfs in the patch failed with JDK v1.8.0_77.
          -1 unit 90m 50s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          +1 asflicense 0m 26s Patch does not generate ASF License warnings.
          218m 14s



          Reason Tests
          JDK v1.8.0_77 Failed junit tests hadoop.hdfs.TestDFSUpgradeFromImage
            hadoop.hdfs.server.namenode.web.resources.TestWebHdfsDataLocality
            hadoop.hdfs.server.namenode.ha.TestEditLogTailer
            hadoop.hdfs.server.datanode.TestDataNodeMultipleRegistrations
            hadoop.hdfs.security.TestDelegationTokenForProxyUser
            hadoop.hdfs.server.datanode.TestDataNodeVolumeFailure
            hadoop.hdfs.TestRollingUpgrade
            hadoop.hdfs.server.namenode.TestEditLog
            hadoop.hdfs.server.datanode.TestDirectoryScanner
          JDK v1.7.0_95 Failed junit tests hadoop.hdfs.server.namenode.TestNameNodeMetadataConsistency
            hadoop.hdfs.server.datanode.TestDataNodeMultipleRegistrations
            hadoop.hdfs.server.datanode.TestDataNodeVolumeFailure
            hadoop.hdfs.server.namenode.TestEditLog
            hadoop.hdfs.server.datanode.TestDataNodeHotSwapVolumes
            hadoop.hdfs.TestFileCreationDelete
            hadoop.hdfs.server.namenode.TestDecommissioningStatus
            hadoop.hdfs.server.datanode.TestDirectoryScanner



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:fbe3e86
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12800128/HDFS-10301.003.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux e43756b29eca 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 14ab7a8
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_77 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/15249/artifact/patchprocess/diff-checkstyle-hadoop-hdfs-project_hadoop-hdfs.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15249/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15249/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HDFS-Build/15249/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77.txt https://builds.apache.org/job/PreCommit-HDFS-Build/15249/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/15249/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/15249/console
          Powered by Apache Yetus 0.2.0 http://yetus.apache.org

          This message was automatically generated.

          cmccabe Colin P. McCabe added a comment -

          Yeah, perhaps we should file a follow-on JIRA to optimize by removing the storage reports with an older ID when a newer one was received. The challenge will be implementing it efficiently-- we probably need to move away from BlockingQueue and towards something with our own locking. And probably something other than plain Runnables.

          shv Konstantin Shvachko added a comment -

          Hey Walter Su, sorry let me rephrase my wording.
          Your patch "eliminates" current behavior, but it doesn't directly address the "implementation bug" in zombie storage detection.
          So your patch is correct and you did solve the problem, but indirectly, by reordering reports from the same RPC into a contiguous segment, which in a sense validates the zombie storage bug.
          Also you add a stronger requirement for block report processing: that entire block-report RPCs must be processed in the order they are received, vs. the current requirement that individual storage reports must be processed in the order received. The latter allows interleaving, while yours does not, and your unit test enforces the new order. I am in favor of weaker requirements when possible.
          Hope this makes more sense.

          shv Konstantin Shvachko added a comment -

          Colin P. McCabe I don't think that HDFS-9198 breaks this issue. I am seeing this in 2.6.1, which also doesn't have the queues. You may not observe this behavior until you put substantial write load on the NameNode. Looking at your patch now.

          jingzhao Jing Zhao added a comment -

          As an optimization, maybe we can have a retry-cache-like mechanism on the NN for block reports? We can let the retry of the same FBR share the same block report ID, and based on the lease ID and the report ID the NN can recognize the retry. Then instead of always putting the detailed reports into the queue, we can either directly return the existing response (if the first request has been processed), or let the retry wait until the current on-going FBR processing finishes.
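
          A minimal sketch of such a retry cache, keyed by DataNode and block report ID, could look like the following. The class, method, and field names are invented for illustration (this is not existing NameNode code), and a real implementation would also need eviction and the wait-for-completion behaviour described above.

            import java.util.Map;
            import java.util.concurrent.ConcurrentHashMap;

            /** Hypothetical sketch of a retry cache for full block reports. */
            public class BlockReportRetryCache {
              enum State { IN_PROGRESS, DONE }

              private final Map<String, State> reportsByKey = new ConcurrentHashMap<>();

              private static String key(String datanodeUuid, long blockReportId) {
                return datanodeUuid + ":" + Long.toHexString(blockReportId);
              }

              /** True if this (datanode, reportId) pair is new and should be processed. */
              public boolean shouldProcess(String datanodeUuid, long blockReportId) {
                return reportsByKey.putIfAbsent(
                    key(datanodeUuid, blockReportId), State.IN_PROGRESS) == null;
              }

              /** Marks a report finished so a later retry can reuse the earlier outcome. */
              public void markDone(String datanodeUuid, long blockReportId) {
                reportsByKey.put(key(datanodeUuid, blockReportId), State.DONE);
              }
            }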

          shv Konstantin Shvachko added a comment -

          Colin I ran your unit test and verified that it fails on the current code base, but succeeds with your patch.
          Looking at the patch. Counting storagesSeen does work for your test case. But it is somewhat confusing, as the count changes with interleaving reports.
          Suppose you have 3 storages (s1, s2, s3) and two block reports br1, br2 interleaving in the following way:

          reportId-storage storagesSeen
          br1-s1 0
          br1-s2 1
          br2-s1 0
          br2-s2 1
          br1-s3 0

          The last line is confusing, because it should have been 2, but it is 0 since br2 overrode lastBlockReportId for s1 and s2.
          This brought me to an idea. BR ids are monotonically increasing. What if in BlockManager.processReport() (before processing but under the lock) we check lastBlockReportId for all storages, and if we see one greater than context.getReportId() we throw an IOException indicating that the next block report is in progress and we do not need to continue with this one. The exception is not expected to be passed back to the DataNode, as it has already timed out, but even if it gets passed, the DataNode will just send another block report.
          I think this could be a simple fix for this jira, and we can discuss other approaches to zombie storage detection in the next issue. Current approach seems to be error prone. One way is to go with the retry cache as Jing Zhao suggested. Or there could be other ideas.
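
          A minimal sketch of that check, under the stated assumption that report IDs are monotonically increasing, might look like the following. DatanodeStorageInfoView and its getters are stand-ins for the real NameNode structures, and the check is assumed to run under the namesystem lock before any storage of the report is processed.

            import java.io.IOException;

            /** Hypothetical sketch of the "reject superseded reports" check. */
            public class StaleBlockReportCheck {
              interface DatanodeStorageInfoView {
                long getLastBlockReportId();
                String getStorageID();
              }

              static void checkNotSuperseded(Iterable<DatanodeStorageInfoView> storages,
                                             long currentReportId) throws IOException {
                for (DatanodeStorageInfoView s : storages) {
                  // If any storage was already touched by a newer report, this report
                  // is a stale retransmission and processing can stop here.
                  if (s.getLastBlockReportId() > currentReportId) {
                    throw new IOException("Block report 0x"
                        + Long.toHexString(currentReportId)
                        + " is superseded by a newer report in progress (storage "
                        + s.getStorageID() + ")");
                  }
                }
              }
            }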

          walter.k.su Walter Su added a comment -

          Thank you for your explanation. I learned a lot.

          walter.k.su Walter Su added a comment -

          BR ids are monotonically increasing.

          The id values are random initially; if the counter starts with a large value it could overflow after a long run. If the DN restarts, the value is randomized again. We should be careful that the NN does not end up rejecting all following BRs.
          If a BR is split into multiple RPCs, there is no interleaving naturally, because the DN gets the ack before it sends the next RPC. Interleaving only exists if the BR is not split. I agree the bug needs to be fixed from the inside; it's just that eliminating interleaving for good may not be a bad idea, as it simplifies the problem, and is also a simple workaround for this jira.

          cmccabe Colin P. McCabe added a comment - - edited

          Konstantin Shvachko wrote: The last line is confusing, because it should have been 2, but it is 0 since br2 overrode lastBlockReportId for s1 and s2.

          It's OK for it to be 0 here. It just means that we will not do the zombie storage elimination for these particular full block reports. Remember that interleaved block reports are an extremely rare case, and so are zombie storages. We can wait for the next FBR to do the zombie elimination.

          I think this could be a simple fix for this jira, and we can discuss other approaches to zombie storage detection in the next issue. Current approach seems to be error prone. One way is to go with the retry cache as Jing Zhao suggested. Or there could be other ideas.

          The problem with a retry cache is that it uses up memory. We don't have an easy way to put an upper bound on the amount of memory that we need, except through adding complex logic to limit the number of full block reports accepted for a specific DataNode in a given time period.

          This brought me to an idea. BR ids are monotonically increasing...

          The code for generating block report IDs is here:

            private long generateUniqueBlockReportId() {
              // Initialize the block report ID the first time through.
              // Note that 0 is used on the NN to indicate "uninitialized", so we should
              // not send a 0 value ourselves.
              prevBlockReportId++;
              while (prevBlockReportId == 0) {
                prevBlockReportId = ThreadLocalRandom.current().nextLong();
              }     
              return prevBlockReportId;
            } 
          

          It's not monotonically increasing in the case where rollover occurs. While this is an extremely rare case, the consequences of getting it wrong would be extremely severe. So this might be possible as an incompatible change, but not a change in branch-2.

          Edit: another reason not to do this is because on restart, the DN could get a number lower than its previous one. We can't use IDs as epoch numbers unless we actually persist them to disk, like Paxos transaction IDs or HDFS edit log IDs.
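
          A tiny illustration of those two points (signed-long wraparound, and re-seeding after a restart); the class name is made up and this is demonstration code only, not anything in the DataNode.

            import java.util.concurrent.ThreadLocalRandom;

            public class ReportIdWraparound {
              public static void main(String[] args) {
                long prevBlockReportId = Long.MAX_VALUE;
                prevBlockReportId++;                    // wraps to Long.MIN_VALUE
                System.out.println(prevBlockReportId);  // prints -9223372036854775808

                // After a restart the counter is re-seeded randomly, so the "new" ID
                // can compare lower than any ID sent before the restart.
                long reseeded = ThreadLocalRandom.current().nextLong();
                System.out.println(reseeded);
              }
            }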

          Walter Su wrote: If a BR is split into multiple RPCs, there is no interleaving naturally, because the DN gets the ack before it sends the next RPC. Interleaving only exists if the BR is not split. I agree the bug needs to be fixed from the inside; it's just that eliminating interleaving for good may not be a bad idea, as it simplifies the problem, and is also a simple workaround for this jira.

          We don't document anywhere that interleaving doesn't occur. We don't have unit tests verifying that it doesn't occur, and if we did, those unit tests might accidentally pass because of race conditions. Even if we eliminated interleaving for now, anyone changing the RPC code or the queuing code could easily re-introduce interleaving and this bug would come back. That's why I agree with Konstantin Shvachko-- we should not focus on trying to remove interleaving.

          Konstantin Shvachko wrote: I think this could be a simple fix for this jira, and we can discuss other approaches to zombie storage detection in the next issue.

          Yeah, let's get in this fix and then talk about potential improvements in a follow-on jira.

          shv Konstantin Shvachko added a comment -

          Yes, on trunk generateUniqueBlockReportId() starts from a random value. I was looking at it in a previous version where it was still nanoTime().

          Remember that interleaved block reports are an extremely rare case

          Not really. On a busy cluster with a lot of blocks to report it happens all the time.

          shv Konstantin Shvachko added a comment -

          Hey Colin, I reviewed your patch more thoroughly. There is still a problem with interleaving reports. See updateBlockReportContext(). Suppose that block reports interleave like this: <br1-s1, br2-s1, br1-s2, br2-s2>. Then br1-s2 will reset curBlockReportRpcsSeen since curBlockReportId is not the same as in the report, which will discard the bit set for s1 in br2-s1, and the count of rpcsSeen = 0 will be wrong for br2-s2. So possibly unreported (zombie) storages will not be removed. LMK if you see what I see.
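
          A toy simulation of the interleaving described here, modeling only the reset-on-new-ID behaviour stated in this comment (the field names mirror the comment, not the actual patch), shows how the bit recorded for br2-s1 gets discarded:

            import java.util.BitSet;

            public class InterleavedRpcsSeenDemo {
              static long curBlockReportId = 0;
              static BitSet curBlockReportRpcsSeen = new BitSet();

              static void updateBlockReportContext(long reportId, int rpcIndex) {
                if (reportId != curBlockReportId) {
                  curBlockReportId = reportId;
                  curBlockReportRpcsSeen = new BitSet(); // earlier bits for this id are lost
                }
                curBlockReportRpcsSeen.set(rpcIndex);
              }

              public static void main(String[] args) {
                // Interleaving <br1-s1, br2-s1, br1-s2, br2-s2>:
                updateBlockReportContext(1, 0);  // br1-s1
                updateBlockReportContext(2, 0);  // br2-s1
                updateBlockReportContext(1, 1);  // br1-s2 resets the bitset, dropping br2-s1
                updateBlockReportContext(2, 1);  // br2-s2 sees only one RPC recorded
                // Prints 1 instead of 2, so zombie elimination would be skipped for br2.
                System.out.println(curBlockReportRpcsSeen.cardinality());
              }
            }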

          shv Konstantin Shvachko added a comment -

          Maybe we should go with a different approach for this problem.

          • The problem. NameNode thinks that the reporting DN has the following set of storages <s1, s2, s3>. But the DataNode reports <s1, s2, s4>, because one of its drives was replaced, reformatted, or taken out of service. The NameNode should update the list of storages to the ones reported by the DataNode, potentially removing some of them.
          • Constraints. A single block report can be split into multiple RPCs. Within a single block-report RPC the NameNode processes each storage under a lock, but then releases and re-acquires the lock for the next storage, so that multiple RPC reports can interleave due to multi-threading.
          • Approach. The DN should report the full list of its storages in the first block-report RPC. The NameNode first cleans up unreported storages and the replicas belonging to them, then starts processing the rest of the block reports as usual (see the sketch below).
            So DataNodes explicitly report the storages that they have, which eliminates the NameNode guessing which storage is the last in the block report RPC.

          I did not look at whether any changes in the RPC message structure are needed, but I think that all the necessary fields are already present.
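
          A minimal sketch of the cleanup step in the Approach bullet above, expressed as a plain set difference. The method and parameter names are illustrative and not the actual DatanodeDescriptor API, and the real NameNode would also purge the replicas on each removed storage.

            import java.util.ArrayList;
            import java.util.HashSet;
            import java.util.List;
            import java.util.Set;

            public class UnreportedStorageCleanup {
              static List<String> removeUnreportedStorages(
                  Set<String> storagesKnownToNameNode,
                  Set<String> storagesReportedByDataNode) {
                List<String> removed = new ArrayList<>();
                // Iterate over a copy so the NameNode-side set can be mutated safely.
                for (String storageId : new HashSet<>(storagesKnownToNameNode)) {
                  if (!storagesReportedByDataNode.contains(storageId)) {
                    // The real NameNode would also remove the replicas on this storage;
                    // this sketch only records which storages were dropped.
                    storagesKnownToNameNode.remove(storageId);
                    removed.add(storageId);
                  }
                }
                return removed;
              }
            }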

          cmccabe Colin P. McCabe added a comment -

          Hey Colin, I reviewed your patch more thoroughly. There is still a problem with interleaving reports. See updateBlockReportContext(). Suppose that block reports interleave like this: <br1-s1, br2-s1, br1-s2, br2-s2>. Then br1-s2 will reset curBlockReportRpcsSeen since curBlockReportId is not the same as in the report, which will discard the bit set for s1 in br2-s1, and the count of rpcsSeen = 0 will be wrong for br2-s2. So possibly unreported (zombie) storages will not be removed. LMK if you see what I see.

          Thanks for looking at the patch. I agree that in the case of interleaving, zombie storages will not be removed. I don't consider that a problem, since we will eventually get a non-interleaved full block report that will do the zombie storage removal. In practice, interleaved block reports are extremely rare (we have never seen the problem described in this JIRA, after deploying to thousands of clusters).

          Maybe we should go with a different approach for this problem. A single block report can be split into multiple RPCs. Within a single block-report RPC the NameNode processes each storage under a lock, but then releases and re-acquires the lock for the next storage, so that multiple RPC reports can interleave due to multi-threading.

          Maybe I'm misunderstanding the proposal, but don't we already do all of this? We split block reports into multiple RPCs when the storage reports grow beyond a certain size.

          Approach. The DN should report the full list of its storages in the first block-report RPC. The NameNode first cleans up unreported storages and the replicas belonging to them, then starts processing the rest of the block reports as usual. So DataNodes explicitly report the storages that they have, which eliminates the NameNode guessing which storage is the last in the block report RPC.

          What does the NameNode do if the DataNode is restarted while sending these RPCs, so that it never gets a chance to send all the storages that it claimed existed? It seems like you will get stuck and not be able to accept any new reports. Or, you can take the same approach the current patch does, and clear the current state every time you see a new ID (but then you can't do zombie storage elimination in the presence of interleaving.)

          One approach that avoids all these problems is to avoid doing zombie storage elimination during FBRs entirely, and do it instead during DN heartbeats (for example). DN heartbeats are small messages that are never split, and their processing is not interleaved with anything.

          We agree that the current patch solves the problem of storages falsely being declared as zombies, I hope. I think that's a good enough reason to get this patch in, and then think about alternate approaches later.

          shv Konstantin Shvachko added a comment -

          Maybe I'm misunderstanding the proposal, but don't we already do all of this?

          Yes you misunderstood. This part is not my proposal. This is what we already do, and therefore I call them Constraints, because they complicate the Problem. The proposal is in the third bullet point titled Approach.

          What does the NameNode do if the DataNode is restarted while sending these RPCs, so that it never gets a chance to send all the storages that it claimed existed? It seems like you will get stuck

          No, I will not get stuck. All br-RPCs are completely independent of each other. It's just that one of them has all storages, and indicates to the NameNode that it should update its storage list for the DataNode. The NN processes as many such RPCs as the DN sends. If the DN dies the NN will declare it dead in due time, or if the DN restarts within 10 minutes it will send a new set of block reports from scratch. I do not see any inconsistencies.

          You can think of it as a new operation SyncStorages, which does just that - updates NameNode's knowledge of DN's storages. I combined this operation with the first br-RPC. One can combine it with any other call, same as you propose to combine it with the heartbeat. Except it seems a poor idea, since we don't want to wait for removal of thousands of replicas on a heartbeat.

          interleaved block reports are extremely rare

          You keep saying this. But it is not rare for me. Are you convincing me not to believe my eyes or that you checked the logs on your thousands of clusters? I did check mine.

          cmccabe Colin P. McCabe added a comment -

          You can think of it as a new operation SyncStorages, which does just that - updates NameNode's knowledge of DN's storages. I combined this operation with the first br-RPC. One can combine it with any other call, same as you propose to combine it with the heartbeat. Except it seems a poor idea, since we don't want to wait for removal of thousands of replicas on a heartbeat.

          Thanks for explaining your proposal a little bit more. I agree that enumerating all the storages in the first block report RPC is a fairly simple way to handle this, and shouldn't add too much size to the FBR. It seems like a better idea than adding it to the heartbeat, like I proposed. In the short term, however, I would prefer the current patch, since it involves no RPC changes, and doesn't require all the DataNodes to be upgraded before it can work.

          daryn Daryn Sharp added a comment -

          Still catching up and need to review the patch. First question: how is this interleaving happening on a frequent basis?

          An interesting observation (if I interpreted the logs correctly) is that processing all 4 storages with ~14k blocks/storage appears to take minutes. Tens of seconds appear to elapse between processing each storage. There is some serious contention that seems indicative of a nasty bug or a suboptimal configuration exacerbating this bug.

          Is the DN rpc timeout set to something very low? Has the number of RPC handlers been greatly increased? Are there frequent deletes of massive trees? Is there a lot of decomm'ing with a low check interval?

          daryn Daryn Sharp added a comment -

          Here's a possibly simpler approach. Throw a retriable exception if there's already a report being processed. Completely untested, based on 2.7.
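
          A minimal sketch of that idea, with placeholder names rather than the attached patch: track DataNodes whose report is currently being processed for the duration of the whole RPC, and ask a concurrent report from the same node to retry.

            import java.io.IOException;
            import java.util.Set;
            import java.util.concurrent.ConcurrentHashMap;

            public class ActiveReportTracker {
              static class RetriableReportException extends IOException {
                RetriableReportException(String msg) { super(msg); }
              }

              private final Set<String> activeReports = ConcurrentHashMap.newKeySet();

              /** Call once per block-report RPC, before processing any storage. */
              void startReport(String datanodeUuid) throws RetriableReportException {
                if (!activeReports.add(datanodeUuid)) {
                  throw new RetriableReportException("A block report from "
                      + datanodeUuid + " is already being processed");
                }
              }

              /** Call in a finally block once the whole RPC has been processed. */
              void finishReport(String datanodeUuid) {
                activeReports.remove(datanodeUuid);
              }
            }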

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 0s Docker mode activated.
          -1 patch 0m 4s HDFS-10301 does not apply to trunk. Rebase required? Wrong Branch? See https://wiki.apache.org/hadoop/HowToContribute for help.



          Subsystem Report/Notes
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12802218/HDFS-10301.sample.patch
          JIRA Issue HDFS-10301
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/15359/console
          Powered by Apache Yetus 0.2.0 http://yetus.apache.org

          This message was automatically generated.

          shv Konstantin Shvachko added a comment -

          In the short term, however, I would prefer the current patch, since it involves no RPC changes, and doesn't require all the DataNodes to be upgraded before it can work.

          • I don't think my approach requires an RPC change, since the block-report RPC message already has all required structures in place. It should require only a change to the processing logic.
          • DataNodes will indeed need to be upgraded, but only if they split their block reports into multiple RPCs, because a full report already lists all storages. But even in the multi-RPC case it will only mean that zombie storages are not removed until the DataNodes are upgraded.
          • Colin, it would have been good to have an interim solution, but it does not seem reasonable to commit a patch which fixes one bug while introducing another.
            I traced back a series of jiras related to this problem. It looks like multiple storages were not thoroughly thought through in the beginning, and people have been solving problems as they appeared for a while. It feels like time for the right fix.
          shv Konstantin Shvachko added a comment -

          Daryn, I also proposed throwing an exception if there is a block report already in progress as an interim fix.
          But in your patch you add the current node to activeReports before processing each storage, and then remove it right after. So for the next storage activeReports will be empty, whether the reports are interleaving or not, and the exception is never thrown. LMK if I missed something.
          Also, your patch looks to be against 2.7 and does not apply to trunk.

          cmccabe Colin P. McCabe added a comment -

          Thanks for looking at this, Daryn Sharp. I'm not sure about the approach you proposed, though. If interleaved full block reports really are very common for Konstantin Shvachko, it seems like throwing an exception when these are received would be problematic. It sounds like there might be some implementation concerns as well, although I didn't look at the patch.

          Konstantin Shvachko wrote: I don't think my approach requires an RPC change, since the block-report RPC message already has all required structures in place. It should require only a change to the processing logic.

          Just to be clear. If what is being sent over the wire is changing, I would consider that an "RPC change." We can create an RPC change without modifying the .proto file-- for example, by choosing not to fill in some optional field, or filling in some other field.

          Colin, it would have been good to have an interim solution, but it does not seem reasonable to commit a patch which fixes one bug while introducing another.

          The patch doesn't introduce any bugs. It does mean that we won't remove zombie storages when interleaved block reports are received. But we are not handling this correctly right now either, so that is not a regression.

          Like I said earlier, I think your approach is a good one, but I think we should get in the patch I posted here. It is a very small and non-disruptive change which doesn't alter what is sent over the wire. It can easily be backported to stable branches. Why don't we commit this patch, and then work on a follow-on with the RPC change and simplification that you proposed?

          Show
          cmccabe Colin P. McCabe added a comment - Thanks for looking at this, Daryn Sharp . I'm not sure about the approach you proposed, though. If interleaved full block reports really are very common for Konstantin Shvachko , it seems like throwing an exception when these are received would be problematic. It sounds like there might be some implementation concerns as well, although I didn't look at the patch. Konstantin Shvachko wrote: I don't think my approach requires RPC change, since the block-report RPC message already has all required structures in place. It should require only the processing logic change. Just to be clear. If what is being sent over the wire is changing, I would consider that an "RPC change." We can create an RPC change without modifying the .proto file-- for example, by choosing not to fill in some optional field, or filling in some other field. Colin, it would have been good to have an interim solution, but it does not seem reasonable to commit a patch, which fixes one bug, while introducing another. The patch doesn't introduce any bugs. It does mean that we won't remove zombie storages when interleaved block reports are received. But we are not handling this correctly right now either, so that is not a regression. Like I said earlier, I think your approach is a good one, but I think we should get in the patch I posted here. It is a very small and non-disruptive change which doesn't alter what is sent over the wire. It can easily be backported to stable branches. Why don't we commit this patch, and then work on a follow-on with the RPC change and simplification that you proposed?
          Hide
          redvine Vinitha Reddy Gankidi added a comment -

          Assigning the ticket to myself so that I can upload a patch. Please review.

          Show
          redvine Vinitha Reddy Gankidi added a comment - Assigning the ticket to myself so that I can upload a patch. Please review.
          Hide
          redvine Vinitha Reddy Gankidi added a comment -

          I uploaded the patch HDFS-10301.004.patch. I have implemented the idea that Konstantin suggested, i.e., DNs explicitly report the storages that they have. This eliminates NN guessing which storage is the last in the block report RPC. In the case of an FBR, NameNodeRPCServer can retrieve the list of storages from the storage block report array. In the case that block reports are split, DNs send an additional StorageReportOnly RPC after sending the block reports for each individual storage. This StorageReportOnly RPC is sent as an FBR. This RPC contains all the storages that the DN has, with -1 as the number of blocks. A new enum STORAGE_REPORT_ONLY is introduced in BlockListAsLongs for this purpose.

          Zombie storage removal is triggered from the NameNodeRPCServer instead of the BlockManager since the RPCServer now has all the information required to construct the list of storages that the DN is reporting. After processing the block reports as usual, zombie storages are removed by comparing the list of storages in the block report and the list of storages that the NN is aware of for that DN.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 14s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 3 new or modified test files.
          +1 mvninstall 6m 22s trunk passed
          +1 compile 0m 42s trunk passed
          +1 checkstyle 0m 30s trunk passed
          +1 mvnsite 0m 50s trunk passed
          +1 mvneclipse 0m 12s trunk passed
          +1 findbugs 1m 38s trunk passed
          +1 javadoc 1m 4s trunk passed
          +1 mvninstall 0m 46s the patch passed
          +1 compile 0m 39s the patch passed
          +1 javac 0m 39s the patch passed
          -1 checkstyle 0m 27s hadoop-hdfs-project/hadoop-hdfs: patch generated 25 new + 371 unchanged - 7 fixed = 396 total (was 378)
          +1 mvnsite 0m 47s the patch passed
          +1 mvneclipse 0m 9s the patch passed
          +1 whitespace 0m 0s Patch has no whitespace issues.
          +1 findbugs 1m 45s the patch passed
          +1 javadoc 1m 1s the patch passed
          -1 unit 58m 26s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 23s Patch does not generate ASF License warnings.
          77m 11s



          Reason Tests
          Failed junit tests hadoop.hdfs.server.namenode.TestAddOverReplicatedStripedBlocks



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:2c91fd8
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12805798/HDFS-10301.004.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux d79544c47ce6 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / b4078bd
          Default Java 1.8.0_91
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/15531/artifact/patchprocess/diff-checkstyle-hadoop-hdfs-project_hadoop-hdfs.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15531/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          unit test logs https://builds.apache.org/job/PreCommit-HDFS-Build/15531/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/15531/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/15531/console
          Powered by Apache Yetus 0.2.0 http://yetus.apache.org

          This message was automatically generated.

          cmccabe Colin P. McCabe added a comment -

          Hi Vinitha Reddy Gankidi,

          Thanks for your interest in this. I wish I could get more people interested in this JIRA-- it has been hard to raise interest, unfortunately.

          Just to clarify, you don't need to assign a JIRA to yourself in order to post a patch or suggest a solution. In general, when someone is actively working on a patch, you should ask before reassigning their JIRAs to yourself.

          A whole separate RPC just for reporting the storages which are present seems excessive. It will add additional load to the namenode.

               if (node.leaseId == 0) {
          -      LOG.warn("BR lease 0x{} is not valid for DN {}, because the DN " +
          -               "is not in the pending set.",
          -               Long.toHexString(id), dn.getDatanodeUuid());
          -      return false;
          +      LOG.debug("DN {} is not in the pending set because BR with lease 0x{} was processed out of order",
          +          dn.getDatanodeUuid(), Long.toHexString(id));
          +      return true;
          

          The leaseId being 0 doesn't mean that the block report was processed out of order. If you manually trigger a block report with the hdfs dfsadmin -triggerBlockReport command, it will also have lease id 0. Legacy block reports will also have lease ID 0.

          In general, your solution doesn't fix the problem during upgrade and is a much bigger patch, which is why I think HDFS-10301.003.patch should be committed and the RPC changes should be done in a follow-on JIRA. I do not see us backporting RPC changes to all the stable branches.

          cmccabe Colin P. McCabe added a comment -

          Rebasing patch 003 on trunk.

          zhz Zhe Zhang added a comment -

          Colin P. McCabe Just a quick note that it's a new JIRA rule that you have to be either the assignee or a committer to attach a patch.

          cmccabe Colin P. McCabe added a comment -

          Oh, sorry! I didn't realize we had added a new rule about attaching patches.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 17s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 2 new or modified test files.
          +1 mvninstall 6m 47s trunk passed
          +1 compile 0m 44s trunk passed
          +1 checkstyle 0m 28s trunk passed
          +1 mvnsite 0m 49s trunk passed
          +1 mvneclipse 0m 11s trunk passed
          +1 findbugs 1m 37s trunk passed
          +1 javadoc 1m 6s trunk passed
          +1 mvninstall 0m 46s the patch passed
          +1 compile 0m 40s the patch passed
          +1 javac 0m 40s the patch passed
          -1 checkstyle 0m 26s hadoop-hdfs-project/hadoop-hdfs: patch generated 6 new + 293 unchanged - 0 fixed = 299 total (was 293)
          +1 mvnsite 0m 47s the patch passed
          +1 mvneclipse 0m 10s the patch passed
          +1 whitespace 0m 0s Patch has no whitespace issues.
          +1 findbugs 1m 43s the patch passed
          +1 javadoc 1m 2s the patch passed
          -1 unit 76m 2s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 19s Patch does not generate ASF License warnings.
          95m 7s



          Reason Tests
          Failed junit tests hadoop.hdfs.server.namenode.TestNameNodeMetadataConsistency
            hadoop.hdfs.TestSafeMode



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:2c91fd8
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12805931/HDFS-10301.005.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 26dc17f5173c 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 57c31a3
          Default Java 1.8.0_91
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/15548/artifact/patchprocess/diff-checkstyle-hadoop-hdfs-project_hadoop-hdfs.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15548/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          unit test logs https://builds.apache.org/job/PreCommit-HDFS-Build/15548/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/15548/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/15548/console
          Powered by Apache Yetus 0.2.0 http://yetus.apache.org

          This message was automatically generated.

          redvine Vinitha Reddy Gankidi added a comment -

          Thanks for your review Colin P. McCabe. By legacy reports do you mean block reports from DNs before the concept of leases was introduced for block reports?

          public synchronized boolean checkLease(DatanodeDescriptor dn,
                                                   long monotonicNowMs, long id) {
              if (id == 0) {
                LOG.debug("Datanode {} is using BR lease id 0x0 to bypass " +
                    "rate-limiting.", dn.getDatanodeUuid());
                return true;
              }
              NodeData node = nodes.get(dn.getDatanodeUuid());
              if (node == null) {
                LOG.info("BR lease 0x{} is not valid for unknown datanode {}",
                    Long.toHexString(id), dn.getDatanodeUuid());
                return false;
              }
              if (node.leaseId == 0) {
                LOG.warn("BR lease 0x{} is not valid for DN {}, because the DN " +
                         "is not in the pending set.",
                         Long.toHexString(id), dn.getDatanodeUuid());
                return false;
              }
          

          Isn't id equal to 0 for legacy block reports and when block reports are manually triggered? My understanding is that node.leaseId is set to zero only when the lease is removed. In my patch, the lease is removed by looking at the current rpc index in the block report context.

          if (context != null) {
                  if (context.getTotalRpcs() == context.getCurRpc() + 1) {
                    long leaseId = this.getBlockReportLeaseManager().removeLease(node);
                    BlockManagerFaultInjector.getInstance().removeBlockReportLease(node, leaseId);
                  }
          

          When storage reports are processed out of order, we may set node.leaseId=0 before all of the DN's storage reports have been processed. Therefore, we log a message and continue to process the storage report even if node.leaseId=0. Please let me know if you see any issue with this approach.

          During upgrades, we do not remove zombie storages. Once the upgrade is finalized, we go ahead and remove the zombie storages.

          if (nn.getFSImage().isUpgradeFinalized() && noStaleStorages) {
                Set<String> storageIDsInBlockReport = new HashSet<>();
                if (context.getTotalRpcs() == 1) {
                  for (StorageBlockReport report : reports) {
                    storageIDsInBlockReport.add(report.getStorage().getStorageID());
                  }
                  bm.removeZombieStorages(nodeReg, context, storageIDsInBlockReport);
                }
              }
          

          Can you please elaborate on what you meant by "In general, your solution doesn't fix the problem during upgrade". What problems do you foresee?

          I am currently investigating why the test TestAddOverReplicatedStripedBlocks failed.

          shv Konstantin Shvachko added a comment -

          Hey Colin, let's decide on the way to move forward. I do not see a point in making this change in two steps.

          • Your changes will essentially be completely removed by Vinitha's patch.
          • I do not see her patch introducing incompatible changes. So it can and should be backported through to branch 2.6.

          A thorough review is needed and will be quite helpful. I think the 004 patch covers

          • the upgrade case, that is, it works consistently for block reports from both old (pre-patch) and new (patched) DataNodes
          • the case when the entire block report is sent in a single RPC and
          • the case when block reports are split into multiple RPCs
          • the leases

          So apart from the failed test I do not see any issues. It would be good if you could take a fresh look, see if any corner cases were missed.

          cmccabe Colin P. McCabe added a comment -

          I never said that patch 004 introduced incompatible changes. I just argued that it was a bigger change than was necessary to fix the problem. All other things being equal, we would prefer a smaller change to a bigger one. The only argument you have given against my change is that it doesn't fix the problem in the case where full block reports are interleaved. But this is an extremely, extremely rare case, to the point where nobody else has even seen this problem in their cluster.

          I still think that patch 005 is an easier way to fix the problem. It's basically a simple bugfix to my original patch. However, if you want to do something more complex, I will review it. But I don't want to add any additional RPCs. We already have problems with NameNode performance and we should not be adding more RPCs when it's not needed. We can include the storage information in the first RPC of the block report as an optional field.

          redvine Vinitha Reddy Gankidi added a comment -

          I looked into why the test TestAddOverReplicatedStripedBlocks fails with patch 004. I don't completely understand why the test relies on the fact that zombie storages should be removed when the DN has stale storages. Probably the test needs to be modified. Here are my findings:

          With the patch, the test fails with the following error:

          java.lang.AssertionError: expected:<10> but was:<11>
          	at org.junit.Assert.fail(Assert.java:88)
          	at org.junit.Assert.failNotEquals(Assert.java:743)
          	at org.junit.Assert.assertEquals(Assert.java:118)
          	at org.junit.Assert.assertEquals(Assert.java:555)
          	at org.junit.Assert.assertEquals(Assert.java:542)
          	at org.apache.hadoop.hdfs.server.namenode.TestAddOverReplicatedStripedBlocks.testProcessOverReplicatedAndMissingStripedBlock(TestAddOverReplicatedStripedBlocks.java:281)
          

          In the test, DFSUtil.createStripedFile is invoked in the beginning.

           /**
             * Creates the metadata of a file in striped layout. This method only
             * manipulates the NameNode state without injecting data to DataNode.
             * You should disable periodical heartbeat before use this.
             *  @param file Path of the file to create
             * @param dir Parent path of the file
             * @param numBlocks Number of striped block groups to add to the file
             * @param numStripesPerBlk Number of striped cells in each block
             * @param toMkdir
             */
            public static void createStripedFile(MiniDFSCluster cluster, Path file, Path dir,
                int numBlocks, int numStripesPerBlk, boolean toMkdir) throws Exception {
          

          This internally calls the DFSUtil.addBlockToFile method, which mimics block reports. While processing these reports, the NameNode updates the DataNode's storages. In the test output, you can see the storages being added.

          2016-05-26 17:10:03,330 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 9505a2ad-78f4-45d7-9c13-2ecd92a06866 for DN 127.0.0.1:60835
          2016-05-26 17:10:03,331 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID d4bb2f70-4a1e-451f-9d47-a2967f819130 for DN 127.0.0.1:60839
          2016-05-26 17:10:03,332 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 841fc92f-fa15-4ced-8487-96ca4e6996d0 for DN 127.0.0.1:60844
          2016-05-26 17:10:03,332 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 304aaeeb-e2d0-4427-81c6-c79e4d0b6a4e for DN 127.0.0.1:60849
          2016-05-26 17:10:03,332 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 2d046d66-26fc-448f-938c-04dda2ecf34a for DN 127.0.0.1:60853
          2016-05-26 17:10:03,333 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 381d3151-e75e-434a-86f8-da5c83f22b19 for DN 127.0.0.1:60857
          2016-05-26 17:10:03,333 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 71f72bc9-9c66-478f-a0d7-3f0c7fc23964 for DN 127.0.0.1:60861
          2016-05-26 17:10:03,333 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 4dc539f3-b7a9-4145-a313-fa99ca1dd779 for DN 127.0.0.1:60865
          2016-05-26 17:10:03,333 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 734ea366-e635-4715-97d5-196bfcdccb18 for DN 127.0.0.1:60869
          2016-05-26 17:10:03,334 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID c639de06-e85c-4e93-92d2-506a49d4e41c for DN 127.0.0.1:60835
          2016-05-26 17:10:03,343 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID a82ff231-d630-4799-907d-f0a72ff06b38 for DN 127.0.0.1:60839
          2016-05-26 17:10:03,343 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 328c3467-0507-45fd-9aac-73a38165f741 for DN 127.0.0.1:60844
          2016-05-26 17:10:03,343 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 0b2a3b7f-e065-4e9a-9908-024091393738 for DN 127.0.0.1:60849
          2016-05-26 17:10:03,344 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 3654a0ce-8389-40bf-b8d3-08cc49895a7d for DN 127.0.0.1:60853
          2016-05-26 17:10:03,344 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 8072cc31-5567-4c04-8f71-7a8ee03c2fe0 for DN 127.0.0.1:60857
          2016-05-26 17:10:03,344 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 0202860d-4aad-4996-a325-23a34f052cb2 for DN 127.0.0.1:60861
          2016-05-26 17:10:03,345 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 5415d95d-c173-4458-be78-d3fa95652589 for DN 127.0.0.1:60865
          2016-05-26 17:10:03,345 [Thread-0] INFO  blockmanagement.DatanodeDescriptor (DatanodeDescriptor.java:updateStorage(912)) - Adding new storage ID 14570c81-1dc1-4479-a65a-5b61944d4b94 for DN 127.0.0.1:60869
          2016-05-26 17:10:03,359 [IPC Server handler 9 on 60834] INFO  hdfs.StateChange (FSNamesystem.java:completeFile(2663)) - DIR* completeFile: /striped/file is closed by DFSClient_NONMAPREDUCE_865500748_10
          

          When these storages are added, lastBlockReportId is set to zero and the storages are considered stale. Since the DN doesn't know about these storages, they are not reported in the next block report, so they are treated as zombie storages and removed. One of these zombie storages has a replica. Relevant logs:

          2016-05-26 17:10:03,383 [Block report processor] WARN  blockmanagement.BlockManager (BlockManager.java:removeZombieReplicas(2239)) - processReport 0x6aedc669a6437553: removing zombie storage c639de06-e85c-4e93-92d2-506a49d4e41c, which no longer exists on the DataNode.
          2016-05-26 17:10:03,384 [Block report processor] WARN  blockmanagement.BlockManager (BlockManager.java:removeZombieReplicas(2263)) - processReport 0x6aedc669a6437553: removed 0 replicas from storage c639de06-e85c-4e93-92d2-506a49d4e41c, which no longer exists on the DataNode.
          
          2016-05-26 17:10:03,416 [Block report processor] WARN  blockmanagement.BlockManager (BlockManager.java:removeZombieReplicas(2239)) - processReport 0xf7e24bf2690ca946: removing zombie storage 0202860d-4aad-4996-a325-23a34f052cb2, which no longer exists on the DataNode.
          2016-05-26 17:10:03,416 [Block report processor] WARN  blockmanagement.BlockManager (BlockManager.java:removeZombieReplicas(2263)) - processReport 0xf7e24bf2690ca946: removed 0 replicas from storage 0202860d-4aad-4996-a325-23a34f052cb2, which no longer exists on the DataNode.
          
          2016-05-26 17:10:04,217 [Block report processor] WARN  blockmanagement.BlockManager (BlockManager.java:removeZombieReplicas(2239)) - processReport 0xe361b2d0f2b49c0c: removing zombie storage 14570c81-1dc1-4479-a65a-5b61944d4b94, which no longer exists on the DataNode.
          2016-05-26 17:10:04,219 [Block report processor] WARN  blockmanagement.BlockManager (BlockManager.java:removeZombieReplicas(2263)) - processReport 0xe361b2d0f2b49c0c: removed 1 replicas from storage 14570c81-1dc1-4479-a65a-5b61944d4b94, which no longer exists on the DataNode.
          

          In patch 004, zombie storages are not removed when there are stale storages. Are there scenarios where this could happen? Since the zombie storages are not removed and one of the zombie storages has a replica, the assertion fails. This test was introduced in HDFS-8827.

          redvine Vinitha Reddy Gankidi added a comment -

          If we do have the check for stale storages before zombie storage removal, noStaleStorages in NameNodeRpcServer should be set to true when isStorageReport is true.

          daryn Daryn Sharp added a comment -

          Catching up: in my sample patch I think the logic should have been in fsn#blockReport instead of bm#processReport. I had created it hastily, as an example.

          Now consider that we are really trying to band-aid an edge case caused by multi-storage reports. I think we should also change the DN to always send per-storage reports going forward. We had to switch to per-storage reports a long time ago to avoid all kinds of issues, particularly at startup, e.g. call queue overflows, extremely high GC rate/time, full GCs, etc.

          cmccabe Colin P. McCabe added a comment -

          Vinitha Reddy Gankidi, the reason you are having trouble with stale storages versus zombie storages is that your patch uses a separate mechanism to detect which storages exist on the DN. The existing code doesn't have this problem because the full block report itself acts as the record of which storages exist. This is one negative side effect of the more complex approach. Another is that you are transmitting the same information about which storages are present multiple times.

          Despite these negatives, I'm still willing to review a patch that uses the more complicated method as long as you don't introduce extra RPCs. I agree that we should remove a stale storage if it doesn't appear in the full listing that gets sent. Just to be clear, I am -1 on a patch which adds extra RPCs. Perhaps you can send this listing in an optional field in the first RPC.
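          As a standalone illustration of what consuming such a listing amounts to (plain Java with invented names, not HDFS code): any storage the NameNode still tracks for a DataNode that is absent from the listing the DataNode sent is a zombie.

            import java.util.Arrays;
            import java.util.HashSet;
            import java.util.List;
            import java.util.Set;

            /** Toy sketch: zombie detection from an explicit storage listing. */
            class ZombieFromListingSketch {
              /** Tracked-but-unlisted storages are zombies. */
              static Set<String> findZombies(Set<String> trackedOnNN, List<String> listedByDN) {
                Set<String> zombies = new HashSet<>(trackedOnNN);
                zombies.removeAll(new HashSet<>(listedByDN));
                return zombies;
              }

              public static void main(String[] args) {
                Set<String> tracked = new HashSet<>(Arrays.asList("DS-1", "DS-2", "DS-3"));
                List<String> listed = Arrays.asList("DS-1", "DS-2"); // DS-3 no longer exists on the DN
                System.out.println(findZombies(tracked, listed));    // prints [DS-3]
              }
            }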

          Daryn Sharp, I don't like the idea of "band-aiding" this issue rather than fixing it at the root. Throwing an exception on interleaved storage reports, or forbidding combined storage reports, seem like very brittle work-arounds that could easily be undone by someone making follow-on changes. I came up with patch 005 and the earlier patches as a very simple fix that could easily be backported. If you are interested in something simple, then please check it out... or at least give a reason for not checking it out.

          shv Konstantin Shvachko added a comment -

          Vinitha's patch adds one RPC only in the case when block reports are sent in multiple RPCs. If you choose to send the entire block report in one RPC, then it will be a single RPC call with her patch as well. It seems logical to have an extra RPC because you have chosen to split block reports into multiple RPCs. We are well aware of the NameNode performance problems.
          Could you please review the patch.

          cmccabe Colin P. McCabe added a comment -

          Vinitha's patch adds one RPC only in the case when block reports are sent in multiple RPCs.

          The case where block reports are sent in multiple RPCs is exactly the case where scalability is the most important, since it indicates that we have a large number of blocks. My patch adds no new RPCs. If we are going to take an alternate approach, it should not involve a performance regression.

          Could you please review the patch.

          I did review the patch. I suggested adding an optional field in an existing RPC rather than adding a new RPC, and stated that I was -1 on adding new RPC load to the NN.

          zhz Zhe Zhang added a comment -

          Thanks for the discussions, Colin P. McCabe, Konstantin Shvachko, Vinitha Reddy Gankidi.

          I think the challenge here is that different deployments have different levels of 1) BR split; 2) BRs interleaving; 3) zombie storages. E.g. BR split might be completely turned off in configuration, and BR interleaving heavily depends on how busy the NN is.

          a) Patch v5 (from Colin) works well when BRs rarely interleave. In the worst case, a zombie storage could remain on NN for several full-BR cycles.
          b) Patch v4 (from Vinitha) works well when BRs are rarely split (or are split into many RPCs). The worst case is where each BR is split into a small number of RPCs – if each full BR is split into n RPCs, the relative overhead is 1 / n in terms of # of RPCs (see the small calculation after this list).
          c) As Colin suggested, we can also extend the first / last RPC in a full BR with the list of storages. By doing that we add overhead to every BR RPC (it needs to mark whether it has the list). Theoretically, the worst-case overhead is adding this to an empty BR.
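          To make the 1 / n overhead in b) concrete, here is a tiny illustrative calculation (plain Java, not HDFS code):

            /** Relative RPC overhead of option b): one extra storage-listing RPC per full
             *  block report that is split into n per-storage RPCs (an unsplit report adds
             *  no extra RPC). */
            class SplitOverheadSketch {
              public static void main(String[] args) {
                int[] splitCounts = {2, 3, 6, 60};   // n = per-storage RPCs in one full BR
                for (int n : splitCounts) {
                  double overhead = 1.0 / n;         // one extra RPC relative to n BR RPCs
                  System.out.printf("n = %d -> +%.0f%% block report RPCs%n", n, overhead * 100);
                }
              }
            }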

          So overall, I think c) is the best long term solution, because its worst case scenario is the least likely to happen, and the consequence is the most tolerable. It is more complex than b) though. Given the size of the v4 patch, are we OK to go with b) (v4 patch) first and do c) as a follow-on?

          cmccabe Colin P. McCabe added a comment -

          Sorry for the slow reply. I was on vacation.

          Like I said earlier, I am -1 on patch v4 because adding new RPCs is bad for NN scalability. I also think it's a much larger patch than needed. It doesn't make sense as an interim solution.

          Why don't we commit v5 and discuss improvements in a follow-on JIRA? So far there is no concrete argument against it other than the fact that it doesn't remove zombie storages in the case where BRs are interleaved. But we already know that BR interleaving is an extremely rare corner case-- otherwise you can bet that this JIRA would have attracted a lot more attention.

          shv Konstantin Shvachko added a comment -

          Sounds like you have been on a -1 spree lately, Colin P. McCabe. Hope you are alright.

          Here is why I think we should not commit your patch.

          1. The whole approach of keeping state for block report processing on the NameNode is error-prone. It assumes at-once execution, and therefore when block reports interleave the BR-state gets messed up. In particular, the BitSet used to mark storages that have been processed can be reset multiple times during interleaving and cannot be used to count storages in the report. In the current implementation this messing-up of the BR-state leads to false-positive detection of a zombie storage and removal of a perfectly valid one (a toy illustration follows after this list).
          2. Your patch leaves the messing-up of the BR-state in place (the BitSet is still inconsistent). It only tweaks it to avoid the false-positive. It still allows false-negatives, which lead to not detecting a zombie when it actually is present.
          3. So the correct solution for the problem is to remove the BR-state altogether, which is achieved in Vinitha's patch. And if we have a better solution, why settle for a temporary work-around? It may be a bigger change, but only because it removes the invalid logic related to the BR-state.
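          To make the false positive in point 1 concrete, here is a standalone toy model (plain Java with invented names, not the actual DatanodeDescriptor/BlockManager code) of per-storage report-id stamping being clobbered by an interleaved retransmission:

            import java.util.ArrayList;
            import java.util.HashMap;
            import java.util.List;
            import java.util.Map;

            /** Toy model of per-storage block-report-id stamping on the NameNode. */
            class InterleavedReportSketch {
              private final Map<String, Long> lastReportId = new HashMap<>();

              void processStorageReport(String storageId, long reportId) {
                lastReportId.put(storageId, reportId);  // stamp the storage with this report's id
              }

              /** Storages not stamped with the sweeping report's id are declared zombies. */
              List<String> zombieSweep(long reportId) {
                List<String> zombies = new ArrayList<>();
                for (Map.Entry<String, Long> e : lastReportId.entrySet()) {
                  if (e.getValue() != reportId) {
                    zombies.add(e.getKey());
                  }
                }
                return zombies;
              }

              public static void main(String[] args) {
                InterleavedReportSketch nn = new InterleavedReportSketch();
                // Report A (0xA) times out on the DataNode and is retransmitted as report B (0xB).
                // Their per-storage RPCs interleave on the NameNode:
                nn.processStorageReport("DS-1", 0xA);
                nn.processStorageReport("DS-1", 0xB);   // the retransmission overtakes report A
                nn.processStorageReport("DS-2", 0xA);   // report A's second storage lands last
                // A sweep keyed to report B sees DS-2 stamped with 0xA and wrongly calls it
                // a zombie, even though the DataNode still has it:
                System.out.println(nn.zombieSweep(0xB)); // prints [DS-2]
              }
            }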

          It seems that you don't, or don't want to, understand the reasoning around adding a separate storage-reporting RPC call. At least, you addressed it only by repeating your -1, for the third time, and did not respond to Zhe Zhang's proposal to merge the storage-reporting RPC into one of the storage reports in the next jira.
          Given that and in order to move forward, we should look into making changes to the last BR RPC call, which should now also report all storages.

          cmccabe Colin P. McCabe added a comment -

          Konstantin Shvachko, comments about me "being on a -1 spree" are not constructive and they don't do anything to help the tone of the discussion. We've been talking about this since April and my views have been consistent the whole time. I have a solution, but I am open to other solutions as long as they don't have big disadvantages.

          The whole approach of keeping the state for the block report processing on the NameNode is error-prone. It assumes at-once execution, and therefore when block reports interleave the BR-state gets messed up. Particularly, the BitSet used to mark storages, which have been processed, can be reset during interleaving multiple times and cannot be used to count storages in the report. In current implementation the messing-up of BR-state leads to false positive detection of a zombie storage and removal of a perfectly valid one.

          Block report processing is inherently about state. It is inherently stateful. It is a mechanism for the DN to synchronize its entire block state with the block state on the NN. Interleaved block reports are very bad news, even if this bug didn't exist, because they mean that the state on the NN will go "back in time" for some storages, rather than monotonically moving forward in time. This may lead the NN to make incorrect (and potentially irreversible) decisions like deleting a replica somewhere because it appears to exist on an old stale interleaved block report. Keep in mind that these old stale interleaved FBRs will override any incremental BRs that were sent in the meantime!

          Interleaved block reports also potentially indicate that the DNs are sending new full block reports before the last ones have been processed. So either our FBR retransmission mechanism is screwed up and is spewing a firehose of FBRs at an unresponsive NameNode (making it even more unresponsive, no doubt), or the NN can't process an FBR in the extremely long FBR sending period. Both of these explanations mean that you've got a cluster which has serious, serious problems and you should fix it right now.

          That's the reason why people are not taking this JIRA as seriously as they otherwise might-- because they know that interleaved FBRs mean that something is very wrong. And you are consistently ignoring this feedback and telling us how my patch is bad because it doesn't perform zombie storage elimination when FBRs get interleaved.

          It seems that you don't or don't want to understand reasoning around adding separate storage reporting RPC call. At least you addressed it only by repeating your -1. For the third time. And did not respond to Zhe Zhang's proposal to merge the storage reporting RPC into one of the storage reports in the next jira. Given that and in order to move forward, we should look into making changes to the last BR RPC call, which should now also report all storages.

          I am fine with adding storage reporting to any of the existing FBR RPCs. What I am not fine with is adding another RPC which will create more load.

          shv Konstantin Shvachko added a comment -

          Colin, you seem to imply that I ignored some of your questions. I don't see which. Could you please formulate your question so that I could answer it, if you have any.

          redvine Vinitha Reddy Gankidi added a comment -

          I uploaded another patch (006) that is similar to 005 but doesn't add any new RPCs. Please review it.
          When block reports are split, information about the other storages on the DN is sent along with the last storage's BR RPC. TestAddOverReplicatedStripedBlocks passes with this patch since zombie storages are removed even if there are stale storages.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 32s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 7m 24s trunk passed
          +1 compile 0m 48s trunk passed
          +1 checkstyle 0m 32s trunk passed
          +1 mvnsite 1m 1s trunk passed
          +1 mvneclipse 0m 12s trunk passed
          +1 findbugs 1m 51s trunk passed
          +1 javadoc 1m 2s trunk passed
          +1 mvninstall 0m 49s the patch passed
          +1 compile 0m 41s the patch passed
          +1 javac 0m 41s the patch passed
          -1 checkstyle 0m 27s hadoop-hdfs-project/hadoop-hdfs: The patch generated 4 new + 368 unchanged - 12 fixed = 372 total (was 380)
          +1 mvnsite 0m 56s the patch passed
          +1 mvneclipse 0m 9s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 2m 5s the patch passed
          +1 javadoc 1m 3s the patch passed
          -1 unit 90m 27s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 17s The patch does not generate ASF License warnings.
          112m 3s



          Reason Tests
          Failed junit tests hadoop.hdfs.TestAsyncHDFSWithHA
            hadoop.metrics2.sink.TestRollingFileSystemSinkWithHdfs
          Timed out junit tests org.apache.hadoop.hdfs.TestLeaseRecovery2



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:e2f6409
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12810994/HDFS-10301.006.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 0eace52e9fa4 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 2449db5
          Default Java 1.8.0_91
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/15794/artifact/patchprocess/diff-checkstyle-hadoop-hdfs-project_hadoop-hdfs.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15794/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          unit test logs https://builds.apache.org/job/PreCommit-HDFS-Build/15794/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/15794/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/15794/console
          Powered by Apache Yetus 0.3.0 http://yetus.apache.org

          This message was automatically generated.

          redvine Vinitha Reddy Gankidi added a comment -

          The test failures don't seem to be introduced by the patch. These tests pass locally with the patch.

          cmccabe Colin P. McCabe added a comment -
          +        if (context.getTotalRpcs() == context.getCurRpc() + 1) {
          +          long leaseId = this.getBlockReportLeaseManager().removeLease(node);
          +          BlockManagerFaultInjector.getInstance().
          +              removeBlockReportLease(node, leaseId);
                   }
          +        LOG.debug("Processing RPC with index " + context.getCurRpc()
          +            + " out of total " + context.getTotalRpcs() + " RPCs in "
          +            + "processReport 0x" +
          +            Long.toHexString(context.getReportId()));
                 }
          

          This won't work in the presence of reordered RPCs. If the RPCs are reordered so that curRpc 1 arrives before curRpc 0, the lease will be removed and RPC 0 will be rejected.

              for (int r = 0; r < reports.length; r++) {
                final BlockListAsLongs blocks = reports[r].getBlocks();
                if (blocks != BlockListAsLongs.STORAGE_REPORT_ONLY) {
          

          Using object equality to compare two BlockListAsLongs objects is very surprising to anyone reading the code. In general, I find the idea of overloading the block list to sometimes not be a block list to be very weird and surprising. If we are going to do it, it certainly needs a lot of comments in the code to explain what's going on. I think it would be clearer and less error-prone just to add an optional list of storage ID strings in the .proto file.

              if (nn.getFSImage().isUpgradeFinalized()) {
                Set<String> storageIDsInBlockReport = new HashSet<>();
                if (context.getTotalRpcs() == context.getCurRpc() + 1) {
                  for (StorageBlockReport report : reports) {
                    storageIDsInBlockReport.add(report.getStorage().getStorageID());
                  }
                  bm.removeZombieStorages(nodeReg, context, storageIDsInBlockReport);
                }
              }
          

          This isn't going to work in the presence of reordered RPCs, is it? If curRpc 1 appears before curRpc 0, we'll never get into this clause at all and so zombies won't be removed. Considering you are so concerned that my patch didn't solve the interleaved and/or reordered RPC case, this seems like something you should solve. I also don't understand what the rationale for ignoring zombies during an upgrade is. Keep in mind zombie storages can lead to data loss under some conditions (see HDFS-7960 for details).

          shv Konstantin Shvachko added a comment -

          Colin, let me introduce to you Vinitha Reddy Gankidi. She works on the Hadoop team at LinkedIn. This is her first encounter with the HDFS community. Let's try to make it pleasant enough that she wants to come back and work with us more.

          Considering you are so concerned that my patch didn't solve ...

          I think this grumbling is directed at me, Konstantin Shvachko. Colin, just to clarify, the last patch was submitted by Vinitha Reddy Gankidi. She is a different person, not Konstantin Shvachko. If you have issues with me, let's try to keep them separate.

          cmccabe Colin P. McCabe added a comment -

          The "you" in that sentence was targeted at you, Konstantin Shvachko. I realized that Vinitha Reddy Gankidi wrote the patch, but I spoke imprecisely. Sorry for the confusion.

          This is her first encounter with the HDFS community. Let's try to make it pleasant enough that she wants to come back and work with us more.

          To be honest, I don't think this is a very good newbie JIRA. It is clearly a very controversial issue, and it's also a very difficult piece of code with a lot of subtlety. Since you clearly have strong opinions about this JIRA, I believe it would be more appropriate for you to post patches implementing your ideas yourself. But that is up to you, of course.

          redvine Vinitha Reddy Gankidi added a comment -

          Thanks for the review Colin. I have addressed your comments below:

          This won't work in the presence of reordered RPCs. If the RPCs are reordered so that curRpc 1 arrives before curRpc 0, the lease will be removed and RPC 0 will be rejected.

          If curRpc 1 arrives before curRpc 0, the lease will be removed and node.leaseId will be set to zero. I have modified BlockReportLeaseManager to return true when node.leaseId == 0. I explained this in my previous comment:
          https://issues.apache.org/jira/browse/HDFS-10301?focusedCommentId=15299255&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15299255
          Please let me know if you see any issues with this approach.

          Using object equality to compare two BlockListAsLongs objects is very surprising to anyone reading the code.

          I uploaded a new patch (007) to address this issue. I have added a method isStorageReportOnly() to BlockListAsLongs that returns true only for the STORAGE_REPORT_ONLY BlockListAsLongs.
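          For illustration only, one way such a predicate could be structured (a sketch with an assumed class name, not the committed code): the base class answers false and the storage-report-only sentinel overrides it, so callers never compare object references directly.

            // Sketch only; the real BlockListAsLongs hierarchy is more involved.
            abstract class BlockListAsLongsSketch {
              /** @return true only for the storage-report-only sentinel. */
              public boolean isStorageReportOnly() {
                return false;
              }

              /** Sentinel sent for storages whose replicas are reported in another RPC. */
              static final BlockListAsLongsSketch STORAGE_REPORT_ONLY =
                  new BlockListAsLongsSketch() {
                    @Override
                    public boolean isStorageReportOnly() {
                      return true;
                    }
                  };
            }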

          In the upgrade case, there is no way to detect the zombie storages, since old DNs do not send information about their storages in the last RPC of the BR. In practice, hot-swapping of DN drives and upgrading the DN may not happen at the same time.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 21s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 6m 12s trunk passed
          +1 compile 0m 43s trunk passed
          +1 checkstyle 0m 29s trunk passed
          +1 mvnsite 0m 49s trunk passed
          +1 mvneclipse 0m 12s trunk passed
          +1 findbugs 1m 41s trunk passed
          +1 javadoc 0m 54s trunk passed
          +1 mvninstall 0m 47s the patch passed
          +1 compile 0m 42s the patch passed
          +1 javac 0m 42s the patch passed
          -0 checkstyle 0m 29s hadoop-hdfs-project/hadoop-hdfs: The patch generated 2 new + 368 unchanged - 12 fixed = 370 total (was 380)
          +1 mvnsite 0m 50s the patch passed
          +1 mvneclipse 0m 10s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 47s the patch passed
          +1 javadoc 0m 55s the patch passed
          -1 unit 71m 23s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 19s The patch does not generate ASF License warnings.
          90m 9s



          Reason Tests
          Failed junit tests hadoop.hdfs.server.datanode.TestBpServiceActorScheduler
            hadoop.hdfs.tools.offlineEditsViewer.TestOfflineEditsViewer
            hadoop.hdfs.server.datanode.TestLargeBlockReport
            hadoop.hdfs.server.namenode.TestEditLog



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:85209cc
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12812962/HDFS-10301.007.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 30154e0a37c6 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 0b9edf6
          Default Java 1.8.0_91
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/15898/artifact/patchprocess/diff-checkstyle-hadoop-hdfs-project_hadoop-hdfs.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15898/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/15898/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/15898/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          shv Konstantin Shvachko added a comment -

          Colin P. McCabe, this JIRA needs some action from you, because you are blocking it.

          cmccabe Colin P. McCabe added a comment -

          I apologize for the delays in reviewing. I am looking at HDFS-10301.007.patch. Is this the latest patch?

          I don't understand the purpose behind BlockListAsLongs#isStorageReportOnly. This function is never called. This state doesn't seem to be stored anywhere in what is sent over the wire, either. Is this an idea that was half-implemented, or did I miss something?

                if (blocks != BlockListAsLongs.STORAGE_REPORT_ONLY) {
          

          Again, this is comparing by object reference equality, not deep equality. This is a comment I also made in the last review that wasn't addressed.

          My comment earlier is that I didn't want to overload block reports to be storage reports. A storage report is not a kind of block report. They shouldn't be using the same protobuf objects or Java data structures. This isn't addressed in the current patch, which continues the confusing practice of using the same data structure for both.

          In the upgrade case, there is no way to detect the zombie storages since the old DNs do not send the information about the storages in the BR in the last RPC. In practice, hot-swapping of DN drives and upgrading the DN may not happen at the same time.

          The set of storages that the DN reports can change for a variety of reasons, most of which are not hotswap related. One reason is because a drive has become bad and got kicked out of the set of currently active volumes. Another reason is because the DN got taken down by the administrator, a volume got removed, and the DN was brought back up. It's rather frustrating that your patch doesn't support zombie storage removal during upgrade, and mine does, and yet Konstantin Shvachko is blocking my patch.

          redvine Vinitha Reddy Gankidi added a comment -

          I apologize for attaching the wrong patch. Thanks for pointing it out Colin P. McCabe. I uploaded the correct patch now (008) that calls the isStorageReportOnly method. Adding an optional list of storage ID strings in the .proto file would add more overhead, since these optional parameters would have to be sent with default values for all other block report RPCs in addition to the last RPC of the block report. I can add more comments in the code to explain what's going on. Thoughts?

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 18s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 7m 35s trunk passed
          +1 compile 0m 49s trunk passed
          +1 checkstyle 0m 30s trunk passed
          +1 mvnsite 0m 56s trunk passed
          +1 mvneclipse 0m 12s trunk passed
          +1 findbugs 1m 47s trunk passed
          +1 javadoc 0m 56s trunk passed
          +1 mvninstall 0m 51s the patch passed
          +1 compile 0m 45s the patch passed
          +1 javac 0m 45s the patch passed
          -0 checkstyle 0m 28s hadoop-hdfs-project/hadoop-hdfs: The patch generated 2 new + 368 unchanged - 12 fixed = 370 total (was 380)
          +1 mvnsite 0m 51s the patch passed
          +1 mvneclipse 0m 9s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 51s the patch passed
          +1 javadoc 0m 55s the patch passed
          -1 unit 61m 58s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 22s The patch does not generate ASF License warnings.
          82m 35s



          Reason Tests
          Failed junit tests hadoop.metrics2.sink.TestRollingFileSystemSinkWithHdfs
            hadoop.hdfs.server.namenode.TestEditLog
            hadoop.hdfs.TestFileChecksum



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12817774/HDFS-10301.008.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux f991214eeeeb 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / eb47163
          Default Java 1.8.0_91
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/16045/artifact/patchprocess/diff-checkstyle-hadoop-hdfs-project_hadoop-hdfs.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16045/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16045/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16045/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          shv Konstantin Shvachko added a comment -

          Reviewed latest patch. Got a few nits:

          1. In BlockManager.removeZombieStorages() you should add a check if (node == null); the node could have been deleted while we were not holding the writeLock (see the sketch after this list).
          2. The DatanodeDescriptor.removeZombieStorages() method does not need to be public; it should be package-private.
          3. Remove the empty-line change in BPServiceActor.blockReport().
            Also, the comment there is confusing; you might want to clarify it.
          4. The checkstyle warning says that either STORAGE_REPORT should be declared final or it should not be all-capital. I think final makes sense.
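          A minimal sketch of the guard from the first nit, assuming removeZombieStorages() re-resolves the DatanodeDescriptor after taking the write lock; names are illustrative, not the committed code.

            namesystem.writeLock();
            try {
              DatanodeDescriptor node = datanodeManager.getDatanode(nodeReg);
              if (node == null) {
                // The DataNode may have been removed while the write lock was not held.
                return;
              }
              // ... prune storages that are absent from the reported storage IDs ...
            } finally {
              namesystem.writeUnlock();
            }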

          Also I think that Colin P. McCabe's veto, formulated as
          I am -1 on a patch which adds extra RPCs.
          is fully addressed now. The storage report was added to the last RPC representing a single block report. The last patch does not add extra RPCs.
          So I plan to commit this three days from today, provided of course that the nits above are fixed.

          redvine Vinitha Reddy Gankidi added a comment -

          Attached a new patch (009) addressing Konstantin's comments. I cannot make STORAGE_REPORT final since it needs to be referenced from a static context. Instead, I renamed it to 'Storage_Report'.

          shv Konstantin Shvachko added a comment -

          All-capital identifiers are reserved for constants, that is, static final STORAGE_REPORT.

          redvine Vinitha Reddy Gankidi added a comment -

          I have made STORAGE_REPORT static final in the 010 patch.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 13m 6s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 7m 29s trunk passed
          +1 compile 0m 52s trunk passed
          +1 checkstyle 0m 33s trunk passed
          +1 mvnsite 0m 56s trunk passed
          +1 mvneclipse 0m 14s trunk passed
          +1 findbugs 1m 53s trunk passed
          +1 javadoc 0m 59s trunk passed
          +1 mvninstall 0m 54s the patch passed
          +1 compile 0m 49s the patch passed
          +1 javac 0m 49s the patch passed
          -0 checkstyle 0m 30s hadoop-hdfs-project/hadoop-hdfs: The patch generated 4 new + 368 unchanged - 12 fixed = 372 total (was 380)
          +1 mvnsite 0m 56s the patch passed
          +1 mvneclipse 0m 11s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 2m 1s the patch passed
          +1 javadoc 0m 54s the patch passed
          -1 unit 59m 54s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 19s The patch does not generate ASF License warnings.
          93m 49s



          Reason Tests
          Failed junit tests hadoop.hdfs.TestFileCreationDelete
            hadoop.hdfs.server.namenode.TestEditLog



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12818632/HDFS-10301.009.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 2c2af2824bb8 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / c2bcffb
          Default Java 1.8.0_91
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/16081/artifact/patchprocess/diff-checkstyle-hadoop-hdfs-project_hadoop-hdfs.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16081/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16081/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16081/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          cmccabe Colin P. McCabe added a comment -

          --- a/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockReportLeaseManager.java
          +++ b/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockReportLeaseManager.java

          @@ -308,10 +308,10 @@ public synchronized boolean checkLease(DatanodeDescriptor dn,
                 return false;
               }
               if (node.leaseId == 0) {
          -      LOG.warn("BR lease 0x{} is not valid for DN {}, because the DN " +
          -               "is not in the pending set.",
          -               Long.toHexString(id), dn.getDatanodeUuid());
          -      return false;
          +      LOG.debug("DN {} is not in the pending set because BR with "
          +              + "lease 0x{} was processed out of order",
          +          dn.getDatanodeUuid(), Long.toHexString(id));
          +      return true;
               }
          

          There are other reasons why node.leaseId might be 0, besides block reports getting processed out of order. For example, an RPC could have gotten duplicated by something in the network. Let's not change the existing error message.

                      StorageBlockReport[] lastSplitReport =
                          new StorageBlockReport[perVolumeBlockLists.size()];
                      // When block reports are split, the last RPC in the block report
                      // has the information about all storages in the block report.
                      // See HDFS-10301 for more details. To achieve this, the last RPC
                      // has 'n' storage reports, where 'n' is the number of storages in
                      // a DN. The actual block replicas are reported only for the
                      // last/n-th storage.
          

          Why do we have to use such a complex and confusing approach? Like I commented earlier, a report of the existing storages is not the same as a block report. Why are we creating BlockListAsLongs objects that aren't lists of blocks?

          There is a much simpler approach, which is just adding a list of storage IDs to the block report RPC by making a backwards-compatible protobuf change. It's really easy:

          +repeated String allStorageIds = 8;
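          As a side note on why such an addition is backwards compatible: protobuf-java returns an empty list for a repeated field the sender never set, so an old DataNode simply omits it and the NameNode must treat "empty" as "unknown" rather than "no storages". A hedged sketch of the receiving side (the getter shape follows standard protobuf codegen; the types here are placeholders, not the real DatanodeProtocol classes):

            import java.util.HashSet;
            import java.util.List;
            import java.util.Optional;
            import java.util.Set;

            class AllStorageIdsSketch {
              // Empty list means the field was never set (an old DataNode), so the
              // caller should skip zombie-storage pruning entirely.
              static Optional<Set<String>> storageIdsForPruning(List<String> allStorageIds) {
                return allStorageIds.isEmpty()
                    ? Optional.empty()
                    : Optional.of(new HashSet<>(allStorageIds));
              }
            }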
          
          hadoopqa Hadoop QA added a comment -
          +1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 24s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 7m 25s trunk passed
          +1 compile 0m 49s trunk passed
          +1 checkstyle 0m 30s trunk passed
          +1 mvnsite 0m 56s trunk passed
          +1 mvneclipse 0m 12s trunk passed
          +1 findbugs 1m 45s trunk passed
          +1 javadoc 0m 56s trunk passed
          +1 mvninstall 0m 51s the patch passed
          +1 compile 0m 44s the patch passed
          +1 javac 0m 44s the patch passed
          -0 checkstyle 0m 27s hadoop-hdfs-project/hadoop-hdfs: The patch generated 2 new + 368 unchanged - 12 fixed = 370 total (was 380)
          +1 mvnsite 0m 53s the patch passed
          +1 mvneclipse 0m 10s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 51s the patch passed
          +1 javadoc 0m 54s the patch passed
          +1 unit 58m 58s hadoop-hdfs in the patch passed.
          +1 asflicense 0m 19s The patch does not generate ASF License warnings.
          79m 20s



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12818684/HDFS-10301.010.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux d118fdcd3ae4 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 92fe2db
          Default Java 1.8.0_91
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/16082/artifact/patchprocess/diff-checkstyle-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16082/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16082/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          shv Konstantin Shvachko added a comment -

          > adding a list of storage IDs to the block report RPC by making a backwards-compatible protobuf change.

          The storage IDs are already there in the current BR protobuf. Why would you want a new field for that? You would need to duplicate all storage IDs in the case of a full block report, when it is not split into multiple RPCs. That seems confusing and inefficient to me.

          shv Konstantin Shvachko added a comment -

          Hey Vinitha Reddy Gankidi,

          1. Looks like checkstyle found two long lines in BlockManager.
          2. It also makes sense to keep the log message unchanged, as Colin P. McCabe suggested.

          I am +1 on the patch after this is fixed.

          redvine Vinitha Reddy Gankidi added a comment -

          > For example, an RPC could have gotten duplicated by something in the network.
          Colin P. McCabe, doesn't TCP ignore duplicate packets? Can you explain how this can happen? And if the RPC does get duplicated, then we shouldn't return true when node.leaseId == 0, right?

          redvine Vinitha Reddy Gankidi added a comment -

          Patch 011 fixes the two checkstyle issues and the log message.

          shv Konstantin Shvachko added a comment -

          This looks good to me. +1

          liuml07 Mingliang Liu added a comment -

          Thanks for the patch, Vinitha Reddy Gankidi. I'm catching up on all the insightful discussions here and have learned a lot.

          1. FSImage#isUpgradeFinalized is not volatile, and nn.getFSImage().isUpgradeFinalized() is called without holding the read lock in NameNodeRpcServer#blockReport(). Is this a problem? This is not very related to this issue though.

          2.

          TestNameNodePrunesMissingStorages.java
             for (Future<IOException> future: futureList) {
               try {
                 future.get();
               } catch (Exception e) {
                 LOG.error("Processing block report failed due to {}", e);
               }
             }
          

          I think we need to interpret the return value of future.get().
          If you're going to process exceptions thrown by the task, I don't think we need to return the exception explicitly, since Callable.call() is permitted to throw checked exceptions, which get propagated back to the calling thread (wrapped in an ExecutionException, IIRC); see the sketch at the end of this comment.

          3.

          TestNameNodePrunesMissingStorages.java
                DatanodeStorageInfo[] newStorageInfos = dnDescriptor.getStorageInfos();
                Assert.assertEquals(storageInfos.length, newStorageInfos.length);
                for (int i = 0; i < storageInfos.length; i++) {
                  Assert.assertTrue(storageInfos[i] == newStorageInfos[i]);
                }
          

          do you mean

          Assert.assertArrayEquals(storageInfos, dnDescriptor.getStorageInfos());
          
          Minor comments:
          1. We should add javadoc for STORAGE_REPORT, as its definition in the BlockListAsLongs abstract class is not straightforward.
          2. assert (blockList.getNumberOfBlocks() == -1); I believe we don't need to use an assert statement along with Assert.assertEquals()?
          3. Always use slf4j placeholders in the code, as you are doing in the latest patch. Specifically:
            BlockManager.java
                    LOG.debug("Processing RPC with index " + context.getCurRpc()
            	            + " out of total " + context.getTotalRpcs() + " RPCs in "
            	            + "processReport 0x" +
            	            Long.toHexString(context.getReportId()));
            

            We MUST use placeholders here to avoid string construction if the log level is INFO or above.
            More examples: LOG.info("Block pool id: " + blockPoolId); can be simplified to LOG.info("Block pool id: {}", blockPoolId);
            And for exceptions we don't need a placeholder if the exception is the last parameter, so LOG.error("Processing block report failed due to {}", e); can be LOG.error("Processing block report failed due to ", e);

          4. I see unnecessary blank lines in the v11 patch.
          5. I see not addressed long line checkstyle warnings in BlockManager
          6. if (nn.getFSImage().isUpgradeFinalized()) {
               Set<String> storageIDsInBlockReport = new HashSet<>();
               if (context.getTotalRpcs() == context.getCurRpc() + 1) {

             can be

             if (nn.getFSImage().isUpgradeFinalized() &&
                 context.getTotalRpcs() == context.getCurRpc() + 1) {
               Set<String> storageIDsInBlockReport = new HashSet<>();
          7. BPServiceActor.java
            DatanodeCommand cmd;
            if () {
              cmd = …
            } else {
              cmd = …
            }
            

            Let’s make cmd final.
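          To illustrate the point in comment 2 above (a self-contained sketch, not the actual test code): the Callable can simply throw its IOException, and the calling thread receives it from Future.get() wrapped in an ExecutionException.

            import java.io.IOException;
            import java.util.ArrayList;
            import java.util.List;
            import java.util.concurrent.Callable;
            import java.util.concurrent.ExecutionException;
            import java.util.concurrent.ExecutorService;
            import java.util.concurrent.Executors;
            import java.util.concurrent.Future;

            class CallableExceptionSketch {
              public static void main(String[] args) throws InterruptedException {
                ExecutorService pool = Executors.newFixedThreadPool(2);
                List<Future<Void>> futures = new ArrayList<>();
                Callable<Void> reportTask = () -> {
                  // Stand-in for a block report call; throws instead of returning
                  // the exception as a value.
                  throw new IOException("simulated block report failure");
                };
                futures.add(pool.submit(reportTask));
                for (Future<Void> future : futures) {
                  try {
                    future.get();
                  } catch (ExecutionException e) {
                    // The IOException thrown by call() arrives here as the cause.
                    System.err.println("Processing block report failed: " + e.getCause());
                  }
                }
                pool.shutdown();
              }
            }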

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 31s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 7m 4s trunk passed
          +1 compile 0m 46s trunk passed
          +1 checkstyle 0m 31s trunk passed
          +1 mvnsite 0m 52s trunk passed
          +1 mvneclipse 0m 12s trunk passed
          +1 findbugs 1m 44s trunk passed
          +1 javadoc 1m 1s trunk passed
          +1 mvninstall 0m 52s the patch passed
          +1 compile 0m 45s the patch passed
          +1 javac 0m 45s the patch passed
          +1 checkstyle 0m 30s hadoop-hdfs-project/hadoop-hdfs: The patch generated 0 new + 368 unchanged - 12 fixed = 368 total (was 380)
          +1 mvnsite 0m 54s the patch passed
          +1 mvneclipse 0m 11s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 49s the patch passed
          +1 javadoc 0m 54s the patch passed
          -1 unit 72m 6s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 18s The patch does not generate ASF License warnings.
          92m 28s



          Reason Tests
          Failed junit tests hadoop.hdfs.server.datanode.TestDataNodeMXBean



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12818943/HDFS-10301.011.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 0cd8f805076b 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 8fbe6ec
          Default Java 1.8.0_91
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16094/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16094/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16094/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          Hide
          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 20s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 7m 34s trunk passed
          +1 compile 0m 48s trunk passed
          +1 checkstyle 0m 32s trunk passed
          +1 mvnsite 1m 2s trunk passed
          +1 mvneclipse 0m 15s trunk passed
          +1 findbugs 1m 51s trunk passed
          +1 javadoc 0m 55s trunk passed
          +1 mvninstall 0m 52s the patch passed
          +1 compile 0m 50s the patch passed
          +1 javac 0m 50s the patch passed
          +1 checkstyle 0m 27s hadoop-hdfs-project/hadoop-hdfs: The patch generated 0 new + 368 unchanged - 12 fixed = 368 total (was 380)
          +1 mvnsite 0m 54s the patch passed
          +1 mvneclipse 0m 9s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 54s the patch passed
          +1 javadoc 0m 55s the patch passed
          -1 unit 59m 47s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 21s The patch does not generate ASF License warnings.
          80m 52s



          Reason Tests
          Failed junit tests hadoop.hdfs.server.namenode.TestEditLog



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12818943/HDFS-10301.011.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 1dc89d76ac9d 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 9ccf935
          Default Java 1.8.0_91
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16095/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16095/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16095/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          Hide
          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 22s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 6m 40s trunk passed
          +1 compile 0m 47s trunk passed
          +1 checkstyle 0m 30s trunk passed
          +1 mvnsite 0m 54s trunk passed
          +1 mvneclipse 0m 12s trunk passed
          +1 findbugs 1m 44s trunk passed
          +1 javadoc 0m 54s trunk passed
          +1 mvninstall 0m 47s the patch passed
          +1 compile 0m 41s the patch passed
          +1 javac 0m 41s the patch passed
          +1 checkstyle 0m 28s hadoop-hdfs-project/hadoop-hdfs: The patch generated 0 new + 368 unchanged - 12 fixed = 368 total (was 380)
          +1 mvnsite 0m 48s the patch passed
          +1 mvneclipse 0m 10s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 47s the patch passed
          +1 javadoc 0m 53s the patch passed
          -1 unit 69m 17s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 18s The patch does not generate ASF License warnings.
          88m 25s



          Reason Tests
          Failed junit tests hadoop.hdfs.server.namenode.TestReconstructStripedBlocks
            hadoop.metrics2.sink.TestRollingFileSystemSinkWithHdfs



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12818943/HDFS-10301.011.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 0abbdfa64137 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 37362c2
          Default Java 1.8.0_91
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16096/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16096/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16096/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          Hide
          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 28s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 6m 57s trunk passed
          +1 compile 0m 47s trunk passed
          +1 checkstyle 0m 31s trunk passed
          +1 mvnsite 0m 56s trunk passed
          +1 mvneclipse 0m 12s trunk passed
          +1 findbugs 1m 42s trunk passed
          +1 javadoc 0m 55s trunk passed
          +1 mvninstall 0m 48s the patch passed
          +1 compile 0m 44s the patch passed
          +1 javac 0m 44s the patch passed
          +1 checkstyle 0m 27s hadoop-hdfs-project/hadoop-hdfs: The patch generated 0 new + 368 unchanged - 12 fixed = 368 total (was 380)
          +1 mvnsite 0m 49s the patch passed
          +1 mvneclipse 0m 10s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 53s the patch passed
          +1 javadoc 0m 54s the patch passed
          -1 unit 69m 27s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 20s The patch does not generate ASF License warnings.
          89m 17s



          Reason Tests
          Failed junit tests hadoop.hdfs.server.namenode.TestEditLog
            hadoop.hdfs.server.datanode.TestDataNodeErasureCodingMetrics



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12818943/HDFS-10301.011.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 2d088f995b16 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 37362c2
          Default Java 1.8.0_91
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16097/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16097/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16097/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          Hide
          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 18s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 8m 48s trunk passed
          +1 compile 0m 51s trunk passed
          +1 checkstyle 0m 32s trunk passed
          +1 mvnsite 0m 59s trunk passed
          +1 mvneclipse 0m 14s trunk passed
          +1 findbugs 2m 3s trunk passed
          +1 javadoc 1m 0s trunk passed
          +1 mvninstall 0m 50s the patch passed
          +1 compile 0m 43s the patch passed
          +1 javac 0m 43s the patch passed
          +1 checkstyle 0m 27s hadoop-hdfs-project/hadoop-hdfs: The patch generated 0 new + 368 unchanged - 12 fixed = 368 total (was 380)
          +1 mvnsite 0m 49s the patch passed
          +1 mvneclipse 0m 9s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 46s the patch passed
          +1 javadoc 0m 53s the patch passed
          -1 unit 62m 57s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 40s The patch does not generate ASF License warnings.
          85m 20s



          Reason Tests
          Failed junit tests hadoop.hdfs.server.namenode.ha.TestBootstrapStandby



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12818943/HDFS-10301.011.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 1697f8ceb2a6 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 1c9d2ab
          Default Java 1.8.0_91
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16099/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16099/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16099/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          Hide
          cmccabe Colin P. McCabe added a comment -

          Vinitha Reddy Gankidi asked: Colin P. McCabe Doesn't TCP ignore duplicate packets? Can you explain how this can happen? If the RPC does get duplicated, then we shouldn't return true right when node.leaseId == 0 ?

          That is a fair point. However, the retry logic in the RPC system could resend the message if the NN did not respond within a certain amount of time. Or there could just be a bug which leads to the DN sending full block reports when it shouldn't. In any case, we cannot assume that reordered messages are the problem.

          Konstantin Shvachko wrote: Also I think that Colin P. McCabe's veto, formulated as "I am -1 on a patch which adds extra RPCs", is fully addressed now. The storage report was added to the last RPC representing a single block report. The last patch does not add extra RPCs.

          Yes, this patch addresses my concerns. I withdraw my -1.

          Konstantin Shvachko wrote: The storage ids are already there in current BR protobuf. Why would you want a new field for that. You will need to duplicate all storage ids in case of full block report, when it is not split into multiple RPCs. Seems confusing and inefficient to me.

          A new field would be best because we would avoid creating fake BlockListAsLongs objects with length -1, and re-using protobuf fields for purposes they weren't intended for. A list of storage IDs is not a block report or a list of blocks, and using the same data structures is very confusing. If you want to optimize by not sending the list of storage reports separately when the block report has only one RPC, that's easy to do. Just check if numRpcs == 1 and don't set or check the optional list of strings in that case. I'm not going to block the patch over this, but I do think people reading this will wonder what you were thinking if you overload the PB fields in this way.
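          A hedged sketch of the "skip the list when numRpcs == 1" idea described above, under stated assumptions: the method names below (shouldAttachStorageIds, storagesMissingFromReport) and the plain-collection parameters are illustrative only and are not the patch's or HDFS's actual API; only the curRpc/totalRpcs accounting is taken from the discussion.

          import java.util.HashSet;
          import java.util.List;
          import java.util.Set;

          /** Illustrative sketch only; names and signatures are assumptions, not HDFS APIs. */
          class StorageIdListSketch {

            /**
             * Sender side: attach the full storage-ID list only on the last RPC of a
             * block report that is split into multiple RPCs. A single-RPC full report
             * already enumerates every storage it covers, so nothing extra is sent.
             */
            static boolean shouldAttachStorageIds(int curRpc, int totalRpcs) {
              return totalRpcs > 1 && curRpc + 1 == totalRpcs;
            }

            /**
             * Receiver side: given the storages currently known for the DataNode and
             * the IDs carried with the final RPC, compute which storages the report
             * never mentioned.
             */
            static Set<String> storagesMissingFromReport(List<String> knownStorageIds,
                                                         List<String> reportedStorageIds) {
              Set<String> missing = new HashSet<>(knownStorageIds);
              missing.removeAll(reportedStorageIds);
              return missing;
            }
          }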

          Hide
          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 22s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 7m 11s trunk passed
          +1 compile 0m 49s trunk passed
          +1 checkstyle 0m 31s trunk passed
          +1 mvnsite 0m 53s trunk passed
          +1 mvneclipse 0m 13s trunk passed
          +1 findbugs 1m 48s trunk passed
          +1 javadoc 0m 56s trunk passed
          +1 mvninstall 0m 49s the patch passed
          +1 compile 0m 45s the patch passed
          +1 javac 0m 45s the patch passed
          +1 checkstyle 0m 27s hadoop-hdfs-project/hadoop-hdfs: The patch generated 0 new + 368 unchanged - 12 fixed = 368 total (was 380)
          +1 mvnsite 0m 50s the patch passed
          +1 mvneclipse 0m 9s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 50s the patch passed
          +1 javadoc 0m 51s the patch passed
          -1 unit 60m 44s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 21s The patch does not generate ASF License warnings.
          80m 49s



          Reason Tests
          Failed junit tests hadoop.hdfs.server.namenode.ha.TestBootstrapStandby
            hadoop.hdfs.server.balancer.TestBalancer
            hadoop.hdfs.server.namenode.ha.TestHAFsck



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12818943/HDFS-10301.011.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 6d72422a28d7 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 1c9d2ab
          Default Java 1.8.0_91
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16101/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16101/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16101/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          Hide
          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 21s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 6m 57s trunk passed
          +1 compile 0m 53s trunk passed
          +1 checkstyle 0m 36s trunk passed
          +1 mvnsite 1m 1s trunk passed
          +1 mvneclipse 0m 11s trunk passed
          +1 findbugs 1m 50s trunk passed
          +1 javadoc 0m 56s trunk passed
          +1 mvninstall 0m 53s the patch passed
          +1 compile 0m 47s the patch passed
          +1 javac 0m 47s the patch passed
          +1 checkstyle 0m 27s hadoop-hdfs-project/hadoop-hdfs: The patch generated 0 new + 368 unchanged - 12 fixed = 368 total (was 380)
          +1 mvnsite 0m 54s the patch passed
          +1 mvneclipse 0m 11s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 58s the patch passed
          +1 javadoc 0m 55s the patch passed
          -1 unit 60m 55s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 21s The patch does not generate ASF License warnings.
          81m 27s



          Reason Tests
          Failed junit tests hadoop.hdfs.server.namenode.TestEditLog



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12818943/HDFS-10301.011.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux c29b1d2b82aa 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 38128ba
          Default Java 1.8.0_91
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16106/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16106/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16106/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          Hide
          shv Konstantin Shvachko added a comment -

          I am canceling patch available, because Jenkins is spinning the build all over again. Some bug there?

          Hide
          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 19s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 7m 28s trunk passed
          +1 compile 0m 46s trunk passed
          +1 checkstyle 0m 31s trunk passed
          +1 mvnsite 0m 52s trunk passed
          +1 mvneclipse 0m 12s trunk passed
          +1 findbugs 1m 55s trunk passed
          +1 javadoc 0m 58s trunk passed
          +1 mvninstall 0m 50s the patch passed
          +1 compile 0m 43s the patch passed
          +1 javac 0m 43s the patch passed
          +1 checkstyle 0m 28s hadoop-hdfs-project/hadoop-hdfs: The patch generated 0 new + 368 unchanged - 12 fixed = 368 total (was 380)
          +1 mvnsite 0m 50s the patch passed
          +1 mvneclipse 0m 9s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          -1 findbugs 1m 49s patch/hadoop-hdfs-project/hadoop-hdfs no findbugs output file (hadoop-hdfs-project/hadoop-hdfs/target/findbugsXml.xml)
          -1 javadoc 1m 1s hadoop-hdfs-project_hadoop-hdfs generated 7 new + 0 unchanged - 0 fixed = 7 total (was 0)
          -1 unit 17m 30s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 18s The patch does not generate ASF License warnings.
          37m 48s



          Reason Tests
          Timed out junit tests org.apache.hadoop.hdfs.TestLeaseRecovery2
            org.apache.hadoop.hdfs.TestDatanodeDeath
            org.apache.hadoop.hdfs.TestPread
            org.apache.hadoop.hdfs.TestBlockStoragePolicy



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12818943/HDFS-10301.011.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux ea46cbba5d17 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 38128ba
          Default Java 1.8.0_91
          findbugs v3.0.0
          findbugs https://builds.apache.org/job/PreCommit-HDFS-Build/16112/artifact/patchprocess/patch-findbugs-hadoop-hdfs-project_hadoop-hdfs.txt
          javadoc https://builds.apache.org/job/PreCommit-HDFS-Build/16112/artifact/patchprocess/diff-javadoc-javadoc-hadoop-hdfs-project_hadoop-hdfs.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16112/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16112/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16112/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          Hide
          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 17s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 6m 54s trunk passed
          +1 compile 0m 46s trunk passed
          +1 checkstyle 0m 30s trunk passed
          +1 mvnsite 0m 55s trunk passed
          +1 mvneclipse 0m 12s trunk passed
          +1 findbugs 1m 46s trunk passed
          +1 javadoc 0m 56s trunk passed
          +1 mvninstall 0m 51s the patch passed
          +1 compile 0m 45s the patch passed
          +1 javac 0m 45s the patch passed
          +1 checkstyle 0m 28s hadoop-hdfs-project/hadoop-hdfs: The patch generated 0 new + 368 unchanged - 12 fixed = 368 total (was 380)
          +1 mvnsite 0m 49s the patch passed
          +1 mvneclipse 0m 11s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 49s the patch passed
          +1 javadoc 0m 53s the patch passed
          -1 unit 61m 39s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 22s The patch does not generate ASF License warnings.
          81m 18s



          Reason Tests
          Failed junit tests hadoop.hdfs.server.namenode.TestEditLog



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12818943/HDFS-10301.011.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux c1a40f43f99c 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 38128ba
          Default Java 1.8.0_91
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16111/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16111/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16111/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          Hide
          shv Konstantin Shvachko added a comment -

          My general approach to protobuf structures is to minimize changes, especially with redundant fields.
          It is very easy to add fields, as you demonstrated, but you can never remove them.
          So add them only if you absolutely must.
          But different people can of course have different approaches.

          Hide
          andrew.wang Andrew Wang added a comment -

          My understanding of PB is that we have a fixed 4 bits for tags, so there isn't really overhead to adding more PB fields as long as they are optional or repeated. See: https://developers.google.com/protocol-buffers/docs/encoding

          Given that, I'd err on the side of readability rather than trying to reuse existing fields. Since block reports are a pretty infrequent operation, I wouldn't stress over a few bytes if we end up filling a required field with a dummy value. I agree with Colin that the current overloading of BlockListAsLongs is confusing.
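          For context on the overhead argument, a tiny worked example of general protocol buffer key encoding (independent of this patch): a field's key is encoded as (fieldNumber << 3) | wireType, so an optional field numbered 1 through 15 costs a single key byte when set and nothing at all when it is never set.

          public class ProtobufTagDemo {
            public static void main(String[] args) {
              int fieldNumber = 15;                     // largest field number whose key still fits in one byte
              int wireType = 2;                         // 2 = length-delimited (strings, nested messages)
              int key = (fieldNumber << 3) | wireType;  // (15 << 3) | 2 = 122 = 0x7A: one byte on the wire
              System.out.printf("key byte = 0x%02X%n", key);
              // Field number 16 would give (16 << 3) | 2 = 130, which exceeds 7 bits,
              // so its varint-encoded key takes two bytes. An optional field that is
              // never set is simply absent from the encoded message.
            }
          }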

          Hide
          shv Konstantin Shvachko added a comment -

          As I commented earlier, I am not in favor of adding redundant fields. The readability argument is also quite questionable, because you end up either filling storage information in two fields, or sending it in different fields for different types of block report messages.
          In more detail:

          • Suppose we introduced repeated String allStorageIds.
          • In a full report (one that is not split into multiple RPCs) we already have all storage ids listed in the StorageBlockReports, so we don't need allStorageIds. If we nevertheless fill allStorageIds it will be confusing.
          • In a report that is split into multiple RPCs we fill allStorageIds, because each RPC reports only one storage. So in this case we would use a different field to pass the storage IDs.
          • I think code is more readable when the same information is passed via the same fields, and is not duplicated.
          Hide
          redvine Vinitha Reddy Gankidi added a comment -

          Thanks for the review Mingliang Liu. I have attached a new patch (012) that addresses your comments.

          > FSImage#isUpgradeFinalized is not volatile and nn.getFSImage().isUpgradeFinalized() is not holding the read lock in NameNodeRpcServer#blockReport(). Is this a problem? This is not very related to this issue though.

          My patch does not make any changes to the isUpgradeFinalized method. If this is a problem, we should open another JIRA to address it.

          > If you're going to process exceptions thrown by the task, I think we don't need to return them explicitly, as Callable.call() is permitted to throw checked exceptions

          Thanks for the good suggestion! I have modified Callable.call() to return a DatanodeCommand and throw IOException. I don't explicitly catch the exception since JUnit will take care of it.

          > I think we need to interpret the return value of future.get()?

          future.get() returns a DatanodeCommand, which we don't care about and don't need to interpret.

          > do you mean Assert.assertArrayEquals(storageInfos, dnDescriptor.getStorageInfos());

          Yes, thanks for that! I have made the change.

          > We should add javadoc for STORAGE_REPORT as it's not that straightforwardly defined in the BlockListAsLongs abstract class.

          Added the doc

          > assert (blockList.getNumberOfBlocks() == -1); I believe we don't need to use an assert statement along with Assert.assertEquals()?

          I changed the assert to Assert.assertEquals. However, the existing test also uses assert, e.g. assert(numBlocksReported >= expectedTotalBlockCount);

          > Always use slf4j placeholders in the code as you are doing in the latest patch.

          Thanks for the tip! I noticed that placeholders were not used consistently. I tried to maintain the logging style that was already used in that particular file. I have modified all the log messages in my patch to use placeholders wherever possible. Slf4j was not used in some places, for instance in TestNameNodePrunesMissingStorages.

          > I see unnecessary blank lines in the v11 patch. I see unaddressed long-line checkstyle warnings in BlockManager.

          I noticed two blank lines in TestNameNodePrunesMissingStorages in the v11 patch and removed them. I do not see any checkstyle warnings.

          > if (nn.getFSImage().isUpgradeFinalized() &&
          >     context.getTotalRpcs() == context.getCurRpc() + 1) {
          >   Set<String> storageIDsInBlockReport = new HashSet<>();

          Combined as suggested.

          > BPServiceActor.java Let’s make cmd final.

          Since cmd was not final previously, I have left it unchanged.
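          Pulling the thread together, a minimal sketch of the pruning idea discussed above, with assumptions clearly marked: the interfaces and the storagesToPrune helper below are illustrative stand-ins, not the committed BlockManager code; only the last-RPC check, the upgrade-finalized guard, and the storageIDsInBlockReport set come from the discussion.

          import java.util.HashSet;
          import java.util.Set;

          class ZombieStoragePruningSketch {
            /** Minimal stand-ins for the real classes referenced in the comments above. */
            interface BlockReportContext { int getCurRpc(); int getTotalRpcs(); }
            interface DatanodeStorage { String getStorageID(); }

            /**
             * Run only on the final RPC of a block report, after upgrade is finalized:
             * any storage the NameNode knows for this DataNode that does not appear in
             * the set of reported storage IDs is a candidate for removal.
             */
            static Set<String> storagesToPrune(BlockReportContext context,
                                               boolean upgradeFinalized,
                                               Set<String> knownStorageIds,
                                               Iterable<DatanodeStorage> reportedStorages) {
              Set<String> toPrune = new HashSet<>();
              if (!upgradeFinalized
                  || context.getCurRpc() + 1 != context.getTotalRpcs()) {
                return toPrune;                         // act only on the report's last RPC
              }
              Set<String> storageIDsInBlockReport = new HashSet<>();
              for (DatanodeStorage s : reportedStorages) {
                storageIDsInBlockReport.add(s.getStorageID());
              }
              for (String id : knownStorageIds) {
                if (!storageIDsInBlockReport.contains(id)) {
                  toPrune.add(id);                      // not mentioned by the report
                }
              }
              return toPrune;
            }
          }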

          Hide
          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 18s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 10m 58s trunk passed
          +1 compile 0m 45s trunk passed
          +1 checkstyle 0m 33s trunk passed
          +1 mvnsite 0m 54s trunk passed
          +1 mvneclipse 0m 19s trunk passed
          +1 findbugs 1m 49s trunk passed
          +1 javadoc 0m 56s trunk passed
          +1 mvninstall 0m 44s the patch passed
          +1 compile 0m 44s the patch passed
          +1 javac 0m 44s the patch passed
          +1 checkstyle 0m 27s hadoop-hdfs-project/hadoop-hdfs: The patch generated 0 new + 368 unchanged - 12 fixed = 368 total (was 380)
          +1 mvnsite 0m 51s the patch passed
          +1 mvneclipse 0m 10s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 48s the patch passed
          +1 javadoc 0m 53s the patch passed
          -1 unit 65m 0s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 23s The patch does not generate ASF License warnings.
          88m 51s



          Reason Tests
          Failed junit tests hadoop.hdfs.web.TestWebHdfsTimeouts



          Subsystem Report/Notes
          Docker Image: yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12819238/HDFS-10301.012.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux b189d80c0730 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 703fdf8
          Default Java 1.8.0_101
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16171/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16171/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16171/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          shv Konstantin Shvachko added a comment -

          TestWebHdfsTimeouts failure does not look to be related to the changes.
          The last patch looks good.

          shv Konstantin Shvachko added a comment -

          I just committed this to trunk. Congratulations Vinitha Reddy Gankidi!
          Also ported to branch-2 and branch-2.8.
          Will keep it open while a port to branch-2.7 / 2.6 is in the works.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 0s Docker mode activated.
          -1 patch 0m 6s HDFS-10301 does not apply to branch-2. Rebase required? Wrong Branch? See https://wiki.apache.org/hadoop/HowToContribute for help.



          Subsystem Report/Notes
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12820078/HDFS-10301.branch-2.patch
          JIRA Issue HDFS-10301
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16182/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          hudson Hudson added a comment -

          SUCCESS: Integrated in Hadoop-trunk-Commit #10148 (See https://builds.apache.org/job/Hadoop-trunk-Commit/10148/)
          HDFS-10301. Interleaving processing of storages from repeated block (shv: rev 85a20508bd04851d47c24b7562ec2927d5403446)

          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/BlockListAsLongs.java
          • hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/blockmanagement/TestBlockManager.java
          • hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDnRespectsBlockReportSplitThreshold.java
          • hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestNNHandlesBlockReportPerStorage.java
          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockManager.java
          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/NameNodeRpcServer.java
          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/DatanodeDescriptor.java
          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockReportLeaseManager.java
          • hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/blockmanagement/TestNameNodePrunesMissingStorages.java
          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BPServiceActor.java
          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/DatanodeStorageInfo.java
          redvine Vinitha Reddy Gankidi added a comment -

          Added a patch for branch-2.7.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 19s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 12m 8s branch-2.7 passed
          +1 compile 1m 2s branch-2.7 passed with JDK v1.8.0_101
          +1 compile 1m 2s branch-2.7 passed with JDK v1.7.0_101
          +1 checkstyle 0m 31s branch-2.7 passed
          +1 mvnsite 1m 1s branch-2.7 passed
          +1 mvneclipse 0m 17s branch-2.7 passed
          +1 findbugs 3m 10s branch-2.7 passed
          +1 javadoc 1m 8s branch-2.7 passed with JDK v1.8.0_101
          +1 javadoc 1m 55s branch-2.7 passed with JDK v1.7.0_101
          +1 mvninstall 1m 10s the patch passed
          +1 compile 1m 11s the patch passed with JDK v1.8.0_101
          +1 javac 1m 11s the patch passed
          +1 compile 1m 7s the patch passed with JDK v1.7.0_101
          +1 javac 1m 7s the patch passed
          -0 checkstyle 0m 29s hadoop-hdfs-project/hadoop-hdfs: The patch generated 3 new + 405 unchanged - 5 fixed = 408 total (was 410)
          +1 mvnsite 1m 0s the patch passed
          +1 mvneclipse 0m 14s the patch passed
          -1 whitespace 0m 0s The patch has 7892 line(s) that end in whitespace. Use git apply --whitespace=fix <<patch_file>>. Refer https://git-scm.com/docs/git-apply
          -1 whitespace 3m 14s The patch 196 line(s) with tabs.
          +1 findbugs 3m 14s the patch passed
          +1 javadoc 0m 57s the patch passed with JDK v1.8.0_101
          +1 javadoc 1m 38s the patch passed with JDK v1.7.0_101
          -1 unit 58m 42s hadoop-hdfs in the patch failed with JDK v1.7.0_101.
          -1 asflicense 0m 24s The patch generated 3 ASF License warnings.
          154m 32s



          Reason Tests
          JDK v1.8.0_101 Failed junit tests hadoop.hdfs.server.namenode.snapshot.TestRenameWithSnapshots
            hadoop.hdfs.server.datanode.TestDataNodeVolumeFailure
            hadoop.hdfs.TestSafeMode
            hadoop.hdfs.server.namenode.snapshot.TestOpenFilesWithSnapshot
          JDK v1.7.0_101 Failed junit tests hadoop.hdfs.server.namenode.snapshot.TestRenameWithSnapshots
            hadoop.hdfs.server.datanode.TestDataNodeVolumeFailure
            hadoop.hdfs.TestRollingUpgrade
            hadoop.hdfs.server.datanode.TestBlockReplacement
            hadoop.hdfs.server.namenode.TestFileTruncate
            hadoop.hdfs.server.namenode.snapshot.TestOpenFilesWithSnapshot



          Subsystem Report/Notes
          Docker Image: yetus/hadoop:c420dfe
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12821107/HDFS-10301.branch-2.7.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 995eee067a99 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision branch-2.7 / 6cb2e97
          Default Java 1.7.0_101
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_101 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_101
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/16261/artifact/patchprocess/diff-checkstyle-hadoop-hdfs-project_hadoop-hdfs.txt
          whitespace https://builds.apache.org/job/PreCommit-HDFS-Build/16261/artifact/patchprocess/whitespace-eol.txt
          whitespace https://builds.apache.org/job/PreCommit-HDFS-Build/16261/artifact/patchprocess/whitespace-tabs.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16261/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_101.txt
          JDK v1.7.0_101 Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16261/testReport/
          asflicense https://builds.apache.org/job/PreCommit-HDFS-Build/16261/artifact/patchprocess/patch-asflicense-problems.txt
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16261/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          shv Konstantin Shvachko added a comment -

          The patch for branch-2.7 looks good.
          I just committed this. Thank you Vinitha.

          ebadger Eric Badger added a comment -

          Konstantin Shvachko, this breaks TestDataNodeVolumeFailure.testVolumeFailure(). blockReport() is called with context = null. Then inside blockReport we try to call methods on context while it is still null:

          java.lang.NullPointerException: null
          	at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.blockReport(NameNodeRpcServer.java:1342)
          	at org.apache.hadoop.hdfs.server.datanode.TestDataNodeVolumeFailure.testVolumeFailure(TestDataNodeVolumeFailure.java:189)
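          A minimal defensive sketch (illustrative only, not the committed fix) of how a null context could be tolerated on the server side, assuming the 3-arg BlockReportContext constructor that also appears in the test fix below:

          // Sketch only; assumes BlockReportContext(totalRpcs, curRpc, reportId).
          import org.apache.hadoop.hdfs.server.protocol.BlockReportContext;

          class BlockReportContextDefaults {
            // Fall back to a single-RPC context (1 RPC total, index 0) when a
            // caller such as an old test passes null.
            static BlockReportContext orSingleRpc(BlockReportContext context) {
              return context != null
                  ? context
                  : new BlockReportContext(1, 0, System.nanoTime());
            }
          }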
          
          daryn Daryn Sharp added a comment -

          -1 This needs to be reverted, and I'm too git-ignorant to do it. Our sandbox clusters won't come out of safemode because the NN thinks the DNs are reporting -1 blocks. I see this patch returns -1 blocks for a "storage report". I need to catch up on this jira, but in the meantime it must be reverted.

          I find it odd this patch was committed with so many failed tests.

          redvine Vinitha Reddy Gankidi added a comment -

          Eric Badger Thanks for reporting this. TestDataNodeVolumeFailure does not call blockReport() with context=null on trunk. This was fixed as a part of HDFS-9260. We need to modify TestDataNodeVolumeFailure.testVolumeFailure() for branch-2.7 as well:

          -    cluster.getNameNodeRpc().blockReport(dnR, bpid, reports, null);
          +    cluster.getNameNodeRpc().blockReport(dnR, bpid, reports,
          +        new BlockReportContext(1, 0, System.nanoTime()));
          
          shv Konstantin Shvachko added a comment -

          And the rest of the tests are passing locally.

          shv Konstantin Shvachko added a comment -

          Looks like we need to fix TestDataNodeVolumeFailure for all 2.x branches. Will open a jira for that promptly.
          Sorry guys for breaking your build.

          Daryn Sharp, it seems that you are overreacting a bit. Only one test is broken. I reran the other tests reported by Jenkins. They all pass.
          Could you please elaborate on the problem with the sandbox cluster? If the NN doesn't come out of safe mode, wouldn't that be caught by unit tests?

          daryn Daryn Sharp added a comment -

          Block report processing does not need to be so complicated. Just ban single-rpc reports and the problem goes away. At most the DN is retransmitting the same storage report; reprocessing it should not be a problem.

          If the only objection is that multiple RPCs are a scalability issue, I completely disagree.

          1. A single RPC is not scalable. It will not work on clusters with many hundreds of millions of blocks.
          2. The size of the RPC quickly becomes an issue. The memory pressure and premature promotion rate - even with a huge young gen (8-16G) - are not sustainable.
          3. The time to process the RPC becomes an issue, and the DN timing out and retransmitting (which causes this jira's bug) becomes an issue.

          Per-storage block reports eliminated multiple full GCs (2-3 for 5-10mins each) during startup on large clusters.

          Please revert or I'll grab someone here to help me do it.

          daryn Daryn Sharp added a comment -

          If NN doesn't come out of safe mode, then wouldn't that be caught by unit tests.

          You have more faith in the unit tests than I do. I do not have time to fully debug why sandbox clusters are DOA when I object to the implementation anyway.

          shv Konstantin Shvachko added a comment -

          Daryn, I do not understand what you disagree with. And what is the problem with the implementation that you object to?
          Nobody is taking away per-storage block reports.

          If you don't have time to understand the jira and don't have time to look at your own sandbox cluster, then how can I help you?

          daryn Daryn Sharp added a comment -

          I've read this jira as I said I would, and I've looked at the patch.

          Our nightly build & deploy for 2.7 is broken. DNs claim to report thousands of blocks, NN says nope, -1. This should be reason enough to revert until we get to the bottom of it. We're reverting internally. If that fixes it, I will have someone help me revert tomorrow morning if not already.

          Why is this patch changing per-storage reports when it's the single-rpc report that is the problem? Is this change compatible?

          1. What does an old NN do if it gets this pseudo-report? Will it forget about all the blocks on the non-last storage?
          2. What does a new NN do when it gets old style reports? Will it remove all but the last storage?

          This zombie detection, report context, etc is getting out of hand. I don't understand why the zombie detection isn't based on the healthy storages in the heartbeat. Anything else gets flagged as failed and the heartbeat monitor disposes of them.

          shv Konstantin Shvachko added a comment -

          We are actively looking into a possible problem with this change. LMK if the revert fixes the problem. Just to clarify: are you using per-storage reports on your cluster?
          In the meantime, answering your questions, Daryn.

          Why is this patch changing per-storage reports when it's the single-rpc report that is the problem?
          The problem is both with single-rpc and per-storage reports. In the multi-rpc case DNs can send repeated RPCs for each storage, and this will cause incorrect zombie detection if the RPCs are processed out of order.

          Is this change compatible?
          Yes. The compatibility issues were discussed here above.

          What does an old NN do if it gets this pseudo-report?
          According to Rolling upgrade documentation we first upgrade NameNodes, then DataNodes. So in practice new DNs don't talk to old NNs.

          What does a new NN do when it gets old style reports? Will it remove all but the last storage?
          As mentioned in this comment, old DataNode reports will be processed as regular reports; only zombie storages will not be removed until the DNs are upgraded.
          During the upgrade no storages are removed.

          shv Konstantin Shvachko added a comment -

          Unfortunately, there seems to be a problem with the patch. The storage report is not recognized in certain cases.
          Will revert the commits.

          hudson Hudson added a comment -

          SUCCESS: Integrated in Hadoop-trunk-Commit #10189 (See https://builds.apache.org/job/Hadoop-trunk-Commit/10189/)
          Revert "HDFS-10301. Interleaving processing of storages from repeated (shv: rev c4463f2ef20d2cb634a1249246f83c451975f3dc)

          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BPServiceActor.java
          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/DatanodeStorageInfo.java
          • hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDnRespectsBlockReportSplitThreshold.java
          • hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/blockmanagement/TestBlockManager.java
          • hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/blockmanagement/TestNameNodePrunesMissingStorages.java
          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockManager.java
          • hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestNNHandlesBlockReportPerStorage.java
          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/BlockListAsLongs.java
          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/NameNodeRpcServer.java
          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockReportLeaseManager.java
          • hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/DatanodeDescriptor.java
          kihwal Kihwal Lee added a comment -

          Konstantin Shvachko, thanks for the revert, but I think you missed branch-2.7.

          daryn Daryn Sharp added a comment -

          According to Rolling upgrade documentation we first upgrade NameNodes, then DataNodes. So in practice new DNs don't talk to old NNs.

          Although the docs claim downgrading the NN requires full downtime, or a rolling downgrade of the DNs first, we should make an effort to ensure DNs are compatible when possible. An emergency NN downgrade shouldn't require full downtime when a failover to the prior release would suffice.

          I don't like the idea of BRs triggering pruning of storages. That aside, the patch doesn't appear to close the race. The lock is released after the storage report is processed and re-acquired to find the "zombies". We're back to out-of-order processing of heartbeats, which I think is the real problem causing false positives.

          How about something like this? The DatanodeDescriptor tracks the last BlockReportContext#reportId. The value is updated when processing a BR, which has the latest value if the BR lease let it in. The heartbeat now includes the last used reportId. On the NN, if the heartbeat contains this field, the NN will ignore the heartbeat if it does not equal the DatanodeDescriptor's value. There are little details like DN re-registration resetting the field, etc., but wouldn't something simple like this work?
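          A rough sketch of that idea (the class, field, and method names below are hypothetical, not existing HDFS APIs; the real change would live in DatanodeDescriptor and the heartbeat handling):

          // Hypothetical sketch of the proposal above, not existing HDFS code.
          class ReportIdTracker {
            // Updated while processing a full block report for this datanode.
            private volatile long lastProcessedReportId;

            void onBlockReportProcessed(long reportId) {
              lastProcessedReportId = reportId;
            }

            // The NN would consult this before pruning storages on a heartbeat:
            // prune only if the heartbeat echoes the reportId of the block
            // report most recently processed for this datanode.
            boolean mayPruneOnHeartbeat(long reportIdFromHeartbeat) {
              return reportIdFromHeartbeat == lastProcessedReportId;
            }
          }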

          shv Konstantin Shvachko added a comment -

          Just pushed branch-2.7

          redvine Vinitha Reddy Gankidi added a comment -

          The real problem is the state associated with the Datanode (curBlockReportRpcsSeen, curBlockReportId) that is used to figure out when to remove zombie storages. This state gets messed up when block reports are processed out of order. The current patch still allows out-of-order processing of block reports but gets rid of this per-Datanode state.

          In patch 012, although the isStorageReport method returns true for the STORAGE_REPORT BlockListAsLongs, this method gets overridden to return false in the BufferDecoder. I have attached a new patch (013) that fixes this issue.
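          A minimal sketch of that pitfall (the classes below only loosely mirror BlockListAsLongs and its BufferDecoder; this is not the actual HDFS code):

          // Simplified illustration only, not the real BlockListAsLongs classes.
          abstract class Report {
            // A normal report is not the special "storage report" marker.
            boolean isStorageReport() { return false; }

            // Sentinel instance whose override marks it as a storage report.
            static final Report STORAGE_REPORT = new Report() {
              @Override
              boolean isStorageReport() { return true; }
            };
          }

          class BufferDecoder extends Report {
            // Pitfall: the decoder answers false unconditionally, so on the
            // receiving side a decoded STORAGE_REPORT loses its marker.
            @Override
            boolean isStorageReport() { return false; }
          }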

          shv Konstantin Shvachko added a comment -

          the patch doesn't appear to close the race.
          It does. The problem is not that we release the lock, but that there is block-report-related state in different places, particularly the BitSet in DatanodeDescriptor; see e.g. this comment under (1). The state can be reset by interleaving reports. So if we don't have the state, there is no race condition, because block reports are independent and can be processed in any order.
          The patch does just that: it removes the block-report-tracking state. See here under Approach. In an earlier version of the patch Vinitha introduced the storage report as a separate RPC, which was opposed by Colin. The latest patch incorporates the storage report into the RPC for the last storage. But the processing of all reports is still independent, therefore releasing the lock is not a problem.
          Just adding more details to Vinitha's response.
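          A conceptual sketch of that approach (the method below is illustrative and the removal hook is hypothetical, not the exact code in the patch): the last RPC of a full block report carries the complete set of storage IDs, so unreported storages can be pruned without any cross-RPC state.

          // Illustrative fragment; assumes the real DatanodeDescriptor /
          // DatanodeStorageInfo accessors, but removeZombieStorage() is a
          // hypothetical helper.
          void pruneUnreportedStorages(DatanodeDescriptor node,
              java.util.Set<String> storageIdsInBlockReport) {
            for (DatanodeStorageInfo storage : node.getStorageInfos()) {
              if (!storageIdsInBlockReport.contains(storage.getStorageID())) {
                // Storage never appeared in this block report: treat it as a zombie.
                removeZombieStorage(node, storage);
              }
            }
          }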

          wouldn't something simple like this work?
          I don't see how it will work. Not simple. The heartbeats can come at any time between reports or between storages and update the reportId.

          Daryn Sharp, I think removing the br-state substantially simplifies report processing and makes reports independent (or idempotent), which is important by itself and solves the problem of interleaving reports. The last patch solves the bug you reported (thanks) and provides a unit test for it. As you can see, this jira has been under development for quite a while; it would be good to commit it soon. Do you still stand behind your veto given the latest patch?

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 13s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 8m 43s trunk passed
          +1 compile 1m 2s trunk passed
          +1 checkstyle 0m 32s trunk passed
          +1 mvnsite 1m 10s trunk passed
          +1 mvneclipse 0m 14s trunk passed
          +1 findbugs 1m 57s trunk passed
          +1 javadoc 0m 54s trunk passed
          +1 mvninstall 0m 48s the patch passed
          +1 compile 0m 45s the patch passed
          +1 javac 0m 45s the patch passed
          +1 checkstyle 0m 28s hadoop-hdfs-project/hadoop-hdfs: The patch generated 0 new + 368 unchanged - 12 fixed = 368 total (was 380)
          +1 mvnsite 0m 50s the patch passed
          +1 mvneclipse 0m 10s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 1m 51s the patch passed
          +1 javadoc 0m 53s the patch passed
          -1 unit 58m 53s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 25s The patch does not generate ASF License warnings.
          81m 7s



          Reason Tests
          Failed junit tests hadoop.tracing.TestTracing
            hadoop.security.TestRefreshUserMappings



          Subsystem Report/Notes
          Docker Image: yetus/hadoop:9560f25
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12822408/HDFS-10301.013.patch
          JIRA Issue HDFS-10301
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux b164d05d4a39 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 6255859
          Default Java 1.8.0_101
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16340/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16340/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16340/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          shv Konstantin Shvachko added a comment -

          I think Daryn Sharp's veto above was addressed. The reason was not clearly formulated, but was understandably related to a bug in the previous version of the patch. The bug is fixed, and the unit test is provided.
          I plan to commit this on Wednesday 08/10 if there are no further objections.

          daryn Daryn Sharp added a comment -

          I'll review today.

          daryn Daryn Sharp added a comment -

          My main objections (other than the fatal bug) are the incompatible change to the protocol coupled with essentially a malformed block report buffer. It's an attempt to shoehorn into the block report processing what should be handled by a heartbeat's storage reports.

          I think when you say my compatibility concern was addressed, it wasn't fixed in code, but stated as don't-do-that? Won't the empty storage reports in the last RPC cause an older NN to go into a replication storm? Full downtime on a ~5k-node cluster to roll back, then ~40 mins to go active, is unacceptable when a failover to the prior release would have worked if not for this patch.

          This approach will also negate asynchronously processing FBRs (like I did with IBRs).

          Zombies should be handled by the heartbeat's pruning of excess storages. As an illustration, shouldn't something close to this work?

          --- a/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/DatanodeDescriptor.java
          +++ b/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/DatanodeDescriptor.java
          @@ -466,11 +466,16 @@ public void updateHeartbeatState(StorageReport[] reports, long cacheCapacity,
               setLastUpdateMonotonic(Time.monotonicNow());
               this.volumeFailures = volFailures;
               this.volumeFailureSummary = volumeFailureSummary;
          +
          +    boolean storagesUpToDate = true;
               for (StorageReport report : reports) {
                 DatanodeStorageInfo storage = updateStorage(report.getStorage());
                 if (checkFailedStorages) {
                   failedStorageInfos.remove(storage);
                 }
          +      // don't prune unless block reports for all the storages in the
          +      // heartbeat have been processed
          +      storagesUpToDate &= (storage.getLastBlockReportId() == curBlockReportId);
           
                 storage.receivedHeartbeat(report);
                 totalCapacity += report.getCapacity();
          @@ -492,7 +497,8 @@ public void updateHeartbeatState(StorageReport[] reports, long cacheCapacity,
               synchronized (storageMap) {
                 storageMapSize = storageMap.size();
               }
          -    if (storageMapSize != reports.length) {
          +    if (curBlockReportId != 0
          +        ? storagesUpToDate : storageMapSize != reports.length) {
                 pruneStorageMap(reports);
               }
             }
          @@ -527,6 +533,7 @@ private void pruneStorageMap(final StorageReport[] reports) {
                     // This can occur until all block reports are received.
                     LOG.debug("Deferring removal of stale storage {} with {} blocks",
                         storageInfo, storageInfo.numBlocks());
          +          storageInfo.setState(DatanodeStorage.State.FAILED);
                   }
                 }
               }
          

          The next heartbeat after all reports are sent triggers the pruning. Other changes are required, such as removal of much of the context processing code similar to the current patch.

          redvine Vinitha Reddy Gankidi added a comment -

          Daryn Sharp That is a good suggestion. Zombies should be handled by the heartbeat's pruning of excess storages.
          Why do we need to wait until block reports for all the storages in the heartbeat are processed?
          Do you want to submit a patch for this?

          cmccabe Colin P. McCabe added a comment -

          I don't think the heartbeat is the right place to handle reconciling the block storages. One reason is because this adds extra complexity and time to the heartbeat, which happens far more frequently than an FBR. We even talked about making the heartbeat lockless-- clearly you can't do that if you are traversing all the block storages. Taking the FSN lock is expensive and heartbeats are sent quite frequently from each DN-- every few seconds. Another reason reconciling storages in heartbeats is bad is because if the heartbeat tells you about a new storage, you won't know what blocks are in it until the FBR arrives. So the NN may end up assigning a bunch of new blocks to a storage which looks empty, but really is full.

          I came up with what I believe is the correct patch to fix this problem months ago. It's here as https://issues.apache.org/jira/secure/attachment/12805931/HDFS-10301.005.patch . It doesn't modify any RPCs or add any new mechanisms. Instead, it just fixes the obvious bug in the HDFS-7960 logic. The only counter-argument to applying patch 005 that anyone ever came up with is that it doesn't eliminate zombies when FBRs get interleaved. But this is not a good counter-argument, since FBR interleaving is extremely, extremely rare in well-run clusters. The proof should be obvious-- if FBR interleaving happened on more clusters, more people would hit this serious data loss bug.

          This JIRA has been extremely frustrating. It seems like most, if not all, of the points that I brought up in my reviews were ignored. I talked about the obvious problems with compatibility with Konstantin Shvachko's solution and even explicitly asked him to test the upgrade case. I told him that this JIRA was a bad one to give to a promising new contributor such as Vinitha Reddy Gankidi, because it required a lot of context and was extremely tricky. Both myself and Andrew Wang commented that overloading BlockListAsLongs was confusing and not necessary. The patch confused "not modifying the .proto file" with "not modifying the RPC content" which are two very separate concepts, as I commented over and over. Clearly these comments were ignored. If anything, I think Konstantin Shvachko got very lucky that the bug manifested itself quickly rather than creating a serious data loss situation a few months down the road, like the one I had to debug when fixing HDFS-7960.

          Again I would urge you to just commit patch 005. Or at least evaluate it.

          shv Konstantin Shvachko added a comment -

          Hey Colin P. McCabe, I agree with you that this jira is frustrating. And I find it hard to overestimate your contribution to this. All points that you brought up here were addressed, and on multiple occasions. If you choose or fail to hear and understand other people's arguments, then there is little one can do to help this. So I will ignore (now for real) all but one of your meta-comments, because they were answered multiple times. Should you have a question, please formulate it for me to answer.
          I do not think you are in a position to judge, on public lists, the qualifications of a community member to fix a bug without knowing him or her. I find it unprofessional and rude.
          Working with Vinitha I can say she is no newbie in Hadoop at all, even though she was not directly involved with the community until recently. You owe her an apology.
          Now to the subject of this issue.

          shv Konstantin Shvachko added a comment -

          Took some time to look into heartbeat processing and to consult with Vinitha.
          So heartbeats currently have logic to remove failed storages reported by DNs via VolumeFailureSummary. This happens in three steps:

          1. If DN reports a failed volume in a heartbeat (HDFS-7604), NN marks the corresponding DatanodeStorageInfo as FAILED. See DatanodeDescriptor.updateFailedStorage().
          2. When the HeartbeatManager.Monitor kicks in it checks the FAILED flag on the storage and does removeBlocksAssociatedTo(failedStorage). But it does not remove the storage itself. HDFS-7208
          3. On next heartbeat the DN will not report the storage that was previously reported as failed. This triggers NN to prune the storage DatanodeDescriptor.pruneStorageMap() because it doesn't contain replicas. HDFS-7596

          Essentially we already have a dual mechanism for deleting storages: one through heartbeats, another via block reports. So we can remove the redundancy. Daryn Sharp's idea simplifies a lot of code, does not require changes in any RPCs, is fully backward compatible, and eliminates the notion of zombie storage, which solves the interleaving report problem. I think we should go for it.
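
          For reference, a toy model of the three-step flow above (class and method names here are simplified stand-ins, not the actual DatanodeDescriptor / HeartbeatManager code):

            import java.util.HashMap;
            import java.util.HashSet;
            import java.util.Map;
            import java.util.Set;

            /** Toy model of the three-step storage removal flow; names are
             *  simplified stand-ins, not the real HDFS classes. */
            class StorageRemovalModel {
              enum State { NORMAL, FAILED }

              static class Storage {
                final String id;
                State state = State.NORMAL;
                final Set<String> blocks = new HashSet<>();
                Storage(String id) { this.id = id; }
              }

              private final Map<String, Storage> storageMap = new HashMap<>();

              // Step 1: a heartbeat reports the volume as failed, so the storage
              // is marked FAILED (compare DatanodeDescriptor.updateFailedStorage()).
              void onHeartbeatReportedFailure(String storageId) {
                Storage s = storageMap.get(storageId);
                if (s != null) {
                  s.state = State.FAILED;
                }
              }

              // Step 2: the background monitor removes the replicas of FAILED
              // storages but keeps the storage entry itself (compare
              // HeartbeatManager.Monitor / removeBlocksAssociatedTo()).
              void backgroundMonitorTick() {
                for (Storage s : storageMap.values()) {
                  if (s.state == State.FAILED) {
                    s.blocks.clear();
                  }
                }
              }

              // Step 3: the next heartbeat no longer lists the storage; since it
              // is now empty it is dropped from the map (compare pruneStorageMap()).
              void pruneUnreported(Set<String> reportedStorageIds) {
                storageMap.values().removeIf(
                    s -> !reportedStorageIds.contains(s.id) && s.blocks.isEmpty());
              }
            }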

          Initially I was concerned about removing storages in heartbeats, but

          1. We already do it anyway
          2. All heartbeats hold the FSN read lock, whether they report failed storages or not. The scanning of the storages takes a lock on the corresponding DatanodeDescriptor.storageMap, which is fine-grained.
          3. Storages are not actually removed in a heartbeat, only flagged as FAILED. The replica removal is performed by a background Monitor.
          4. If we decide to implement lock-less heartbeats we can move the storage reporting logic into a separate RPC periodically sent by DNs independently of and less frequently than regular heartbeats.
          redvine Vinitha Reddy Gankidi added a comment -

          Thanks Konstantin Shvachko for summarizing how zombies can be detected and appropriately handled using the existing mechanism in heartbeat. I am working on a patch that implements this.

          zhz Zhe Zhang added a comment -

          Some more background about TestAddOverReplicatedStripedBlocks. We developed the EC feature starting from the NameNode. To test the NameNode EC logic without the client being ready, we added several test methods to emulate blocks, such as createStripedFile and addBlockToFile. In this case, those "fake" block reports confused the NN.

          In this particular test, the below sequence happens:

          1. Client creates file on NameNode
          2. Client adds blocks to the file on NameNode without really creating the blocks on DN
          3. DN sends "fake" block reports to NN, with randomly generated storage IDs.
                  DatanodeStorage storage = new DatanodeStorage(UUID.randomUUID().toString());
                  StorageReceivedDeletedBlocks[] reports = DFSTestUtil
                      .makeReportForReceivedBlock(block,
                          ReceivedDeletedBlockInfo.BlockStatus.RECEIVED_BLOCK, storage);
                  for (StorageReceivedDeletedBlocks report : reports) {
                    ns.processIncrementalBlockReport(dn.getDatanodeId(), report);
                  }
            
          4. The above code (unintentionally) triggers the zombie storage logic because those randomly generated storages will not be in the next real BR.
          5. We inject real blocks onto the DNs. But out of 9 blocks in the group, we only injected 8. So when the NN receives the block reports triggered by cluster.triggerBlockReports(); at L257, it should delete internal block #8, which was reported in the "fake" BR but not in the real BR. The log for that is:
            [Block report processor] WARN  blockmanagement.BlockManager (BlockManager.java:removeZombieReplicas(2282)) - processReport 0xf79050ce694c3bfa: removed 1 replicas from storage 6c834645-8aec-48f2-ace8-122344e07e96, which no longer exists on the DataNode.
            

            6c834645-8aec-48f2-ace8-122344e07e96 is one of the randomly generated storages.

          I haven't fully understood how the above caused the test to fail. Hope it helps.

          arpitagarwal Arpit Agarwal added a comment -

          IIUC we need to fix this logic not just for pruning storages but also for deciding when to remove the block report lease.

          From BPServiceActor.java, we can assume at line 399 that the storage report just sent was processed successfully by the NameNode; i.e., the DataNode getting back success is sufficient to conclude the report was successfully processed.

           393         for (int r = 0; r < reports.length; r++) {
           394           StorageBlockReport singleReport[] = { reports[r] };
           395           DatanodeCommand cmd = bpNamenode.blockReport(
           396               bpRegistration, bpos.getBlockPoolId(), singleReport,
           397               new BlockReportContext(reports.length, r, reportId,
           398                   fullBrLeaseId, true));
           399           blockReportSizes.add(
           400               calculateBlockReportPBSize(useBlocksBuffer, singleReport));
           401           numReportsSent++;
           402           numRPCs++;
           403           if (cmd != null) {
           404             cmds.add(cmd);
           405           }
          

          The DN can include a flag in the last RPC message (i.e. when r == reports.length - 1) that tells the NameNode it is the last report in this batch and all previous ones were successfully processed, so it is safe to drop the lease and prune zombies.
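
          To make the idea concrete, a minimal standalone sketch of the DataNode side (the boolean flag is hypothetical, not an existing field of BlockReportContext; the types are simplified stand-ins):

            // Illustration only: the flag is hypothetical and the types are
            // simplified stand-ins for the real RPC interfaces.
            interface NameNodeRpc {
              void blockReport(Object perStorageReport, boolean lastRpcAndAllPriorProcessed);
            }

            class BlockReportSenderSketch {
              void sendSplitReport(NameNodeRpc nn, Object[] perStorageReports) {
                for (int r = 0; r < perStorageReports.length; r++) {
                  // If an earlier RPC had failed, an exception would have propagated
                  // and the whole report would be retried later, so reaching the last
                  // iteration implies all prior per-storage reports were processed.
                  boolean lastAndAllPriorProcessed = (r == perStorageReports.length - 1);
                  nn.blockReport(perStorageReports[r], lastAndAllPriorProcessed);
                }
              }
            }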

          Also +1 for Daryn Sharp's idea to ban single-RPC reports, as this approach cannot be used for single-RPC reports.

          redvine Vinitha Reddy Gankidi added a comment -

          Upon thorough investigation of the heartbeat logic, I have verified that unreported storages do get removed without any code change. Attached patch 014 eliminates the state tracking and the zombie storage removal logic introduced in HDFS-7960.
          I have added a unit test that verifies that when a DN storage with blocks is removed, this storage is removed from the DatanodeDescriptor as well and does not linger forever. Unreported storages are marked as FAILED in the updateHeartbeatState method when checkFailedStorages is true. Thus, when a DN storage is removed, it will be marked as FAILED in the next heartbeat.
          The storage removal happens in 2 steps after that (Refer Step 2 & 3 in https://issues.apache.org/jira/browse/HDFS-10301?focusedCommentId=15427387&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15427387).
          The test testRemovingStorageDoesNotProduceZombies introduced in HDFS-7960 passes by reducing the heartbeat recheck interval so that the test doesn't time out. By default, the Heartbeat Manager removes blocks associated with failed storages every 5 minutes.
          I have ignored testProcessOverReplicatedAndMissingStripedBlock in this patch. Please refer to HDFS-10854 for more details.
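
          For illustration, a minimal sketch of that kind of test setup with a reduced heartbeat recheck interval (class name, values, and assertions here are assumptions, not the actual patch 014 test code):

            import org.apache.hadoop.conf.Configuration;
            import org.apache.hadoop.hdfs.DFSConfigKeys;
            import org.apache.hadoop.hdfs.HdfsConfiguration;
            import org.apache.hadoop.hdfs.MiniDFSCluster;

            public class HeartbeatPruningSetupSketch {
              public static void main(String[] args) throws Exception {
                Configuration conf = new HdfsConfiguration();
                // Short heartbeat and recheck intervals so FAILED storages are
                // detected and their replicas removed well within the test timeout.
                conf.setLong(DFSConfigKeys.DFS_HEARTBEAT_INTERVAL_KEY, 1L);                   // seconds
                conf.setInt(DFSConfigKeys.DFS_NAMENODE_HEARTBEAT_RECHECK_INTERVAL_KEY, 1000); // ms
                MiniDFSCluster cluster = new MiniDFSCluster.Builder(conf)
                    .numDataNodes(1)
                    .build();
                try {
                  cluster.waitActive();
                  // ... remove a storage directory on the DataNode, trigger
                  // heartbeats, and assert the storage eventually disappears
                  // from the DatanodeDescriptor ...
                } finally {
                  cluster.shutdown();
                }
              }
            }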

          redvine Vinitha Reddy Gankidi added a comment -

          Arpit Agarwal In the latest patch, the BR lease is removed when context.getTotalRpcs() == context.getCurRpc() + 1. If BRs are processed out of order or interleaved, the BR lease for the DN will be removed before all the BRs from the DN are processed. So I have modified the checkLease method in BlockReportLeaseManager to return true when node.leaseId == 0. Please let me know if you see any issues with this approach.
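
          In condensed form, the rule looks roughly like this (standalone illustration; the parameters are simplified stand-ins for BlockReportLeaseManager internals, not the literal patch):

            class LeaseCheckSketch {
              boolean checkLease(long reportedLeaseId, long currentLeaseIdForNode) {
                if (reportedLeaseId == 0) {
                  return true;  // reports sent without a lease are accepted
                }
                if (currentLeaseIdForNode == 0) {
                  // The lease was already released when the final RPC of the report
                  // was processed; accept the remaining out-of-order or interleaved
                  // RPCs instead of rejecting them.
                  return true;
                }
                return reportedLeaseId == currentLeaseIdForNode;
              }
            }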

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 20s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 4 new or modified test files.
          +1 mvninstall 8m 36s trunk passed
          +1 compile 0m 50s trunk passed
          +1 checkstyle 0m 32s trunk passed
          +1 mvnsite 1m 0s trunk passed
          +1 mvneclipse 0m 13s trunk passed
          +1 findbugs 1m 55s trunk passed
          +1 javadoc 1m 1s trunk passed
          +1 mvninstall 0m 54s the patch passed
          +1 compile 0m 51s the patch passed
          +1 javac 0m 51s the patch passed
          -0 checkstyle 0m 30s hadoop-hdfs-project/hadoop-hdfs: The patch generated 3 new + 379 unchanged - 7 fixed = 382 total (was 386)
          +1 mvnsite 0m 55s the patch passed
          +1 mvneclipse 0m 12s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 2m 8s the patch passed
          +1 javadoc 0m 58s the patch passed
          -1 unit 64m 26s hadoop-hdfs in the patch failed.
          +1 asflicense 0m 17s The patch does not generate ASF License warnings.
          87m 4s



          Reason Tests
          Failed junit tests hadoop.hdfs.TestCrcCorruption



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:9560f25
          JIRA Issue HDFS-10301
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12828153/HDFS-10301.014.patch
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 25ecab498dc4 3.13.0-92-generic #139-Ubuntu SMP Tue Jun 28 20:42:26 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 72dfb04
          Default Java 1.8.0_101
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/16726/artifact/patchprocess/diff-checkstyle-hadoop-hdfs-project_hadoop-hdfs.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/16726/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt
          Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16726/testReport/
          modules C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16726/console
          Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          shv Konstantin Shvachko added a comment -

          Vinitha, thanks for your thorough research. Minor things:

          1. In DatanodeDescriptor you should also remove 3 imports and EMPTY_STORAGE_INFO_LIST, which were used in removed methods only.
          2. Take a look at the checkstyle warning; it is something about a long line there.
          3. Checked that TestCrcCorruption does not fail for me.

          Did you try to set up a sandbox cluster with dfs.blockreport.split.threshold = 1?
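
          (For anyone reproducing this, a sketch of forcing split block reports in a test configuration, assuming the DFSConfigKeys constant for dfs.blockreport.split.threshold; on a real sandbox cluster the property would go into hdfs-site.xml instead.)

            import org.apache.hadoop.conf.Configuration;
            import org.apache.hadoop.hdfs.DFSConfigKeys;
            import org.apache.hadoop.hdfs.HdfsConfiguration;

            public class SplitBlockReportConfSketch {
              public static void main(String[] args) {
                Configuration conf = new HdfsConfiguration();
                // With the threshold at 1, any DataNode holding at least one block
                // sends its block report as one RPC per storage instead of a single
                // combined RPC, which makes interleaving scenarios reproducible.
                conf.setLong(DFSConfigKeys.DFS_BLOCKREPORT_SPLIT_THRESHOLD_KEY, 1L);
              }
            }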

          arpitagarwal Arpit Agarwal added a comment -

          I have modified the checkLease method in BlockReportLeaseManager to return true when node.leaseId == 0. Please let me know if you see any issues with this approach.

          Vinitha Reddy Gankidi, IIUC this workaround bypasses the lease checks but the last report detection logic still remains broken. I am no longer sure zombie removal in heartbeats is safe and I was probably wrong to add it in HDFS-7596. Zombie removal is safe just after processing all storage reports from a full block report. So I think we should fix "last report detection".

          I believe the following two changes will fix this problem (same suggestion as my previous comment):

          1. The DataNode sends a flag with the last RPC message that indicates all the previous reports have been successfully processed. This is guaranteed to be correct and removes the burden from the NN.
          2. Eliminate single-RPC reports as Daryn suggested.

          Any thoughts on this?

          Thanks Konstantin and Vinitha for reporting this problem and your marathon efforts to fix it. It is a hard problem so I request we aim for consensus before committing a fix.

          redvine Vinitha Reddy Gankidi added a comment -

          Arpit Agarwal Storage reports are anyway sent in heartbeats and these reports have the information required to prune zombie storages. These storages are only marked as FAILED in the heartbeat. The replicas are removed in background by the HeartbeatManager. Why exactly do you think zombie removal in heartbeats is not safe? Why do we need to wait for all storage block reports from a FBR?

          arpitagarwal Arpit Agarwal added a comment - edited

          I don't think it is safe to remove storages (and hence block replicas from memory) when the NameNode doesn't have up-to-date block replica state, because the block->storage mapping on the NameNode can be stale, e.g. due to the disk balancer moving replicas, or due to the way VolumeChoosingPolicy picks storages for new blocks.

          shv Konstantin Shvachko added a comment -

          It is still not clear which scenario concerns you. Arpit, could you please clarify?

          • The Balancer copies a replica from a source DN to a target DN and, when finished, sends an IBR with the target as the new replica location and a hint to remove the old replica from the source DN. If the source or the target storage fails during this, the transfer fails and the Balancer moves on. If either of the storages fails after the transfer, it is the same as a regular failure: the block becomes under-replicated and is recovered in due time.
          • For VolumeChoosingPolicy it is even more important to know early which storages failed in order to avoid choosing them as targets.

          In fact the code path of zombie storage removal via FBRs (introduced by HDFS-7960) is practically never triggered. Because heartbeats are much more frequent, the removal of zombies goes through heartbeats. So if this were unsafe, as you assume, we should already have the evidence, since it is happening right now.
          I agree this is complex, but we've learned a lot and now have a very good understanding of the workflow. Let's reach consensus. I thought we had a silent one, because nobody commented until the patch was submitted. It takes a lot of time and testing, on multiple branches, so waiting till the last moment is not productive.

          arpitagarwal Arpit Agarwal added a comment -

          Balancer copies a replica from a source DN to a target DN and when finished sends IBR with the target as a new replica location and a hint to remove old replica from the source DN. If the source or the target storage fails during this the transfer fails and Balancer moves on. If either of the storages fail after the transfer it is the same as the regular failure, the block will become under-replicated and recovered in due time.

          We've seen that IBRs are often delayed when the NN is overloaded, so the NN's view of the replica map can lag. But I agree that leaving zombie removals to heartbeats only fixes this bug and leaves us no worse than where we are today. The FBR vs. heartbeat discussion can be separate. If we go this way, let's fix the detection properly though. The last patch just no-ops the lease ID checks.

          For VolumeChoosingPolicy it is even more important to know early which storages failed in order to avoid choosing them as targets.

          By the way, the storage chosen by the NN is never used. The DN always uses the result of running the volume choosing policy locally.

          jingzhao Jing Zhao added a comment -

          Thanks for all the effort on this tricky issue, Vinitha Reddy Gankidi. One question about the latest patch: in updateHeartbeatState, checkFailedStorages is set to true only when either the DN reports a failed storage or the heartbeat is the first one since registration. Can this cover the DN hotswap case? For DN hotswap, I think the DN only sends an FBR to notify the NN about the change. Then if a fresh disk is used to replace a slow (but not failed) disk in hotswap, will we still hit