[HDFS-3990] NN's health report has severe performance problems - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Critical
Resolution: Fixed
Affects Version/s: 0.23.0, 2.0.0-alpha, 3.0.0-alpha1
Fix Version/s: 2.0.3-alpha, 0.23.5
Component/s: namenode
Labels:
None

Target Version/s:

0.23.4, 2.0.3-alpha
Hadoop Flags:

Reviewed

Description

The dfshealth page will place a read lock on the namespace while it does a dns lookup for every DN. On a multi-thousand node cluster, this often results in 10s+ load time for the health page. 10 concurrent requests were found to cause 7m+ load times during which time write operations blocked.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HDFS-3990.branch-0.23.patch
30/Oct/12 17:27
12 kB
Daryn Sharp
HDFS-3990.branch-0.23.patch
18/Oct/12 00:31
13 kB
Daryn Sharp
HDFS-3990.patch
30/Oct/12 17:28
9 kB
Daryn Sharp
HDFS-3990.patch
22/Oct/12 19:40
9 kB
Daryn Sharp
HDFS-3990.patch
17/Oct/12 21:58
10 kB
Daryn Sharp
HDFS-3990.patch
17/Oct/12 14:19
12 kB
Daryn Sharp
HDFS-3990.patch
16/Oct/12 22:30
11 kB
Daryn Sharp
HDFS-3990.patch
16/Oct/12 18:09
11 kB
Daryn Sharp
HDFS-3990.patch
15/Oct/12 14:58
13 kB
Daryn Sharp
HDFS-3990.patch
12/Oct/12 19:42
10 kB
Daryn Sharp
hdfs-3990.txt
17/Oct/12 06:03
13 kB
Eli Collins
hdfs-3990.txt
15/Oct/12 22:55
4 kB
Eli Collins

Issue Links

breaks

HDFS-4269 DatanodeManager#registerDatanode rejects all datanode registrations from localhost in single-node developer setup

Closed

HDFS-5338 Add a conf to disable hostname check in DN registration

Closed

relates to

HDFS-4702 remove namesystem lock from DatanodeManager#fetchDatanodes

Resolved

HDFS-3998 Speed up fsck

Open

requires

HDFS-4068 DatanodeID and DatanodeInfo member should be private

Closed

Activity

People

Assignee:: Daryn Sharp

Reporter:: Daryn Sharp

Votes:: 0 Vote for this issue

Watchers:: 22 Start watching this issue

Dates

Created:: 28/Sep/12 13:40

Updated:: 12/May/16 18:12

Resolved:: 09/Nov/12 01:01