[HDFS-2243] DataXceiver per accept seems to be a bottleneck in HBase/YCSB test - ASF JIRA

Details

Type: Bug
Status: Resolved
Priority: Minor
Resolution: Won't Fix
Affects Version/s: 0.23.0
Fix Version/s: None
Component/s: datanode
Labels: None
Environment: Using Fedora 14 on a quad-core Phenom system

Description

I am running the YCSB benchmark against HBase, sometimes against a single node and sometimes against a cluster of 6 systems. As the load increases into the thousands of TPS, especially on the single node, the datanode shows very high system time and appears to be bottlenecked by how fast it can create threads to handle new connections in DataXceiverServer.run. In "perf top" the process spends about 12% of its time in pthread_create, and hprof profiles show tens of thousands of threads created in just a few minutes of test execution.

Does anyone else observe this bottleneck? Is there a major challenge to using a thread pool of DataXceivers in this situation?
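
To make the question concrete, here is a minimal sketch contrasting a thread created per accepted connection with a fixed-size thread pool. It is not the DataXceiverServer code: the class XceiverPoolSketch, the handleConnection stand-in, and the pool size are all hypothetical, and the sketch only counts how many threads each approach creates.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.ThreadFactory;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical illustration only; not taken from the HDFS source tree.
public class XceiverPoolSketch {

    // Stand-in for the per-connection work (assumption: short and non-blocking).
    static void handleConnection(AtomicInteger handled) {
        handled.incrementAndGet();
    }

    // Thread-per-accept: every connection pays for one thread creation.
    static int threadsCreatedPerAccept(int connections) {
        AtomicInteger handled = new AtomicInteger();
        Thread[] threads = new Thread[connections];
        int created = 0;
        for (int i = 0; i < connections; i++) {
            threads[i] = new Thread(() -> handleConnection(handled));
            created++;
            threads[i].start();
        }
        try {
            for (Thread t : threads) {
                t.join();
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return created;
    }

    // Pooled: connections are queued and served by at most poolSize reused threads.
    static int threadsCreatedPooled(int connections, int poolSize) {
        AtomicInteger handled = new AtomicInteger();
        AtomicInteger created = new AtomicInteger();
        // Counting factory so the number of real thread creations is observable.
        ThreadFactory countingFactory = r -> {
            created.incrementAndGet();
            return new Thread(r);
        };
        ExecutorService pool = Executors.newFixedThreadPool(poolSize, countingFactory);
        for (int i = 0; i < connections; i++) {
            pool.execute(() -> handleConnection(handled));
        }
        pool.shutdown();
        try {
            pool.awaitTermination(10, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return created.get();
    }

    public static void main(String[] args) {
        System.out.println("per-accept threads created: " + threadsCreatedPerAccept(1000));
        System.out.println("pooled threads created:     " + threadsCreatedPooled(1000, 8));
    }
}
```

One trade-off to keep in mind: a fixed-size pool caps the number of connections served concurrently, so long-lived transfers could keep new connections waiting in the queue. Whether that is acceptable for DataXceiver traffic is exactly the open question here.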

Attachments

datanode-perf-110808.gif, 10/Aug/11 20:31, 9 kB, Eric Caspole
HDFS-2234-branch-0.20-append.patch, 23/Aug/11 15:42, 10 kB, Eric Caspole
HDFS-2243-0.23-110909.patch, 16/Sep/11 15:19, 3 kB, Eric Caspole
HDFS-2243-0.23-110909.txt, 12/Sep/11 13:47, 3 kB, Eric Caspole

Issue Links

relates to: HDFS-918 Use single Selector and small thread pool to replace many instances of BlockSender for reads (Open)

People

Assignee: Unassigned
Reporter: Eric Caspole
Votes: 0
Watchers: 6

Dates

Created: 10/Aug/11 19:37
Updated: 28/Sep/15 21:08
Resolved: 26/Sep/11 13:37