[HADOOP-4679] Datanode prints tons of log messages: Waiting for threadgroup to exit, active threads is XX - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.18.3
Component/s: None
Labels: None
Hadoop Flags: Reviewed
Release Note:
1. Only the datanode's offerService thread shuts down the datanode, which avoids the deadlock.
2. The datanode checks the disk when it fails to create a block file.
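The second point of the release note can be illustrated with a minimal sketch. It is not the code from the attached patches: the class `BlockFileCreateSketch`, the `Outcome` enum, and the helper `dirIsHealthy` are hypothetical names, and the health check is reduced to "the directory exists and is readable and writable". The idea shown is only that a failed block-file creation triggers a probe of the directory, so a bad disk can be told apart from other causes of failure.

```java
import java.io.File;
import java.io.IOException;

public class BlockFileCreateSketch {
    /** Hypothetical result type; not part of Hadoop. */
    public enum Outcome { CREATED, ALREADY_EXISTS, TRANSIENT_FAILURE, DISK_ERROR }

    /** Assumed stand-in for a disk check: the directory must exist and be usable. */
    static boolean dirIsHealthy(File dir) {
        return dir.isDirectory() && dir.canRead() && dir.canWrite();
    }

    /**
     * Tries to create a block file. Only when creation throws does it probe
     * the directory, to decide whether the disk itself looks broken.
     */
    public static Outcome createBlockFile(File dir, String name) {
        File f = new File(dir, name);
        try {
            // createNewFile returns false if the file is already there.
            return f.createNewFile() ? Outcome.CREATED : Outcome.ALREADY_EXISTS;
        } catch (IOException e) {
            // Creation failed: check the disk before deciding how to react.
            return dirIsHealthy(dir) ? Outcome.TRANSIENT_FAILURE : Outcome.DISK_ERROR;
        }
    }

    public static void main(String[] args) {
        File dir = new File(System.getProperty("java.io.tmpdir"), "blk-sketch-" + System.nanoTime());
        dir.mkdirs();
        System.out.println(createBlockFile(dir, "blk_1"));
        System.out.println(createBlockFile(new File(dir, "missing"), "blk_2"));
    }
}
```

In this sketch the caller receives `DISK_ERROR` as a value and decides what to do with it; it does not shut anything down itself, which matches the first point of the release note.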

Description

When a data receiver thread sees a disk error, it immediately calls shutdown to shut down the DataNode. But the shutdown method does not return until all data receiver threads have exited, and that never happens because the calling thread is itself a data receiver thread blocked inside shutdown. The DataNode therefore gets into a dead/live lock state, emitting tons of log messages: Waiting for threadgroup to exit, active threads is XX.
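The way out of this kind of self-wait can be sketched as follows. This is an illustration under stated assumptions, not the DataNode code: `ShutdownHandoffSketch`, `receiveData`, and `runAndShutdown` are hypothetical names, and a plain flag plus a latch stand in for whatever signalling the real patch uses. A receiver that hits a disk error only records the request and returns; a single service thread, which is not a receiver, is the one that waits for the receivers to exit.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.atomic.AtomicBoolean;

public class ShutdownHandoffSketch {
    // Cleared by a receiver that hits a fatal error; polled by the other receivers.
    private final AtomicBoolean shouldRun = new AtomicBoolean(true);
    // Lets the service thread block until some receiver has reported an error.
    private final CountDownLatch errorSeen = new CountDownLatch(1);

    /** Receiver side: request shutdown and return; never wait for other receivers here. */
    private void receiveData(boolean diskFails) {
        if (diskFails) {
            shouldRun.set(false);
            errorSeen.countDown();
            return; // this thread exits, so it cannot block the shutdown
        }
        while (shouldRun.get()) {
            try {
                Thread.sleep(5); // simulated work
            } catch (InterruptedException e) {
                return;
            }
        }
    }

    /** Service-thread side: the only place that joins the receivers. */
    public boolean runAndShutdown(int healthyReceivers, long joinTimeoutMs) {
        List<Thread> receivers = new ArrayList<>();
        for (int i = 0; i < healthyReceivers; i++) {
            receivers.add(new Thread(() -> receiveData(false)));
        }
        receivers.add(new Thread(() -> receiveData(true)));
        for (Thread t : receivers) {
            t.start();
        }
        try {
            errorSeen.await();
            for (Thread t : receivers) {
                t.interrupt();
            }
            for (Thread t : receivers) {
                t.join(joinTimeoutMs);
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
        for (Thread t : receivers) {
            if (t.isAlive()) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        boolean clean = new ShutdownHandoffSketch().runAndShutdown(3, 2000);
        System.out.println("all receivers exited: " + clean);
    }
}
```

Because the failing receiver returns instead of joining its peers, the count of live receiver threads can actually reach zero, and the waiting loop terminates.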

Attachments

diskError.patch, 25/Nov/08 01:24, 6 kB, Hairong Kuang
diskError1.patch, 25/Nov/08 18:47, 6 kB, Hairong Kuang
diskError2.patch, 26/Nov/08 22:21, 8 kB, Hairong Kuang
diskError3.patch, 02/Dec/08 21:59, 9 kB, Hairong Kuang
diskError3-br18.patch, 03/Dec/08 20:02, 7 kB, Hairong Kuang

Issue Links

is related to
HADOOP-4962 HADOOP-4679 to be fixed for branches >= 0.19 (Closed)

relates to
HDFS-264 Better Datanode DiskOutOfSpaceException handling. (Resolved)
HADOOP-5114 A bunch of mapred unit tests are failing on Windows (Resolved)

People

Assignee: Hairong Kuang
Reporter: Hairong Kuang
Votes: 0
Watchers: 0

Dates

Created: 18/Nov/08 19:27
Updated: 08/Jul/09 16:43
Resolved: 03/Dec/08 20:07