HBASE-9268

Client doesn't recover from a stalled region server

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.95.2
    • Fix Version/s: 0.98.0, 0.96.0
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      Got this testing the 0.95.2 RC.

      I kill -STOPped a region server and left it like that while running PE. The clients didn't find the new region locations and, per the jstack, were stuck doing RPC. Eventually I kill -CONTed it and the client printed this:

      Exception in thread "TestClient-6" java.lang.RuntimeException: org.apache.hadoop.hbase.client.RetriesExhaustedWithDetailsException: Failed 128 actions: IOException: 90 times, SocketTimeoutException: 38 times,

      Attachments

      1. 9268.v1.patch
        2 kB
        Nicolas Liochon
      2. 9268-hack.patch
        0.8 kB
        Nicolas Liochon

        Activity

        Hudson added a comment -

        SUCCESS: Integrated in HBase-TRUNK-on-Hadoop-2.0.0 #697 (See https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/697/)
        HBASE-9268 Client doesn't recover from a stalled region server (nkeywal: rev 1517108)

        • /hbase/trunk/hbase-client/src/main/java/org/apache/hadoop/hbase/ipc/RpcClient.java
        Hudson added a comment -

        FAILURE: Integrated in hbase-0.95-on-hadoop2 #272 (See https://builds.apache.org/job/hbase-0.95-on-hadoop2/272/)
        HBASE-9268 Client doesn't recover from a stalled region server (nkeywal: rev 1517109)

        • /hbase/branches/0.95/hbase-client/src/main/java/org/apache/hadoop/hbase/ipc/RpcClient.java
        Hudson added a comment -

        FAILURE: Integrated in hbase-0.95 #493 (See https://builds.apache.org/job/hbase-0.95/493/)
        HBASE-9268 Client doesn't recover from a stalled region server (nkeywal: rev 1517109)

        • /hbase/branches/0.95/hbase-client/src/main/java/org/apache/hadoop/hbase/ipc/RpcClient.java
        Hudson added a comment -

        SUCCESS: Integrated in HBase-TRUNK #4434 (See https://builds.apache.org/job/HBase-TRUNK/4434/)
        HBASE-9268 Client doesn't recover from a stalled region server (nkeywal: rev 1517108)

        • /hbase/trunk/hbase-client/src/main/java/org/apache/hadoop/hbase/ipc/RpcClient.java
        Nicolas Liochon added a comment -

        Committed on 0.95 & trunk. I don't know why it doesn't show up on 0.94.
        Thanks for the finding, the tests, and the review, JD! Thanks for the review, Stack.

        stack added a comment -

        +1

        I put in an attempted fix for the above unrelated TestNamespaceUpgrade failure.

        Jean-Daniel Cryans added a comment -

        +1 on v1, tried it and it was seamless.

        Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12599598/9268.v1.patch
        against trunk revision .

        +1 @author. The patch does not contain any @author tags.

        -1 tests included. The patch doesn't appear to include any new or modified tests.
        Please justify why no new tests are needed for this patch.
        Also please list what manual steps were performed to verify this patch.

        +1 hadoop1.0. The patch compiles against the hadoop 1.0 profile.

        +1 hadoop2.0. The patch compiles against the hadoop 2.0 profile.

        +1 javadoc. The javadoc tool did not generate any warning messages.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

        +1 release audit. The applied patch does not increase the total number of release audit warnings.

        +1 lineLengths. The patch does not introduce lines longer than 100

        +1 site. The mvn site goal succeeds with this patch.

        -1 core tests. The patch failed these unit tests:
        org.apache.hadoop.hbase.migration.TestNamespaceUpgrade

        Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/6856//testReport/
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6856//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6856//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6856//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6856//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6856//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6856//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6856//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6856//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
        Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/6856//console

        This message is automatically generated.

        Nicolas Liochon added a comment -

        I tried 0.94 on a pseudo cluster. It seems to work well 90% of the time (that is, I had a failure).
        A possible explanation is that the writes won't block until the server-side buffer is full (a side effect of kill -STOP: the socket handling is done by the OS, not the process), and that the 0.95 message size is bigger than the 0.94 one (why would it be?). It's not very satisfying. The patch seems to work, however, so there is a solution that makes sense even if I don't fully understand the 0.94 scenario. I will spend more time on this.

        The stack when it works on 0.94 is

        Caused by: java.net.SocketTimeoutException: 60000 millis timeout while waiting for channel to be ready for read. ch : java.nio.channels.SocketChannel[connected local=/127.0.0.1:42395 remote=sd-box/127.0.0.1:60020]
                at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:164)
                at org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:155)
                at org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:128)
                at java.io.FilterInputStream.read(FilterInputStream.java:116)
                at org.apache.hadoop.hbase.ipc.HBaseClient$Connection$PingInputStream.read(HBaseClient.java:373)
                at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
                at java.io.BufferedInputStream.read(BufferedInputStream.java:237)
                at java.io.DataInputStream.readInt(DataInputStream.java:370)
                at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.receiveResponse(HBaseClient.java:646)
                at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.run(HBaseClient.java:580)
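        The stack above shows the read path failing fast, because setSoTimeout bounds blocking reads. A minimal, self-contained sketch of that behavior (illustrative only, not HBase code; the class name and timeout value are invented for the demo):

```java
import java.io.IOException;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;
import java.net.SocketTimeoutException;

public class ReadTimeoutDemo {
    // setSoTimeout bounds blocking reads only; a blocking write to a stalled
    // peer (e.g. one stopped with kill -STOP) has no equivalent deadline.
    static boolean readTimesOut() throws IOException {
        try (ServerSocket server = new ServerSocket(0, 1, InetAddress.getLoopbackAddress());
             Socket client = new Socket(InetAddress.getLoopbackAddress(), server.getLocalPort());
             Socket peer = server.accept()) {
            client.setSoTimeout(200); // read timeout, in milliseconds
            try {
                client.getInputStream().read(); // peer never writes, so this blocks
                return false;
            } catch (SocketTimeoutException expected) {
                return true; // the read fails fast, as in the stack above
            }
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println("read timed out: " + readTimesOut());
    }
}
```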
        
        Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12599207/9268-hack.patch
        against trunk revision .

        +1 @author. The patch does not contain any @author tags.

        -1 tests included. The patch doesn't appear to include any new or modified tests.
        Please justify why no new tests are needed for this patch.
        Also please list what manual steps were performed to verify this patch.

        +1 hadoop1.0. The patch compiles against the hadoop 1.0 profile.

        +1 hadoop2.0. The patch compiles against the hadoop 2.0 profile.

        -1 javadoc. The javadoc tool appears to have generated 2 warning messages.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

        +1 release audit. The applied patch does not increase the total number of release audit warnings.

        +1 lineLengths. The patch does not introduce lines longer than 100

        +1 site. The mvn site goal succeeds with this patch.

        +1 core tests. The patch passed unit tests in .

        Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/6837//testReport/
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6837//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6837//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6837//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6837//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6837//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6837//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6837//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
        Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6837//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
        Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/6837//console

        This message is automatically generated.

        Nicolas Liochon added a comment -

        How do you define "new" here? I haven't tested it but I'm pretty sure this wasn't an issue in 0.94.

        Yeah, that's the 0.94 I was thinking about. The code has changed a lot in this area, but 0.94's HBaseClient.java seems to be like 0.95: no timeouts on writes. So I wonder what extra logic makes it work on 0.94 (it's not purely theoretical: we could have this issue somewhere else in 0.95). I'm going to try 0.94 to be sure.

        It worked fine with HBASE-7590 once I fixed the class name in the release note

        Thanks for the test and the fix, JD.

        Jean-Daniel Cryans added a comment -

        Looking at the code, I don't think it's a new issue. JD, what do you think?

        How do you define "new" here? I haven't tested it but I'm pretty sure this wasn't an issue in 0.94.

        btw, I'm interested to know if you have the same issue when you activate HBASE-7590 (it should work well).

        It worked fine with HBASE-7590 once I fixed the class name in the release note

        Nicolas Liochon added a comment -

        btw, I'm interested to know if you have the same issue when you activate HBASE-7590 (it should work well).

        Nicolas Liochon added a comment -

        Hum. Different points:

        • 38 is the number of puts that failed with a SocketTimeout. As it's a multi put, it's likely a single message. It does not mean that the client retried 38 times.
        • We do a socket#setSoTimeout, but this applies only to reads, not to writes.
        • It's not possible to set a write timeout in Java without using the NIO API.
        • HDFS added SocketOutputStream back in HADOOP-2346, but HBase does not use it.
        • The API to use is NetUtils.getOutputStream(socket, timeout); tested, it works.
        • We can use it, but the API does not allow changing the timeout on the fly as we do.
        • I'm not sure of the time needed by ZooKeeper to decide that the server was dead. The tests were strange.

        So, synthesis is:

        • Looking at the code, I don't think it's a new issue. JD, what do you think?
        • It seems we can fix or improve things here. I will give it a try.
        • I need to double check the zookeeper stuff.
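        The NIO approach mentioned above can be sketched as follows. This is a simplified illustration of the selector-based technique that NetUtils.getOutputStream relies on, not Hadoop's actual SocketOutputStream implementation; class names, buffer sizes, and timeouts here are invented for the demo:

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.SocketTimeoutException;
import java.nio.ByteBuffer;
import java.nio.channels.SelectionKey;
import java.nio.channels.Selector;
import java.nio.channels.ServerSocketChannel;
import java.nio.channels.SocketChannel;

public class WriteTimeoutDemo {
    // Write buf fully, waiting at most timeoutMs for the channel to become
    // writable; the channel must be in non-blocking mode.
    static void writeWithTimeout(SocketChannel ch, ByteBuffer buf, long timeoutMs)
            throws IOException {
        try (Selector sel = Selector.open()) {
            ch.register(sel, SelectionKey.OP_WRITE);
            while (buf.hasRemaining()) {
                if (sel.select(timeoutMs) == 0) {
                    throw new SocketTimeoutException(timeoutMs + " ms write timeout");
                }
                sel.selectedKeys().clear();
                ch.write(buf); // may write 0 bytes if the send buffer is full
            }
        }
    }

    // Simulates a stalled peer: the server accepts but never reads, so the
    // client's writes eventually fill the OS buffers and the write times out
    // instead of blocking forever.
    static boolean writeTimesOut() throws IOException {
        try (ServerSocketChannel server = ServerSocketChannel.open()) {
            server.bind(new InetSocketAddress("127.0.0.1", 0));
            try (SocketChannel client = SocketChannel.open(server.getLocalAddress());
                 SocketChannel stalledPeer = server.accept()) {
                client.configureBlocking(false);
                ByteBuffer chunk = ByteBuffer.allocate(64 * 1024);
                try {
                    for (int i = 0; i < 1024; i++) { // up to 64 MB, enough to fill buffers
                        chunk.clear();
                        writeWithTimeout(client, chunk, 300);
                    }
                    return false;
                } catch (SocketTimeoutException expected) {
                    return true;
                }
            }
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println("write timed out: " + writeTimesOut());
    }
}
```

        Once the send and receive buffers fill, OP_WRITE stops firing and select() returns 0 after the deadline, which is exactly the knob a plain blocking OutputStream lacks.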
        Jean-Daniel Cryans added a comment -

        Yup, but if you look at the log message I posted in this jira's description, it says it got a SocketTimeout 38 times!

        Nicolas Liochon added a comment -

        I've played with a pseudo-distributed cluster + YCSB and got this when I kill -STOP the region server:

           java.lang.Thread.State: BLOCKED (on object monitor)
                at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123)
                - waiting to lock <0x00000007de6af410> (a java.io.BufferedOutputStream)
                at java.io.DataOutputStream.flush(DataOutputStream.java:106)
                at java.io.FilterOutputStream.close(FilterOutputStream.java:140)
                at org.apache.hadoop.io.IOUtils.cleanup(IOUtils.java:232)
                at org.apache.hadoop.io.IOUtils.closeStream(IOUtils.java:248)
                at org.apache.hadoop.hbase.ipc.RpcClient$Connection.close(RpcClient.java:963)
                - locked <0x00000007de6ab808> (a org.apache.hadoop.hbase.ipc.RpcClient$Connection)
                at org.apache.hadoop.hbase.ipc.RpcClient$Connection.run(RpcClient.java:718)
        
        "hbase-table-pool-1-thread-6" daemon prio=10 tid=0x00007f93000ce800 nid=0x649c runnable [0x00007f932aa85000]
           java.lang.Thread.State: RUNNABLE
                at sun.nio.ch.EPollArrayWrapper.epollWait(Native Method)
                at sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:210)
                at sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65)
                at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69)
                - locked <0x00000007deb3ff60> (a sun.nio.ch.Util$2)
                - locked <0x00000007deb3ff50> (a java.util.Collections$UnmodifiableSet)
                - locked <0x00000007deb3fd48> (a sun.nio.ch.EPollSelectorImpl)
                at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80)
                at org.apache.hadoop.net.SocketIOWithTimeout$SelectorPool.select(SocketIOWithTimeout.java:332)
                at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:157)
                at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:146)
                at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:107)
                at java.io.BufferedOutputStream.write(BufferedOutputStream.java:105)
                - locked <0x00000007de6af410> (a java.io.BufferedOutputStream)
                at java.io.DataOutputStream.write(DataOutputStream.java:90)
                - locked <0x00000007de6af3f0> (a java.io.DataOutputStream)
                at org.apache.hadoop.hbase.ipc.IPCUtil.write(IPCUtil.java:230)
                at org.apache.hadoop.hbase.ipc.IPCUtil.write(IPCUtil.java:220)
                at org.apache.hadoop.hbase.ipc.RpcClient$Connection.writeRequest(RpcClient.java:1039)
                - locked <0x00000007de6af3f0> (a java.io.DataOutputStream)
                at org.apache.hadoop.hbase.ipc.RpcClient.call(RpcClient.java:1407)
                at org.apache.hadoop.hbase.ipc.RpcClient.callBlockingMethod(RpcClient.java:1635)
        

        It's exactly as if the timeout on the socket was not set. Strange.

        Nicolas Liochon added a comment -

        Jean-Daniel Cryans
        I won't have access to a real cluster this week, but I would like to have a look. Could you please attach or send me the client logs?

        Jean-Daniel Cryans added a comment -

        Oh, and when it fails after I kill -CONT, I missed that I was getting this:

        2013-08-19 23:37:03,468 DEBUG [TestClient-11] client.ClientScanner: Finished region={ENCODED => 1588230740, NAME => 'hbase:meta,,1', STARTKEY => '', ENDKEY => ''}
        Exception in thread "TestClient-8" java.lang.NullPointerException
        	at org.apache.hadoop.hbase.client.AsyncProcess.findDestLocation(AsyncProcess.java:294)
        	at org.apache.hadoop.hbase.client.AsyncProcess.submit(AsyncProcess.java:239)
        	at org.apache.hadoop.hbase.client.HTable.backgroundFlushCommits(HTable.java:894)
        	at org.apache.hadoop.hbase.client.HTable.flushCommits(HTable.java:1275)
        	at org.apache.hadoop.hbase.PerformanceEvaluation$Test.testTakedown(PerformanceEvaluation.java:853)
        	at org.apache.hadoop.hbase.PerformanceEvaluation$Test.test(PerformanceEvaluation.java:870)
        	at org.apache.hadoop.hbase.PerformanceEvaluation.runOneClient(PerformanceEvaluation.java:1209)
        	at org.apache.hadoop.hbase.PerformanceEvaluation$1.run(PerformanceEvaluation.java:585)
        
        Jean-Daniel Cryans added a comment -

        It's actually MultiServerCallable that gets stuck; I don't get this issue while reading. I see all my clients stuck on:

        "hbase-table-pool-16-thread-1" daemon prio=10 tid=0x00007f2e8487e000 nid=0x5952 waiting for monitor entry [0x00007f2e5da05000]
           java.lang.Thread.State: BLOCKED (on object monitor)
        	at org.apache.hadoop.hbase.ipc.RpcClient$Connection.writeRequest(RpcClient.java:1036)
        	- waiting to lock <0x00000000c40526a0> (a java.io.DataOutputStream)
        	at org.apache.hadoop.hbase.ipc.RpcClient.call(RpcClient.java:1403)
        	at org.apache.hadoop.hbase.ipc.RpcClient.callBlockingMethod(RpcClient.java:1630)
        	at org.apache.hadoop.hbase.ipc.RpcClient$BlockingRpcChannelImplementation.callBlockingMethod(RpcClient.java:1687)
        	at org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$BlockingStub.multi(ClientProtos.java:21274)
        	at org.apache.hadoop.hbase.client.MultiServerCallable.call(MultiServerCallable.java:105)
        	at org.apache.hadoop.hbase.client.MultiServerCallable.call(MultiServerCallable.java:43)
        	at org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithoutRetries(RpcRetryingCaller.java:183)
        	at org.apache.hadoop.hbase.client.AsyncProcess$1.run(AsyncProcess.java:420)
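        The monitor pile-up in this jstack can be reproduced in miniature: one thread stalls while holding the connection's shared stream lock, and every subsequent caller goes BLOCKED behind it. A hedged, self-contained sketch (illustrative only, not HBase code; the names are invented for the demo):

```java
public class SharedStreamStallDemo {
    // Stands in for the per-connection DataOutputStream that writeRequest locks.
    static final Object sharedStream = new Object();

    // Returns the state of a second caller while the first one is stalled
    // inside the lock; expected to be BLOCKED, matching the jstack above.
    static Thread.State secondCallerState() throws InterruptedException {
        Thread stalledWriter = new Thread(() -> {
            synchronized (sharedStream) { // emulates a write that never returns
                try { Thread.sleep(1000); } catch (InterruptedException ignored) {}
            }
        });
        Thread nextCaller = new Thread(() -> { synchronized (sharedStream) { } });
        stalledWriter.start();
        Thread.sleep(200);          // let the stalled writer take the monitor
        nextCaller.start();
        Thread.sleep(200);          // let the next caller reach the monitor
        Thread.State s = nextCaller.getState();
        stalledWriter.join();
        nextCaller.join();
        return s;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(secondCallerState()); // expected: BLOCKED
    }
}
```

        This is why a single stalled region server takes down every client thread sharing the same connection: the lock is held for the duration of a write that has no deadline.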
        

          People

          • Assignee: Nicolas Liochon
          • Reporter: Jean-Daniel Cryans
          • Votes: 0
          • Watchers: 9