[HBASE-9843] Various fixes in client code - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.96.0
Fix Version/s: 0.98.0, 0.96.1
Component/s: Client
Labels:
None

Hadoop Flags:

Reviewed

Description

This mainly fixes issues when we had "long" errors, for example a multi blocked when trying to obtain a lock that was finally failing after 60s. Previously we were trying only for 5 minutes. We now do all the tries. I've fixed stuff around this area to make it work.

There is also more logs.

I've changed the back off array. With the default pause of 100ms, even after 20 tries we still retry every 10s.

I've also changed the max per RS to something minimal. If the cluster is not in a very good state it's less aggressive. It seems to be a better default.

I've done two tests:

on a small; homogeneous cluster, I had the same performances
on a bigger, but heterogeneous cluster it was twice as fast.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

9843-trunk.v2.patch
25/Oct/13 19:48
29 kB
Nicolas Liochon
9843-trunk.v3.patch
25/Oct/13 23:47
30 kB
Nicolas Liochon

Issue Links

contains

HBASE-9787 HCM should not stop retrying after retry timeout if the retry count is not exhausted

Closed

Activity

People

Assignee:: Nicolas Liochon

Reporter:: Nicolas Liochon

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 25/Oct/13 17:05

Updated:: 16/Dec/13 18:46

Resolved:: 30/Oct/13 13:08