[HBASE-7590] Add a costless notifications mechanism from master to regionservers & clients - ASF JIRA

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.95.2
Fix Version/s: 0.98.0, 0.95.0
Component/s: Client, master, regionserver
Labels:
None

Hadoop Flags:

Reviewed
Release Note:

Hide
This allows to setup a multicast connection between the master and the hbase clients. With the feature on, when a regionserver is marked as dead by the master, the master sends as well a multicast message that will make the hbase client to disconnect immediately from the dead server instead of waiting for a socket timeout. Specifically, this allows to set hbase.rpc.timeout to larger values (like 5 minutes) without impacting the MTTR: without this, even if the dead regionserver data is now available on another server, the client stays on the dead one, waiting for an answer that will never come. It's a multicast message, hence cheap, scalable, but unreliable. For this reason, the master sends the information 5 times, to allow the hbase client to miss a message. This feature is NOT activated by default. To activate it, add to your hbase-site.xml:

  <property>
    <name>hbase.status.published</name>
    <value>true</value>
  </property>

You can as well configure the ip address and port used with the following setting:
<property>
<name>hbase.status.multicast.address.ip</name>
<value>226.1.1.3</value>
</property>

<property>
<name>hbase.status.multicast.address.port</name>
<value>6100</value>
</property>

Show
This allows to setup a multicast connection between the master and the hbase clients. With the feature on, when a regionserver is marked as dead by the master, the master sends as well a multicast message that will make the hbase client to disconnect immediately from the dead server instead of waiting for a socket timeout. Specifically, this allows to set hbase.rpc.timeout to larger values (like 5 minutes) without impacting the MTTR: without this, even if the dead regionserver data is now available on another server, the client stays on the dead one, waiting for an answer that will never come. It's a multicast message, hence cheap, scalable, but unreliable. For this reason, the master sends the information 5 times, to allow the hbase client to miss a message. This feature is NOT activated by default. To activate it, add to your hbase-site.xml:   <property>     <name>hbase.status.published</name>     <value>true</value>   </property> You can as well configure the ip address and port used with the following setting: <property> <name>hbase.status.multicast.address.ip</name> <value>226.1.1.3</value> </property> <property> <name>hbase.status.multicast.address.port</name> <value>6100</value> </property>
Tags:
0.96notable

Description

t would be very useful to add a mechanism to distribute some information to the clients and regionservers. Especially It would be useful to know globally (regionservers + clients apps) that some regionservers are dead. This would allow:

to lower the load on the system, without clients using staled information and going on dead machines
to make the recovery faster from a client point of view. It's common to use large timeouts on the client side, so the client may need a lot of time before declaring a region server dead and trying another one. If the client receives the information separatly about a region server states, it can take the right decision, and continue/stop to wait accordingly.

We can also send more information, for example instructions like 'slow down' to instruct the client to increase the retries delay and so on.

Technically, the master could send this information. To lower the load on the system, we should:

have a multicast communication (i.e. the master does not have to connect to all servers by tcp), with once packet every 10 seconds or so.
receivers should not depend on this: if the information is available great. If not, it should not break anything.
it should be optional.

So at the end we would have a thread in the master sending a protobuf message about the dead servers on a multicast socket. If the socket is not configured, it does not do anything. On the client side, when we receive an information that a node is dead, we refresh the cache about it.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

7590.inprogress.patch
12/Feb/13 14:31
68 kB
Nicolas Liochon
7590.v1.patch
19/Feb/13 14:31
66 kB
Nicolas Liochon
7590.v12.patch
18/Mar/13 12:04
83 kB
Nicolas Liochon
7590.v12.patch
18/Mar/13 12:04
83 kB
Nicolas Liochon
7590.v13.patch
18/Mar/13 21:36
83 kB
Nicolas Liochon
7590.v1-rebased.patch
20/Feb/13 12:13
66 kB
Nicolas Liochon
7590.v2.patch
27/Feb/13 17:49
69 kB
Nicolas Liochon
7590.v3.patch
28/Feb/13 10:03
68 kB
Nicolas Liochon
7590.v5.patch
12/Mar/13 17:53
85 kB
Nicolas Liochon
7590.v5.patch
12/Mar/13 17:53
85 kB
Nicolas Liochon

Issue Links

breaks

HBASE-18390 Sleep too long when finding region location failed

Resolved

is required by

HBASE-5843 Improve HBase MTTR - Mean Time To Recover

Closed

relates to

HBASE-28941 Clear all meta caches of the server on which hardware failure related exceptions occurred

Patch Available

requires

HBASE-7756 Strange code in ServerCallable#shouldRetry

Closed

HBASE-7789 Clean DeadServer.java and add a Jitter method in ConnectionUtils

Closed

HBASE-7861 Use the ServerName in the Connection#getClient and Connection#getAdmin code

Closed

(1 requires)

Add a costless notifications mechanism from master to regionservers & clients

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates