Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.0.0-alpha1
    • Fix Version/s: 2.4.0
    • Component/s: hdfs-client
    • Labels:
      None
    • Hadoop Flags:
      Reviewed
    • Release Note:
      If a read from a block is slow, start up another parallel, 'hedged' read against a different block replica. We then take the result of whichever read returns first (the outstanding read is cancelled). This 'hedged' read feature will help rein in the outliers, the odd read that takes a long time because it hit a bad patch on the disk, etc.

      This feature is off by default. To enable this feature, set <code>dfs.client.hedged.read.threadpool.size</code> to a positive number. The threadpool size is how many threads to dedicate to the running of these 'hedged', concurrent reads in your client.

      Then set <code>dfs.client.hedged.read.threshold.millis</code> to the number of milliseconds to wait before starting up a 'hedged' read. For example, if you set this property to 10, then if a read has not returned within 10 milliseconds, we will start up a new read against a different block replica.

      This feature emits new metrics:

      + hedgedReadOps
      + hedgeReadOpsWin -- how many times the hedged read 'beat' the original read
      + hedgedReadOpsInCurThread -- how many times we went to do a hedged read but we had to run it in the current thread because dfs.client.hedged.read.threadpool.size was at a maximum.
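
      As an illustration, here is a minimal client-side sketch of turning the feature on. The property names come from the release note above; the pool size, threshold, and file path are only example values:

        import org.apache.hadoop.conf.Configuration;
        import org.apache.hadoop.fs.FSDataInputStream;
        import org.apache.hadoop.fs.FileSystem;
        import org.apache.hadoop.fs.Path;

        public class HedgedReadExample {
          public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            // Off by default; a positive pool size enables hedged reads for this client.
            conf.setInt("dfs.client.hedged.read.threadpool.size", 10);
            // Start a second, 'hedged' read if the first has not returned within 30 ms.
            conf.setLong("dfs.client.hedged.read.threshold.millis", 30);

            FileSystem fs = FileSystem.get(conf);
            byte[] buf = new byte[4096];
            try (FSDataInputStream in = fs.open(new Path("/example/file"))) {
              // Positional reads (pread) are the code path that hedged reads cover.
              in.read(0L, buf, 0, buf.length);
            }
          }
        }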

      Description

      This is a placeholder for backporting the HDFS-related work from https://issues.apache.org/jira/browse/HBASE-7509

      The quorum read ability should be especially helpful for optimizing read outliers.

      We can use "dfs.dfsclient.quorum.read.threshold.millis" and "dfs.dfsclient.quorum.read.threadpool.size" to enable/disable the hedged read ability from the client side (e.g. HBase), and by using DFSQuorumReadMetrics, we can export the metrics of interest into the client system (e.g. HBase's regionserver metrics).

      The core logic is in the pread code path: based on the above config items, we decide whether to go to the original fetchBlockByteRange or the newly introduced fetchBlockByteRangeSpeculative.

      1. HDFS-5776.txt
        29 kB
        Liang Xie
      2. HDFS-5776-v10.txt
        31 kB
        Liang Xie
      3. HDFS-5776-v11.txt
        31 kB
        Liang Xie
      4. HDFS-5776-v12.txt
        31 kB
        stack
      5. HDFS-5776-v12.txt
        31 kB
        stack
      6. HDFS-5776-v13.wip.txt
        31 kB
        stack
      7. HDFS-5776-v14.txt
        30 kB
        stack
      8. HDFS-5776-v15.txt
        32 kB
        stack
      9. HDFS-5776-v17.txt
        33 kB
        stack
      10. HDFS-5776-v17.txt
        33 kB
        stack
      11. HDFS-5776v18.txt
        34 kB
        stack
      12. HDFS-5776-v2.txt
        29 kB
        Liang Xie
      13. HDFS-5776v21.txt
        35 kB
        stack
      14. HDFS-5776v21-branch2.txt
        34 kB
        stack
      15. HDFS-5776-v3.txt
        30 kB
        Liang Xie
      16. HDFS-5776-v4.txt
        30 kB
        Liang Xie
      17. HDFS-5776-v5.txt
        29 kB
        Liang Xie
      18. HDFS-5776-v6.txt
        30 kB
        Liang Xie
      19. HDFS-5776-v7.txt
        29 kB
        Liang Xie
      20. HDFS-5776-v8.txt
        30 kB
        Liang Xie
      21. HDFS-5776-v9.txt
        31 kB
        Liang Xie

        Issue Links

          Activity

          xieliang007 Liang Xie added a comment -

          I made a raw patch against the 2.0 branch yesterday; I will upload it once testing is done.

          sureshms Suresh Srinivas added a comment -

          Is this really quorum read or just reading in parallel? Can you please add more details to the description?

          stack stack added a comment -

          Good on you Liang Xie. Is it a parallel (//) read, or starting up a read on a second replica if the first is slow? Thanks for working on this.

          xieliang007 Liang Xie added a comment -

          Thanks for the comments. "Quorum read" is probably not accurate; the name was just copied from the fb-20 and 0.89-fb branches.
          The core logic is in the pread op: send a callable read request to dn1, and if there is no response before a timeout is reached, chooseDN will pick another DN to send a similar read request to.

          This idea is similar to Jeff Dean's "The Tail at Scale"; in that article, Jeff names these "hedged requests" and describes a clever optimization of deferring the second request to limit the additional load.
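
          To make the mechanics concrete, here is a small self-contained sketch of that hedged-request pattern in plain JDK code (the readFrom() helper and the replica names are hypothetical; this is not the actual DFSInputStream code): submit the read against the first replica, wait up to the threshold, and if nothing has completed, submit the same read against a second replica and take whichever finishes first.

            import java.util.concurrent.*;

            public class HedgedRequestSketch {
              // Hypothetical stand-in for reading a byte range from one replica.
              static byte[] readFrom(String replica) throws Exception {
                Thread.sleep("dn1".equals(replica) ? 500 : 20); // dn1 is the slow one here
                return new byte[0];
              }

              public static void main(String[] args) throws Exception {
                ExecutorService pool = Executors.newFixedThreadPool(2);
                CompletionService<byte[]> reads = new ExecutorCompletionService<>(pool);
                reads.submit(() -> readFrom("dn1"));             // primary read
                Future<byte[]> first = reads.poll(100, TimeUnit.MILLISECONDS);
                if (first == null) {                             // threshold passed, so hedge
                  reads.submit(() -> readFrom("dn2"));           // same read, different replica
                  first = reads.take();                          // whichever returns first
                }
                byte[] data = first.get();
                pool.shutdownNow();                              // abandon the outstanding read
                System.out.println("got " + data.length + " bytes");
              }
            }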

          xieliang007 Liang Xie added a comment -

          Please feel free to modify this JIRA's title and description; I'm not a native speaker and probably not good at the detailed semantics.

          stack stack added a comment -

          I had a go at the subject for you Liang Xie. I am a native speaker, but when others read my writing or hear me talk, they wonder.

          xieliang007 Liang Xie added a comment -

          Let's see what the QA robot will say.

          xieliang007 Liang Xie added a comment -

          "mvn clean test -Dtest=TestPread" passed locally

          hadoopqa Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12623328/HDFS-5776.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          -1 javac. The applied patch generated 1546 javac compiler warnings (more than the trunk's current 1545 warnings).

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The following test timeouts occurred in hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.hdfs.server.blockmanagement.TestBlockTokenWithDFS

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5897//testReport/
          Javac warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/5897//artifact/trunk/patchprocess/diffJavacWarnings.txt
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5897//console

          This message is automatically generated.

          xieliang007 Liang Xie added a comment -

          v2 should address the javadoc and failed case

          hadoopqa Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12623601/HDFS-5776-v2.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5910//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5910//console

          This message is automatically generated.

          xieliang007 Liang Xie added a comment -

          all are green now, nice

          stack stack added a comment -

          Liang Xie

          Nit: Is the 'return null' below necessary, given the Void return type?

          + public Void call() throws IOException {
          +   pReadFile(fileSys, file);
          +   return null;
          + }

          We should rename these methods given this is not 'quorum' reading:

          testQuorumPreadDFSBasic
          

          ... and here testMaxOutQuorumPool

          And this variable name needs changing? numQuorumPoolThreads... and here DFS_DFSCLIENT_QUORUM_READ_THREADPOOL_SIZE

          ... more review to come (have to head out)

          stack stack added a comment -

          This looks like a copy-paste issue:

          + // assert that there were no quorum reads. 60ms + delta < 100ms
          + assertTrue(metrics.getParallelReadOps() > 0);

          Here you are asserting that there ARE parallel 'hedged' reads going on.

          I was wondering if 'hedged requests' is a good name for this feature, and going by the definition from your citation, it is good by me:

          Hedged requests. A simple way to curb latency variability is to issue the same request to multiple replicas and use the results from whichever replica responds first. We term such requests "hedged requests" because a client first sends one request to the replica believed to be the most appropriate, but then falls back on sending a secondary request after some brief delay. The client cancels remaining outstanding requests once the first result is received. Although naive implementations of this technique typically add unacceptable additional load, many variations exist that give most of the latency-reduction effects while increasing load only modestly.

          One such approach is to defer sending a secondary request until the first request has been outstanding for more than the 95th-percentile expected latency for this class of requests. This approach limits the additional load to approximately 5% while substantially shortening the latency tail. ....

          http://cacm.acm.org/magazines/2013/2/160173-the-tail-at-scale/fulltext

          We'd change this to be allowHedgeReads?

          + public volatile boolean allowParallelReads = false;

          This would be hedgedReadThresholdMillis

          + private volatile long quorumReadThresholdMillis;

          ... and so on.

          Or do you see parallel reads as different from hedged reads? ('Tied requests' from your citation).

          These are public so clients like hbase can tinker with them:

          +
          +  public void setQuorumReadTimeout(long timeoutMillis) {
          +    this.quorumReadThresholdMillis = timeoutMillis;
          +  }
          +
          +  public long getQuorumReadTimeout() {
          +    return this.quorumReadThresholdMillis;
          +  }
          +
          +  public void enableParallelReads() {
          +    allowParallelReads = true;
          +  }
          +
          +  public void disableParallelReads() {
          +    allowParallelReads = false;
          +  }
          

          So if the requests are > the configured number, we run in the current thread, which is better than rejecting the request (see the sketch at the end of this comment)... What happens when we go beyond this allowance? Do we throw a rejection exception? When would there be more than the configured number of parallel (//) reads going on?

          Should these be public?

          + public ThreadPoolExecutor getParallelReadsThreadPool() {
          +   return parallelReadsThreadPool;
          + }

          +
          + public DFSQuorumReadMetrics getQuorumReadMetrics() {

          Will have to rename this class? DFSQuorumReadMetrics

          Add class comment on +public class DFSQuorumReadMetrics { saying what the metric means (since there is no description for the metrics it seems).

          Need a space in here? "+public class DFSQuorumReadMetrics {"

          I need to spend more time on the DFSClient changes but above should do for a first cut at a review. Thanks Liang Xie
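
          On the "run it in the current thread" question above: a bounded ThreadPoolExecutor can be told to execute the task in the submitting thread instead of rejecting it, which is the general technique behind that kind of fallback. A minimal JDK sketch of the idea (an illustration of the technique, not the patch itself):

            import java.util.concurrent.SynchronousQueue;
            import java.util.concurrent.ThreadPoolExecutor;
            import java.util.concurrent.TimeUnit;

            public class CallerRunsSketch {
              public static void main(String[] args) {
                ThreadPoolExecutor pool = new ThreadPoolExecutor(
                    2, 2,                            // fixed pool of two threads
                    60, TimeUnit.SECONDS,
                    new SynchronousQueue<>(),        // no queueing of extra tasks
                    // When both threads are busy, run the task in the caller's thread
                    // instead of throwing RejectedExecutionException.
                    new ThreadPoolExecutor.CallerRunsPolicy());
                for (int i = 0; i < 4; i++) {
                  pool.execute(() ->
                      System.out.println("running in " + Thread.currentThread().getName()));
                }
                pool.shutdown();
              }
            }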

          xieliang007 Liang Xie added a comment -

          Thanks Stack for your detailed comments!
          The attached v3 addresses the naming-related comments first.
          I also have some perf numbers I would like to share here:

          Test Env:
          Hadoop 2.0 + HBase 0.94.11
          3 datanodes, each DN with only one disk for dfs read/write (yes, only one SATA disk; it's a little poor, haha, but perfect for the current test scenario, since we want to see the result when bad pread performance occurs)
          One regionserver instance is up; I created a ycsb test table, loaded 20m records (each row has 3 * 200 bytes), and finally ran a major compaction; the webui showed only 1 storefile of 14493MB.
          I use a single-process ycsb with 10 threads doing random read (get) requests, each run lasting 10 minutes, and I clear the HBase block cache and OS cache (drop_caches) manually between runs. The hedged reads thread pool size stays at 50. Here are the detailed results:

          1) dfs.dfsclient.hedged.read.threshold.millis = 500ms, dfs.dfsclient.hedged.read.sleep.interval.millis = 50ms. Indeed, this should behave much like the current existing impl since, per the following results, almost all response times are less than 500ms, so only a very few requests go to the secondary DN:
          Throughput(ops/sec), 221.8174849820451
          AverageLatency(us), 45055.13540070315
          50thPercentileLatency(us), 24049
          95thPercentileLatency(us), 165905
          99thPercentileLatency(us), 270578

          2) dfs.dfsclient.hedged.read.threshold.millis = 150ms, dfs.dfsclient.hedged.read.sleep.interval.millis = 50ms
          Throughput(ops/sec), 257.6483818568037
          AverageLatency(us), 38781.92033469773
          50thPercentileLatency(us), 20534
          95thPercentileLatency(us), 148194
          99thPercentileLatency(us), 201110

          3) dfs.dfsclient.hedged.read.threshold.millis = 100ms, dfs.dfsclient.hedged.read.sleep.interval.millis = 50ms
          Throughput(ops/sec), 254.35882053973887
          AverageLatency(us), 39291.54205264606
          50thPercentileLatency(us), 20585
          95thPercentileLatency(us), 150998
          99thPercentileLatency(us), 151446

          4) dfs.dfsclient.hedged.read.threshold.millis = 100ms, dfs.dfsclient.hedged.read.sleep.interval.millis = 20ms
          Throughput(ops/sec), 237.20809410260168
          AverageLatency(us), 42110.37126189875
          50thPercentileLatency(us), 20246
          95thPercentileLatency(us), 121147
          99thPercentileLatency(us), 141207

          In summary, in my heavy IO-bound random read test scenario, the 99th percentile latency was cut from 270ms to 141ms by the hedged read feature, but it does not obviously improve the average latency or throughput. This is expected: the biggest benefit is against the long-tail random read latency issue, which is pretty common in HBase.

          hadoopqa Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12623929/HDFS-5776-v3.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5920//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5920//console

          This message is automatically generated.

          stack stack added a comment -

          Nice numbers Liang Xie

          When you get a chance, there were a few questions in the previous review notes.

          Nit: This seems extraneous in the new Callable:

          + return null;

          Nit: No need of the intermediary 'instance' assignment – just assign to 'injector'?

          + // Set up the InjectionHandler
          + DFSClientFaultInjector.instance = Mockito
          + .mock(DFSClientFaultInjector.class);
          + DFSClientFaultInjector injector = DFSClientFaultInjector.instance;

          Nit: Should the '60' below here:

          + Thread.sleep(60);

          Be more related to the '100' you pass as the DFS_DFSCLIENT_HEDGED_READ_THRESHOLD_MILLIS config? Be half of whatever DFS_DFSCLIENT_HEDGED_READ_THRESHOLD_MILLIS is? My concern is that someone could change one of these settings not realizing they are related (they are, right)?

          Is it possible that on a slow machine – such as apache jenkins – that we may get a hedged read when we do not expect it ?

          + // assert that there were no hedged reads. 60ms + delta < 100ms

          i.e. could this test turn flakey on a strained testing infrastructure?

          Shut down this executor in the finally? Don't want it sticking around.

          + ExecutorService executor = Executors.newFixedThreadPool(numHedgedReads);

          For sure this will always trigger though the number of futures == numHedgedReads (should futures == numHedgedReads + 1 to be sure?)

          + assertTrue(metrics.getHedgedReadOpsInCurThread() > 0);

          Nice test.

          Your nice new metrics showed in your above test? They made sense (I suppose they must basically work since your test relies on them).

          You need the below new log?

          + DFSClient.LOG.warn("Could not obtain block " + block + errMsg
          + + ". Throw a BlockMissingException");

          s/Throw/Throwing/

          Make the above log message match the content of the BlockMissingException so it easier connecting the two emissions (Later in the patch you actually do this).

          Needs a space between ie and errMsg?

          + + " from any node: " + ie + errMsg

          When we get to the end of the pipeline here; i.e. all datanodes have been tried, what happens?

          +    while (true) {
          +      DNAddrPair retval = chooseDataNode(block);
          +      try {
          +        actualGetFromOneDataNode(retval, block, start, end, buf, offset,
          +            corruptedBlockMap);
          +        return;
          +      } catch (IOException e) {
          +        // Ignore. Already processed inside the function.
          +        // Loop through to try the next node.
          +      }
          +    }
          

          Seems like the above is a common idiom in DFSClient.

          Say why it is ok to ignore the IOE at this point in the comment.

          + // ignore fetchBlockAt IOException

          This is good:

          - DFSClient.LOG.debug("Connection failure ", e);
          + DFSClient.LOG.debug("Connection failure: " + msg, e);

          I suppose moving this into finally would be messier than what you have done where you add it to the end of the if and the else clauses when exception:

          // Put chosen node into dead list, continue
          addToDeadNodes(chosenNode);

          Should fetchBlockByteRangeSpeculative be called fetchBlockByteRangeHedge or hedgedFetchBlockByteRange.... 'hedged' fetches is what this patch introduces. 'speculative' may confuse. At least add a comment that the method is about 'hedged' fetches.

          So on a dfsclient instance, we can flip hedged reads on and off?
          + public void enableHedgedReads() {
          +   allowHedgedReads = true;
          + }

          ThreadPoolExecutor should make daemon threads?

          Is this a good idea?

          getHedgedReadsThreadPool

          Should be kept internal to DFSClient.

          Patch is great Liang Xie

          cmccabe Colin P. McCabe added a comment -

          Let's keep TestPread as a test of just pread, and have a separate test to test hedged reads.

          +  private ByteBuffer getFirst(ArrayList<Future<ByteBuffer>> futures,
          

          Could we rename this to getFirstToComplete or something like that? getFirst just sounds like it's getting the first element in the ArrayList.

          I think that when the data being read is local, you will not want hedged reads. Let's check for this case.

          Thanks, Liang.

          stack stack added a comment -

          I think that when the data being read is local, you will not want hedged reads. Let's check for this case.

          You think Mighty Colin P. McCabe? Having hedged read on could ameliorate 'bad sector' syndrome (and its many variants)? Thanks.

          cmccabe Colin P. McCabe added a comment -

          Having hedged read on could ameliorate 'bad sector' syndrome (and its many variants)? Thanks.

          I guess we probably don't need to special-case local reads, as long as we continue to prefer to read from local datanodes when possible.

          xieliang007 Liang Xie added a comment -

          Stack, Colin P. McCabe, hedged reads don't need to be aware of whether a read is local or not.
          pread -> fetchBlockByteRange/fetchBlockByteRangeSpeculative -> actualGetFromOneDataNode -> getBlockReader
          We don't need to handle whether it is a local reader or not; the hedged read only focuses on picking a secondary DN and then waiting for the winner.

          In my test, the region server (rs) instance was on the same box as one DN (say dn1). Take my first case (dfs.dfsclient.hedged.read.threshold.millis = 500ms) for example: in that case, indeed, nearly all of the preads go to the local dn1. I could observe that io util% was always 100% (really IO-bound, aha; btw, I filed HDFS-5727, hoping we can alleviate the IO somewhat; I am planning to dig into that JIRA after this is done, so if you have any comments, just put them there) during the testing, while dn2/dn3's io util% stayed at 0%. In the following cases, I could see the io util% of dn2/dn3 begin to increase to around 5%~20%.

          xieliang007 Liang Xie added a comment -

          I guess we probably don't need to special-case local reads, as long as we continue to prefer to read from local datanodes when possible.

          Yes, correct (hmm, I read it several times before fully understanding your meaning; my English reading needs improvement).

          xieliang007 Liang Xie added a comment -

          Let's keep TestPread as a test of just pread, and have a separate test to test hedged reads.

          That's what I did: I only changed dfsPreadTest a bit, so that it relies on the input parameter instead of creating a new conf object, that's all. I do have separate cases for hedged reads; see testHedgedPreadDFSBasic and testMaxOutHedgedReadPool.

          Could we rename this to getFirstToComplete or something like that? getFirst just sounds like it's getting the first element in the ArrayList.

          OK

          I think that when the data being read is local, you will not want hedged reads. Let's check for this case.

          It seems you had a different understanding of it; the latter comment, "I guess we probably don't need to special-case local reads, as long as we continue to prefer to read from local datanodes when possible", is correct. There is no conflict between local reads and hedged reads. The hedged read still tries the local read if possible; if it has not succeeded by the time the timeout is reached, it sends a request to the picked secondary DN and then waits for the winner.

          xieliang007 Liang Xie added a comment -

          Nit: This seems extraneous in the new Callable: + return null;

          the "return null" is needful, since we have a "Future<Void>" definition, if we remove the "return null", the compiler will complain

          Nit: No need of the intermediary 'instance' assignment – just assign to 'injector'?

          We do need it; mock() returns an object, see the Javadoc:

              /**
               * Creates mock object of given class or interface.
               * <p>
               * See examples in javadoc for {@link Mockito} class
               * 
               * @param classToMock class or interface to mock
               * @return mock object
               */
              public static <T> T mock(Class<T> classToMock) {
                  return mock(classToMock, withSettings().defaultAnswer(RETURNS_DEFAULTS));
              }
          
          xieliang007 Liang Xie added a comment -

          Nit: Should the '60' below here:
          + Thread.sleep(60);
          Be more related to the '100' you pass as the DFS_DFSCLIENT_HEDGED_READ_THRESHOLD_MILLIS config? Be half of whatever DFS_DFSCLIENT_HEDGED_READ_THRESHOLD_MILLIS is? My concern is that someone could change one of these settings not realizing they are related (they are, right)?
          Is it possible that on a slow machine – such as apache jenkins – that we may get a hedged read when we do not expect it ?
          + // assert that there were no hedged reads. 60ms + delta < 100ms
          i.e. could this test turn flakey on a strained testing infrastructure?

          Very good suggestion; I modified the 100ms to 500ms, and the sleep interval is now 50ms. That should be enough, at least to me.

          xieliang007 Liang Xie added a comment -

          Shut down this executor in the finally? Don't want it sticking around.

          OK, add "executor.shutdown()" now, but i didn't put it into finally block, since no exception within it and i don't like to move the executor definition before the try block

          Your nice new metrics showed in your above test? They made sense (I suppose they must basically work since your test relies on them).

          The new metrics are also useful on the HBase side; in our HBase-related change, we could gather these new metrics into RegionServerMetrics or some other place.

          s/Throw/Throwing/

          done, thanks.

          Needs a space between ie and errMsg?
          + + " from any node: " + ie + errMsg

          good catch, added a space inside getBestNodeErrorString now

          xieliang007 Liang Xie added a comment -

          getHedgedReadsThreadPool
          Should be kept internal to DFSClient.

          changed from "public" to "protected", since we need it in DFSInputStream for a sanity check.

          So on a dfsclient instance, we can flip hedged reads on and off?

          yes, will only take effect on that instance.

          Should fetchBlockByteRangeSpeculative be called fetchBlockByteRangeHedge or hedgedFetchBlockByteRange

          ok, changed to "hedgedFetchBlockByteRange"

          xieliang007 Liang Xie added a comment -

          For sure this will always trigger though the number of futures == numHedgedReads (should futures == numHedgedReads + 1 to be sure?)

          There's no guarantee about that; we can't predict the exact numHedgedReads count against the futures size, there's no direct correlation.

          You need the below new log?
          + DFSClient.LOG.warn("Could not obtain block " + block + errMsg
          + + ". Throw a BlockMissingException");

          Yes, it would be better to have it when diagnosing something.

          Make the above log message match the content of the BlockMissingException so it easier connecting the two emissions (Later in the patch you actually do this).

          done

          xieliang007 Liang Xie added a comment -

          When we get to the end of the pipeline here; i.e. all datanodes have been tried, what happens?

          In bestNode(), if we cannot find any candidate, it will throw a new IOException("No live nodes contain current block"); chooseDataNode will then catch it and retry within a calculated timeWindow and the dfsClient.getMaxBlockAcquireFailures limit.

          Say why it is ok to ignore the IOE at this point in the comment.

          done

          xieliang007 Liang Xie added a comment -

          I suppose moving this into finally would be messier than what you have done where you add it to the end of the if and the else clauses when exception:

          If we hit an InvalidEncryptionKeyException or InvalidToken exception, we don't want to addToDeadNodes immediately before the retry, so we cannot move addToDeadNodes into a finally block; FYI, you can refer to HDFS-5766 for the reason.

          It seems I have answered all of the above comments; did I miss any? Let me upload the new one.

          hadoopqa Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12624084/HDFS-5776-v4.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5924//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5924//console

          This message is automatically generated.

          enis Enis Soztutar added a comment -

          Nice work Liang. You have beat us to implement this!
          A couple of higher level comments:

          • The numbers look very promising. http://static.googleusercontent.com/media/research.google.com/en/us/people/jeff/Berkeley-Latency-Mar2012.pdf (slides 50+) gives some numbers for the increased RPCs caused by this. It would be great if we can get some info about this as well.
          • Regarding naming, the FB branch calls this quorum reads (which is misleading), and Google calls this backup requests. We preferred to use the names "parallel" and "parallel with delay" in the design doc for HBASE-10070 (a similar feature in HBase), and in the code we ended up calling it RPC with fallback. It would be very good to use consistent naming across HDFS and HBase, but I'm not sure which one is better.
          • In getFirstToComplete(), sleeping is not the best practice. It puts an arbitrary delay in returning back, and configuring the sleep timeout is non-trivial. Can we do something like ExecutorService.invokeAny(), a wait/notify, or a CountDownLatch design (see the sketch after this list)? See http://stackoverflow.com/questions/117690/wait-until-any-of-futuret-is-done
          • Again, in Jeff Dean's slides, they note that doing a 3rd request with a larger timeout does not buy a lot. Wondering whether we should limit this to only 2 requests or not. Without real-world usage it will be hard to choose one way or the other.
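          For illustration only (not the patch's code): a minimal sketch of the wait-for-first-to-complete pattern suggested above, using an ExecutorCompletionService. The Callable<ByteBuffer> task type and the method name are assumptions made for this example.

          import java.nio.ByteBuffer;
          import java.util.ArrayList;
          import java.util.List;
          import java.util.concurrent.Callable;
          import java.util.concurrent.CompletionService;
          import java.util.concurrent.ExecutionException;
          import java.util.concurrent.ExecutorCompletionService;
          import java.util.concurrent.ExecutorService;
          import java.util.concurrent.Future;

          public class FirstToCompleteSketch {
            // Returns the result of whichever read finishes first and cancels the
            // rest, instead of polling the futures in a sleep() loop.
            static ByteBuffer getFirstToComplete(ExecutorService pool,
                List<Callable<ByteBuffer>> reads)
                throws InterruptedException, ExecutionException {
              CompletionService<ByteBuffer> cs =
                  new ExecutorCompletionService<ByteBuffer>(pool);
              List<Future<ByteBuffer>> futures = new ArrayList<Future<ByteBuffer>>();
              for (Callable<ByteBuffer> read : reads) {
                futures.add(cs.submit(read));
              }
              try {
                // take() blocks until the first submitted task completes (or fails).
                return cs.take().get();
              } finally {
                for (Future<ByteBuffer> f : futures) {
                  f.cancel(true); // best-effort cancel of the losing read(s)
                }
              }
            }
          }

          A fuller version would keep calling take() until a successful result arrives instead of propagating the first failure, but the shape is the same: block on the completion service rather than sleep-and-poll.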
          stack stack added a comment -

          (Trying to help out w/ naming...)

          Enis Soztutar You don't like 'hedged'? You saw the above citation where there is a 'definition' of hedged reads and then the technique implemented by this patch here. 'backup requests' seems less formal being 'just' a label on a slide. // with delay is accurate but a mouthful.

          xieliang007 Liang Xie added a comment -

          Enis Soztutar, thanks for your nice comments.

          sleeping is not the best practice

          Totally agree! Let me try to remove it.

          xieliang007 Liang Xie added a comment -

          v5 now uses a CountDownLatch; from the test results, it looks better for timeout=100ms:

          [OVERALL], Throughput(ops/sec), 275.0043323513337
          [READ], Operations, 165040
          [READ], AverageLatency(us), 36339.62364881241
          [READ], MinLatency(us), 260
          [READ], MaxLatency(us), 1375837
          [READ], 50thPercentileLatency(us), 20306
          [READ], 95thPercentileLatency(us), 115498
          [READ], 99thPercentileLatency(us), 124271

          xieliang007 Liang Xie added a comment -

          About naming, personally I'd like to keep "hedged" unchanged; it's not a big issue if HBASE-10070 uses another name, just my thought.
          About the increased RPCs: sure, it's just a tradeoff, we all know that. I can provide metric results against different timeout settings, probably in HBASE-7509.

          doing a 3rd request with a larger timeout does not buy a lot

          The current patch picks only one secondary DN, which means there are no more than two requests under normal conditions.

          BTW, is it possible to get this into Hadoop 2.4 if all comments are addressed? I saw Andrew Wang's post on the mailing list that there's a plan to release 2.4 at the end of this month, right?
          If this patch goes in, we can start work immediately on HBASE-7509 by bumping the corresponding dependency version to 2.4; otherwise we have to wait until it lands in 2.5+, which may take several weeks.

          xieliang007 Liang Xie added a comment -

          Added a bit of code to YCSB to get the 99.9th percentile latency; here is the result:
          dfs.dfsclient.hedged.read.threshold.millis=100ms:

          [READ], 95thPercentileLatency(us), 115973
          [READ], 99thPercentileLatency(us), 124829
          [READ], 99.9thPercentileLatency(us), 217892

          dfs.dfsclient.hedged.read.threshold.millis=600000ms (this is equivalent to disabling the hedged read feature, since my test duration is 600s):

          [READ], 95thPercentileLatency(us), 149355
          [READ], 99thPercentileLatency(us), 256987
          [READ], 99.9thPercentileLatency(us), 418950

          In practice, we would probably set the threshold close to the 95th/99th percentile latency; this is just a test setting to make the difference more obvious...
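          For reference, a minimal client-side sketch of the settings exercised in these runs; the property keys are the dfs.dfsclient.* names used in this thread (they may be renamed before commit), and the values are only examples.

          import org.apache.hadoop.conf.Configuration;
          import org.apache.hadoop.fs.FileSystem;

          public class HedgedReadConfigSketch {
            public static void main(String[] args) throws Exception {
              Configuration conf = new Configuration();
              // A positive pool size enables hedged reads; 0 (the default) disables them.
              conf.setInt("dfs.dfsclient.hedged.read.threadpool.size", 50);
              // Start a hedged read if the first read has not returned within 100 ms.
              conf.setLong("dfs.dfsclient.hedged.read.threshold.millis", 100L);
              FileSystem fs = FileSystem.get(conf);
              // preads issued through streams opened from this FileSystem may be hedged.
              fs.close();
            }
          }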

          hadoopqa Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12624294/HDFS-5776-v5.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5934//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5934//console

          This message is automatically generated.

          enis Enis Soztutar added a comment -

          About naming, personally I'd like to keep "hedged" unchanged

          Ok, honestly, I am not a native speaker so I had to look it up. But if this is common usage, fine with me.

          v5 now uses a CountDownLatch; from the test results, it looks better for timeout=100ms

          Great, thanks.

          The current patch picks only one secondary DN, which means there are no more than two requests under normal conditions.

          I thought the while(true) loop would continue sending the RPC to a 3rd replica. Let me check the patch again.

          cmccabe Colin P. McCabe added a comment -

          I like the idea of calling this "hedged reads." We already have a test called TestParallelReads in HDFS which is not related to this, and so it would be a bit confusing to call this feature "parallel reads."

          arpitagarwal Arpit Agarwal added a comment -

          Hi Liang, thanks for this contribution to HDFS!

          I am still reviewing DFSInputStream#hedgedFetchBlockByteRange and the tests but here is some initial feedback.

          1. Does it make sense for DFSClient#hedgedReadsThreadPool to be a static field? The concern is too many thread pools created by multiple clients on the same node.
          2. Related to the previous - what do you think of not exposing the DFS_DFSCLIENT_HEDGED_READ_THREADPOOL_SIZE setting at all? Maybe we can just expose a boolean setting to enable it. The reason I prefer not to surface such settings is because it invites abuse (the concern is not with trusted apps like HBase). If we do expose this setting we should at least have an internal upper bound.
          3. DFSClient#allowHedgedReads seems unnecessary since you can just use (hedgedReadsThreadPool == null). Also you can remove #enableHedgedReads and #disableHedgedReads.
          4. For DEFAULT_DFSCLIENT_HEDGED_READ_THRESHOLD_MILLIS - can we add an inbuilt minimum delay to defeat applications that set it too low or even zero?
          5. DFSInputStream#chooseDataNode - can the call to getBestNodeErrorString go inside the "if (failures >=..." clause?
          6. #fetchBlockByteRange - can we rename retVal to something like addressPair?
          7. Do we still need the while loop in actualGetFromOneDataNode? There is already a while loop in fetchBlockByteRange enclosing the call to actualGetFromOneDataNode. Now we have a nested loop.
          8. Maybe I misunderstood the code flow but it looks like the way the while loops are nested it defeats the usage of refetchToken and refetchEncryptionKey. It looks like the intention was to limit the refetch to 1 across all retries, now we can refetch multiple times.
          9. Related to the previous, #actualGetFromOneDataNode, line 1026, - sorry I did not understand why the try-catch was added around the call to fetchBlockAt.
          10. #actualGetFromOneDataNode, line 1010 - we are using an exception to signal retry to the caller. It might be better to return a boolean instead.
          11. #actualGetFromOneDataNode, line 1033 - the call to DFSClient.LOG.warn is deleted. Assume that was unintentional?
          12. Nitpick - some lines have whitespace-only changes.
          stack stack added a comment -

          Arpit Agarwal Great review (from a bystander). One note on 1.. Is static ever a good idea for sharing resources? But your point of being able to share amongst DFSClient instances is for sure something we should pursue (in another JIRA?). We could pass a common executor in a shared context and we could also keep running lists of black-listed nodes rather than have each stream discover for themselves the dead... and so on

          xieliang007 Liang Xie added a comment -

          Arpit Agarwal, thanks for your nice review!

          The concern is too many thread pools created by multiple clients on the same node

          No worries: the default configuration is pool=0, which means no extra threads are created by default. If an end user/application enables hedged reads, they should know about this.

          what do you think of not exposing the DFS_DFSCLIENT_HEDGED_READ_THREADPOOL_SIZE setting at all

          IMHO, I personally prefer the current style; it's less risky. We have a bounded queue, and once we reach the queue limit we force the read to execute in the current thread. About the "internal upper bound", how much? 5000? 500000? or something else? I think if an end user/application enables this feature explicitly, they should know a little background at least, right? Just like lots of Hadoop timeout config parameters, I have never seen an internal upper bound implemented at all... but if you strongly insist on it, I can add one.
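          To make the "exec it in current thread" behaviour concrete, here is one way such a pool could be built. This is an illustrative sketch, not the patch itself; the SynchronousQueue plus CallerRunsPolicy combination is just one implementation of a pool with no unbounded backlog that falls back to the caller's thread.

          import java.util.concurrent.SynchronousQueue;
          import java.util.concurrent.ThreadPoolExecutor;
          import java.util.concurrent.TimeUnit;

          public class HedgedReadPoolSketch {
            // Overflow work runs in the submitting thread instead of queueing up
            // without bound; poolSize would come from the threadpool.size setting.
            static ThreadPoolExecutor newHedgedReadPool(int poolSize) {
              ThreadPoolExecutor pool = new ThreadPoolExecutor(
                  1, poolSize, 60, TimeUnit.SECONDS,
                  new SynchronousQueue<Runnable>(),
                  new ThreadPoolExecutor.CallerRunsPolicy());
              pool.allowCoreThreadTimeOut(true);
              return pool;
            }
          }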

          DFSClient#allowHedgedReads seems unnecessary

          Let's keep it there; it's easier to understand for a developer or end user.

          For DEFAULT_DFSCLIENT_HEDGED_READ_THRESHOLD_MILLIS - can we add an inbuilt minimum delay to defeat applications that set it too low or even zero

          My opinion is the same as for the previous point. Since we have no knowledge of the end user's storage configuration (imagine they have fast flash with HDFS-2832 enabled, say FusionIO, where one real disk read probably costs only tens of microseconds), how should we decide on a good minimum setting? So I'd rather not add it, though I totally understand your kind concern.

          DFSInputStream#chooseDataNode - can the call to getBestNodeErrorString go inside the "if (failures >=..." clause?

          Another log statement also uses it, see "DFSClient.LOG.info("Could not obtain " + block.getBlock...", so it can't be moved inside that clause.

          #fetchBlockByteRange - can we rename retVal to something like addressPair?

          Good, let me rename it.

          Do we still need the while loop in actualGetFromOneDataNode?

          Yes, but the loop is very light: extra iterations only happen when exceptions like AccessControlException/InvalidEncryptionKeyException/InvalidBlockTokenException occur, and all of those have a fast-quit mechanism (refetchToken/refetchEncryptionKey or disableLegacyBlockReaderLocal), so the loop will only execute a very few times.

          There is already a while loop in fetchBlockByteRange enclosing the call to actualGetFromOneDataNode. Now we have a nested loop.

          The loop inside fetchBlockByteRange is responsible for picking another DN if an IOException is thrown from actualGetFromOneDataNode, so it is not a worrying nested loop at all.

          Maybe I misunderstood the code flow but it looks like the way the while loops are nested it defeats the usage of refetchToken and refetchEncryptionKey. It looks like the intention was to limit the refetch to 1 across all retries, now we can refetch multiple times.

          Yes, there is a misunderstanding here; that's why I catch IOException fbae around fetchBlockAt. If we don't catch it there, the outer loop will always trigger a new refetch and we end up with a spin loop.

          Related to the previous, #actualGetFromOneDataNode, line 1026, - sorry I did not understand why the try-catch was added around the call to fetchBlockAt.

          Hopefully the above answer makes it clear; hope my poor English doesn't make everything worse, haha.

          #actualGetFromOneDataNode, line 1033 - the call to DFSClient.LOG.warn is deleted. Assume that was unintentional?

          Gooood catch!

          Nitpick - some lines have whitespace-only changes.

          I found several unnecessary whitespace characters and removed them to make things cleaner.

          Thanks a lot to everyone for the reviews!!!

          jingzhao Jing Zhao added a comment -
          1. In DFSClient, I agree with Arpit that we should remove the allowHedgedReads field and the enable/disable methods. In the current code, whether hedged read is enabled is determined by the initial setting of the hedgedReadThreadPool. If we provide these extra enable/disable methods, what if a user of DFSClient sets the thread pool size to 0 and later calls enableHedgedReads? Unless we have a clear use case for the enable/disable methods, I guess we do not need to provide this flexibility here.
            An alternative way to do this is to have an "Allow-Hedged-Reads" configuration, and if it is set to true, we load the thread pool size and the threshold time. We will provide an isHedgedReadsEnabled method but we will not provide enable/disable methods (see the sketch after this comment). I guess this may be easier for users to understand.
          2. Can this scenario be possible? In hedgedFetchBlockByteRange, if we hit the timeout for the first DN, we will add the DN to the ignore list, and call chooseDataNode again. If the first DN is the only DN we can read, we will get IOException from bestNode. Then we will run into a loop where we keep trying to get another DN multiple times (some NN rpc call will even be fired). And during this process the first DN can even return the data. In this scenario I guess we may get a worse performance? Thus I guess we should not trigger hedged read if we find that we cannot (easily) find the second DN for read?
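          A sketch of the simplification being discussed in point 1, deriving the enabled state from the pool itself; the field and method names here are illustrative, not necessarily those in the patch.

          import java.util.concurrent.ThreadPoolExecutor;

          public class HedgedReadSwitchSketch {
            // null (or a zero-sized) pool means the feature is off; no separate
            // allowHedgedReads flag or enable/disable methods are needed.
            private ThreadPoolExecutor hedgedReadThreadPool;

            public boolean isHedgedReadsEnabled() {
              return hedgedReadThreadPool != null
                  && hedgedReadThreadPool.getMaximumPoolSize() > 0;
            }
          }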
          cmccabe Colin P. McCabe added a comment -

          One note on 1.. Is static ever a good idea for sharing resources? But your point of being able to share amongst DFSClient instances is for sure something we should pursue (in another JIRA?)

          Unfortunately, the FileContext API creates a new DFSClient instance for each operation that it does. (The older FileSystem API doesn't have this problem, since the DistributedFileSystem object hangs on to the DFSClient for a while.) This means that we do need to put this in a static, for now, or else FileContext users will be constantly destroying and creating thread-pools.

          I have another change pending which creates the concept of a "cache context," where different threads can use different contexts if they like. For now, let's use a static variable, maybe with a TODO.

          Related to the previous - what do you think of not exposing the DFS_DFSCLIENT_HEDGED_READ_THREADPOOL_SIZE setting at all? Maybe we can just expose a boolean setting to enable it. The reason I prefer not to surface such settings is because it invites abuse (the concern is not with trusted apps like HBase). If we do expose this setting we should at least have an internal upper bound.

          I don't see why we wouldn't expose this setting. It doesn't give the client the ability to do anything bad it couldn't already do. You can already try to open a zillion files at once in order to attack the NameNode / DataNodes. Preventing denial-of-service attacks is not currently something we try to do. And in the future, if we ever do try to prevent denial-of-service attacks, I don't think having hedged reads makes that any more or less difficult than it would otherwise be.

          cmccabe Colin P. McCabe added a comment -

          By the way, my previous comment was assuming that the alternative proposed to making the thread-pool static was putting it in DFSClient (not a good option). Another option would be making the thread-pool local to the DFSInputStream. However, this seems like it will tend to create an enormous number of threads, especially for applications like HBase that open many files. So again I would argue it should be static.

          stack stack added a comment -

          An alternative way to do this is to have an "Allow-Hedged-Reads" configuration, and if it is set to true, we load the number of thread pool and the threshold time. We will provide an isHedgedReadsEnabled method but we will not provide enable/disable methods.

          The reviews are great. On the above, while I can see putting the on/off switch in the DN config, we should at least allow setting the config for when to start the hedged read per DFSClient instance.

          This means that we do need to put this in a static, for now, or else FileContext users will be constantly destroying and creating thread-pools.

          Thanks Colin. Makes sense.

          xieliang007 Liang Xie added a comment -

          Attached v7 makes the pool static now, please review

          stack stack added a comment -

          Liang Xie, what do you think of the new comments above by the lads?

          Now the executor is static, the number of threads config needs to be NumberOfHBaseOpenFiles X 2 else the feature will not work for all files? Thanks.

          xieliang007 Liang Xie added a comment -

          Can this scenario be possible? In hedgedFetchBlockByteRange, if we hit the timeout for the first DN, we will add the DN to the ignore list, and call chooseDataNode again. If the first DN is the only DN we can read, we will get IOException from bestNode. Then we will run into a loop where we keep trying to get another DN multiple times (some NN rpc call will even be fired). And during this process the first DN can even return the data. In this scenario I guess we may get a worse performance? Thus I guess we should not trigger hedged read if we find that we cannot (easily) find the second DN for read?

          Yes, your scenario could happen, nice! A very easy way to handle it is to introduce a double-check function, say enoughNodesForHedgedRead(LocatedBlock block), into the pread checking code path.
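          A minimal sketch of that double-check, assuming LocatedBlock#getLocations() is the source of the replica count; the class wrapper is only for illustration.

          import org.apache.hadoop.hdfs.protocol.DatanodeInfo;
          import org.apache.hadoop.hdfs.protocol.LocatedBlock;

          public class HedgedReadSanityCheckSketch {
            // Only worth hedging when there is at least one replica besides the one
            // the original read is already using.
            static boolean enoughNodesForHedgedRead(LocatedBlock block) {
              DatanodeInfo[] locations = block.getLocations();
              return locations != null && locations.length > 1;
            }
          }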

          xieliang007 Liang Xie added a comment -

          v8 adds the enoughNodesForHedgedRead() function as a sanity check. Stack's comment is a great one; we definitely need a switch per DFSClient instance.

          the number of threads config needs to be NumberOfHBaseOpenFiles X 2 else the feature will not work for all files

          It still works, but probably lots of requests will execute in the current thread, which means no latency benefit from the hedged read feature. This is a good argument for a per-client-instance switch, so that we can let some instances use this feature and control it on demand, right?

          stack stack added a comment -

          So, to enable, we set DFS_DFSCLIENT_HEDGED_READ_THREADPOOL_SIZE in the DN config. Should the number of threads in the hbase case be greater than NUMBER_OF_HBASE_OPEN_FILES (though this is most often an unknown number, one that changes over the life of the hbase process, and is frequently up in the thousands)? Otherwise we could set it to some 'sensible' number like 16 and then just watch the metrics this patch also adds. If we are too often running the requests in the current thread because the executor has none to spare then we can up the number of pool threads (though it requires a DN restart, a PITA)? That should work for the first cut at this feature.

          nit: You could declare and assign in the one go rather than postpone the assign to the constructor: HEDGED_READ_METRIC = new DFSHedgedReadMetrics();

          What is your thinking regarding the boolean enabling/disabling hedged reads in DFSClient Liang Xie? On the one hand, there is a problem where the setting of pool size is done in DN config yet we have enable/disable hedged reads in the API; if the DN config has a pool size set to 0 then hedged reads are off (as was noted above), and though we may 'enable' hedged reads in the API, we won't be getting the behaviour we think we should be getting. On the other hand, it looks like this boolean could be used to 'conserve' resources by disabling hedged reads on a per-request basis even though hedged reads have been marked globally 'on' in the DN? Is that your thinking? I'm inclined to agree with the previous reviewers that this may verge on the 'exotic'. For the first cut at this feature, let's have a global on/off switch with the number of threads being the means of constraining how much hedged reading we do?

          Otherwise patch looks great to me.

          arpitagarwal Arpit Agarwal added a comment -

          I don't see why we wouldn't expose this setting. It doesn't give the client the ability to do anything bad it couldn't already do. You can already try to open a zillion files at once in order to attack the NameNode / DataNodes. Preventing denial-of-service attacks is not currently something we try to do. And in the future, if we ever do try to prevent denial-of-service attacks, I don't think having hedged reads makes that any more or less difficult than it would otherwise be.

          Colin P. McCabe I am thinking of carelessly configured settings, not a deliberate dos.

          arpitagarwal Arpit Agarwal added a comment -

          I reviewed the v8 patch. The implementation of hedgedFetchBlockByteRange looks great. Nice use of synchronization tools to make the code easy to understand.

          how much? 5000? 500000? or something else? I think if an end user/application enables this feature explicitly, they should know a little background at least, right?
          since we have no knowledge of the end user's storage configuration (imagine they have fast flash with HDFS-2832 enabled, say FusionIO, where one real disk read probably costs only tens of microseconds),

          Liang Xie #2 and #4 from my previous comment remain unaddressed.

          Threads are not free. If you really want to provide a user configurable setting for the thread count there should be a limit on the order of 64/128. I leave the exact number to you. The best approach is to use a small multiple of the processor count.

          If an app is not well behaved then the absence of limits can create a positive feedback loop. The slower the storage layer the more threads will get created when the correct behavior under load should be back off. Please add a thread count limit or ideally let’s not expose this setting at all.

          The same goes for the delay. Please add a lower bound. The exact value is up to you. We can always revisit the value if it turns out to be a bottleneck.

          Let's keep it there; it's easier to understand for a developer or end user.

          I don't think it helps to have these functions and as Jing pointed out there is no purpose for it. I think it would be best to leave a single config setting i.e. either a boolean or a thread count, and a single method #isHedgedReadsEnabled to query the status of the feature.

          Yes, but the loop is very light: extra iterations only happen when exceptions like AccessControlException/InvalidEncryptionKeyException/InvalidBlockTokenException occur, and all of those have a fast-quit mechanism (refetchToken/refetchEncryptionKey or disableLegacyBlockReaderLocal), so the loop will only execute a very few times.
          ...
          Yes, there is a misunderstanding here; that's why I catch IOException fbae around fetchBlockAt. If we don't catch it there, the outer loop will always trigger a new refetch and we end up with a spin loop.

          I still do not understand how you are guarding against multiple refetch. Previously these counters were initialized outside any loop, now they are being reinitialized inside a loop.

          chooseDataNode(LocatedBlock block) function looks redundant and should be removed.

          cmccabe Colin P. McCabe added a comment -

          I might be misunderstanding, but it seems like this should be a client setting, not a datanode setting. Right?

          hadoopqa Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12625003/HDFS-5776-v8.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          -1 eclipse:eclipse. The patch failed to build with eclipse:eclipse.

          -1 findbugs. The patch appears to introduce 1 new Findbugs (version 1.3.9) warnings.

          -1 release audit. The applied patch generated 1 release audit warnings.

          +1 core tests. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5941//testReport/
          Release audit warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/5941//artifact/trunk/patchprocess/patchReleaseAuditProblems.txt
          Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/5941//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-hdfs.html
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5941//console

          This message is automatically generated.

          stack stack added a comment -

          I might be misunderstanding, but it seems like this should be a client setting, not a datanode setting. Right?

          Colin P. McCabe You are correct. I had it wrong. s/restart DN/restart client/regionserver/ in the above. Thanks C.

          xieliang007 Liang Xie added a comment -

          Stack

          If we are too often running the requests in the current thread because the executor has none to spare then we can up the number of pool threads (though it requires a DN restart, a PITA)?

          We don't need to restart the DN/RS or anything else; we can modify or introduce an HBase shell script to disable/enable the feature per instance, or change the thread number, or other requirements. I think it's feasible, and that work is in fact the major task for supporting hedged reads on the HBase side.

          nit: You could declare and assign in the one go rather than postpone the assign to the constructor: HEDGED_READ_METRIC = new DFSHedgedReadMetrics();

          Good suggestion, let me fix it in patch v9.

          Arpit Agarwal

          Threads are not free. If you really want to provide a user configurable setting for the thread count there should be a limit on the order of 64/128. I leave the exact number to you.

          Fine, let me introduce a hard-coded upper limit of 128 for DFS_DFSCLIENT_HEDGED_READ_THREADPOOL_SIZE.

          The same goes for the delay. Please add a lower bound. The exact value is up to you. We can always revisit the value if it turns out to be a bottleneck.

          Fine, let me introduce a hard-coded lower limit of 1ms for DFS_DFSCLIENT_HEDGED_READ_THRESHOLD_MILLIS.
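          For illustration, the clamping proposed here could look roughly like the following. The 128-thread ceiling and 1 ms floor are the values mentioned above, the default values passed to the getters are placeholders, and note Colin's pushback on hard limits further down.

          import org.apache.hadoop.conf.Configuration;

          public class HedgedReadLimitsSketch {
            // Ceiling of 128 threads, floor of 1 ms, as proposed in this comment.
            static int clampPoolSize(Configuration conf) {
              return Math.min(128,
                  conf.getInt("dfs.dfsclient.hedged.read.threadpool.size", 0));
            }

            static long clampThresholdMillis(Configuration conf) {
              return Math.max(1L,
                  conf.getLong("dfs.dfsclient.hedged.read.threshold.millis", 500L));
            }
          }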

          I think it would be best to leave a single config setting i.e. either a boolean or a thread count, and a single method #isHedgedReadsEnabled to query the status of the feature.

          Yes, that would often be ideal, but it doesn't work for the HBase scenario (Stack's consideration above is a good one). Since we made the pool "static", from a per-client view it's more flexible to provide instance-level disable/enable APIs, so that we can use an HBase shell script to control the switch per DFS client instance; that would be cooler.

          I still do not understand how you are guarding against multiple refetch. Previously these counters were initialized outside any loop, now they are being reinitialized inside a loop.

          In actualGetFromOneDataNode(), refetchToken/refetchEncryptionKey are initialized outside the while (true) loop (see lines 993-996). When we hit InvalidEncryptionKeyException/InvalidBlockTokenException, refetchToken or refetchEncryptionKey is decreased by 1 (see the refetchEncryptionKey-- and refetchToken-- statements). If the exception happens again, the check conditions will definitely fail (see "e instanceof InvalidEncryptionKeyException && refetchEncryptionKey > 0" and "refetchToken > 0"), so we go to the else clause, which executes:

                    String msg = "Failed to connect to " + targetAddr + " for file "
                        + src + " for block " + block.getBlock() + ":" + e;
                    DFSClient.LOG.warn("Connection failure: " + msg, e);
                    addToDeadNodes(chosenNode);
                    throw new IOException(msg);
          

          So later, when we call chooseDataNode, that dead node will be ignored. Hopefully this time my description is clearer than before.
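          In outline, the retry budget described above behaves like the sketch below. This is a schematic mock (its exception and helper types are stand-ins defined inside the sketch, not the HDFS classes) meant only to show that each special-case refetch is allowed once before the failure propagates.

          public class RefetchBudgetSketch {
            // Stand-in exception types; in the real code these correspond to
            // InvalidEncryptionKeyException and InvalidBlockTokenException.
            static class KeyRetryNeeded extends Exception {}
            static class TokenRetryNeeded extends Exception {}

            interface OneDatanodeRead {
              byte[] readOnce() throws KeyRetryNeeded, TokenRetryNeeded;
              void refreshEncryptionKey();
              void refreshBlockToken(); // analogous to fetchBlockAt(...)
            }

            // Each special-case refresh is allowed once; when the budget is spent
            // the exception propagates and the caller marks the node dead.
            static byte[] readWithRefetchBudget(OneDatanodeRead read) throws Exception {
              int refetchEncryptionKey = 1;
              int refetchToken = 1;
              while (true) {
                try {
                  return read.readOnce();
                } catch (KeyRetryNeeded e) {
                  if (refetchEncryptionKey <= 0) throw e; // second hit: give up
                  refetchEncryptionKey--;
                  read.refreshEncryptionKey();
                } catch (TokenRetryNeeded e) {
                  if (refetchToken <= 0) throw e;
                  refetchToken--;
                  read.refreshBlockToken();
                }
              }
            }
          }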

          chooseDataNode(LocatedBlock block) function looks redundant and should be removed.

          It is still called by blockSeekTo(long) and fetchBlockByteRange(...), but yes, we can remove it; let me fix that in patch v9.

          hadoopqa Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12625329/HDFS-5776-v9.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5948//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5948//console

          This message is automatically generated.

          cmccabe Colin P. McCabe added a comment -

          Fine, let me introduce a hard code up-limit for DFS_DFSCLIENT_HEDGED_READ_THREADPOOL_SIZE to 128.

          Please don't. There's no reason to put arbitrary limits into the code. We don't do this with any other configuration settings. At some point, you have to trust the configuration.

          we don't need to restart DN/RS or sth else, we can modify/introduce a hbase shell script to disable/enable the feature per instance or modify the thread number or other requirements, i think it's feasible, and those works, in deed, are the major task for supporting hedged read in HBase side

          Are you suggesting that we make the thread number setting changeable at runtime? That seems like a good idea, but probably something we should do as a follow-on JIRA.

          xieliang007 Liang Xie added a comment -

          Patch v10 removed the hard-coded limit per Colin's comments.
          Patch v9 keeps the hard-coded limit.
          Any more comments or +1? Personally I'd like to get the first cut into trunk and branch-2 ASAP so I can kick off the HBase-side change. More detailed disagreements can be resolved in future JIRAs, right? And since the default pool size is 0, there is no obvious foreseeable functional or performance impact on existing downstream applications.

          jingzhao Jing Zhao added a comment -

          Thanks for the work Liang Xie! I will review your latest patch and give my comments tonight (PST).

          arpitagarwal Arpit Agarwal added a comment -

          Yes, that would be perfect sometimes, but not works for HBase scenario(the above Stack's consideration is great), since we made the pool "static", and per client view, it's more flexible if we provide instance level disable/enable APIs, so we can archive to use the hbase shell script to control the switch per dfs client instance, that'll be cooler

          Okay.

          In actualGetFromOneDatanode(), the refetchToken/refetchEncryptionKey is initialized outside the while (true) loop (see Line 993-996), when we hit InvalidEncryptionKeyException/InvalidBlockTokenException, the refetchToken and refetchEncryptionKey will be decreased by 1, (see refetchEncryptionKey-- and refetchToken-- statement), if the exceptions happened again, the check conditions will be failed definitely(see "e instanceof InvalidEncryptionKeyException && refetchEncryptionKey > 0" and "refetchToken > 0"), so go to the else clause, that'll execute:

          Isn't the call to actualGetFromOneDataNode wrapped in a loop itself? I am talking about the while loop in fetchBlockByteRange. Will that not change the behavior? Maybe it is harmless, I am not sure. I just want us to be clear either way.

          Thanks for adding the thread count limit. If we need more than 128 threads per client process just for backup reads we (hdfs) need to think about proper async rpc. Suggesting a lack of limits ignores the point that it can double the DN load on an already loaded cluster. Also 1ms lower bound for the delay is as good as zero but as long as we have a thread count limit I am okay.

          Minor points that don't need to hold up the checkin:

          1. The test looks like a stress test, i.e. we are hoping that some of the hedged requests will complete before the primary requests. We can create a separate Jira to write a deterministic unit test and it’s fine if someone else picks that up later.
          2. A couple of points from my initial feedback (#10, #12) were missed but again not worth holding the checkin.

          Other than clarifying the loop behavior the v9 patch looks fine to me.

          Thanks again for working with the feedback Liang, this is a nice capability to have in HDFS.

          stack stack added a comment -

          Arpit Agarwal Would v10 be palatable? You said OK to v9 above, but Colin's review would favor v10?

          Liang Xie Can you take care of the other nits raised by Arpit Agarwal?

          Good stuff.

          hadoopqa Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12625502/HDFS-5776-v10.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5957//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5957//console

          This message is automatically generated.

          xieliang007 Liang Xie added a comment -

          Isn't the call to actualGetFromOneDataNode wrapped in a loop itself? I am talking about the while loop in fetchBlockByteRange. Will that not change the behavior? Maybe it is harmless, I am not sure. I just want us to be clear either way.

          Yes, it doesn't change the overall behavior and is harmless; indeed, it's safer than before.
          In the old implementation, refetchToken/refetchEncryptionKey were shared by all nodes chosen via chooseDataNode once a key/token exception happened. That means if the first node consumed the retry quota and then the second or third node hit a key/token exception, the clearDataEncryptionKey/fetchBlockAt operations would not be called, which is a little unfair.
          In the new implementation/patch, the second and later nodes get the same retry quota as the first node, which is fairer.
          Anyway, it doesn't change the normal path; it is just safer and fairer in the security-enabled scenario.
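
          To make that concrete, a rough sketch of the per-call counter pattern (simplified placeholder names; attemptRead/markDead are hypothetical helpers, and this is not the exact patch code):

              // Sketch: each call owns its own refetch budget, so a key/token failure
              // on one datanode does not consume the quota of another datanode.
              void readFromOneDataNode(String chosenNode) throws IOException {
                int refetchToken = 1;          // initialized per call, outside the loop
                int refetchEncryptionKey = 1;  // initialized per call, outside the loop
                while (true) {
                  try {
                    attemptRead(chosenNode);   // hypothetical helper doing the actual read
                    return;
                  } catch (SecurityException e) {  // stand-in for the key/token exceptions
                    if (refetchEncryptionKey > 0) {
                      refetchEncryptionKey--;      // refresh the encryption key, retry this node
                    } else if (refetchToken > 0) {
                      refetchToken--;              // refetch the block token, retry this node
                    } else {
                      markDead(chosenNode);        // hypothetical addToDeadNodes equivalent
                      throw new IOException("Failed to connect to " + chosenNode, e);
                    }
                  }
                }
              }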

          The test looks like a stress test, i.e. we are hoping that some of the hedged requests will complete before the primary requests. We can create a separate Jira to write a deterministic unit test and it’s fine if someone else picks that up later.

          Ok, I can track it later.

          For patch v9 or v10, both are OK with me (though our internal branch uses the style without a limit), since my original goal is to reduce HBase's P99 and P99.9 latency; there is no real difference on that point. V9 is safer, but we would probably need to modify the HDFS source again if the hard-coded limit is ever hit (which is difficult for a normal end user). IMHO, the committer who finally commits this JIRA can pick one. It would be a pity if we keep arguing over this style and hold up the progress; that doesn't help the downstream HBase project at all.

          jingzhao Jing Zhao added a comment -

          it's more flexible if we provide instance level disable/enable APIs, so we can archive to use the hbase shell script to control the switch per dfs client instance, that'll be cooler

          I still have some concern about the current implementation:
          1) we do not check threadpool in enableHedgedReads. This makes it possible that isHedgedReadsEnabled() returns true while hedged read is actually not enabled.
          2) DFSClient#setThreadsNumForHedgedReads allows users to keep changing the size of the thread pool.
          To provide instance level disable/enable APIs, I think maybe we can do the following:
          1) Read the thread pool size configuration only when initializing the thread pool, and the size should be >0 and cannot be changed.
          2) Add an "Allow-Hedged-Reads" configuration. Each DFSClient instance reads this configuration, and if it is true, checks and initializes the thread pool if necessary. Users can turn on/off the switch using the enable/disable methods. In the enable method, we check and initialize the thread pool if necessary.

          What do you think Liang Xie?
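
          As a rough illustration of 1) and 2) above (assumed names and a guessed pool construction; this is a sketch, not actual patch code):

              // Sketch: one static pool sized once from configuration, plus a
              // per-instance switch flipped by the enable/disable methods.
              private static ThreadPoolExecutor HEDGED_READ_THREAD_POOL;
              private int poolSizeConf;                   // read once from dfs.client.hedged.read.threadpool.size
              private volatile boolean allowHedgedReads;  // from the proposed "allow" config key

              private static synchronized void initThreadPoolForHedgedReads(int num) {
                if (num > 0 && HEDGED_READ_THREAD_POOL == null) {
                  HEDGED_READ_THREAD_POOL = new ThreadPoolExecutor(0, num,
                      60, TimeUnit.SECONDS, new SynchronousQueue<Runnable>());
                }
              }

              public void enableHedgedReads() {
                initThreadPoolForHedgedReads(poolSizeConf);          // initialize if necessary
                allowHedgedReads = (HEDGED_READ_THREAD_POOL != null); // a real enable or nothing
              }

              public void disableHedgedReads() {
                allowHedgedReads = false;
              }

              public boolean isHedgedReadsEnabled() {
                return allowHedgedReads && HEDGED_READ_THREAD_POOL != null;
              }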

          jingzhao Jing Zhao added a comment -

          Another thing for enoughNodesForHedgedRead. The current patch checks enoughNodesForHedgedRead before calling hedgedFetchBlockByteRange. Since the deadnodes keeps being updated while reading, we may still hit the issue where we could not easily find the second DN for reading. I think a better way is to add this check in chooseDataNode: if chooseDataNode finds that this is for seeking the second DN (if ignored is not null), and it could not immediately/easily find a DN, the chooseDataNode should skip retrying and we may want to fall back to the normal read.
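
          Roughly, that check could look like this in the node-selection path (a sketch with assumed names and simplified types, not the actual method):

              // Sketch: when selecting the node for the extra, hedged request
              // (ignored != null), do not enter the usual retry/wait loop; if no
              // other replica is immediately available, return null so the caller
              // can fall back to the normal read.
              String chooseDataNode(List<String> replicas, Set<String> deadNodes,
                                    Collection<String> ignored) throws IOException {
                for (String dn : replicas) {
                  if (!deadNodes.contains(dn) && (ignored == null || !ignored.contains(dn))) {
                    return dn;
                  }
                }
                if (ignored != null) {
                  return null;  // hedged path: no easy second choice, skip retrying
                }
                // normal path: the real code refetches block locations, waits and
                // retries; elided in this sketch.
                throw new IOException("No live datanode for block");
              }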

          xieliang007 Liang Xie added a comment -

          Could we create another JIRA to track those disagreements? I have said more than three times: the default pool size is 0, so by default there is no impact on existing applications. I am afraid it could take a week, a month, or even a year to argue these points out...
          Thanks

          arpitagarwal Arpit Agarwal added a comment -

          stack I am basically +1 on the v9 patch at this point but v10 is a step back. We need a throttle on unbounded thread growth and threadpool size is the most trivial to add. We can file a separate Jira to replace the thread pool limit with something more sophisticated e.g. the client can keep a dynamic estimate of the 95th percentile latency and use that instead of a fixed value from configuration.
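
          For example, such a follow-on could track recent read latencies in the client and hedge only once an observed high percentile has elapsed (purely illustrative, not part of any patch here):

              // Sketch: bounded window of recent read latencies; an observed
              // percentile replaces a fixed threshold-millis value.
              class HedgeThresholdEstimator {
                private final long[] window = new long[1024];
                private long count = 0;

                synchronized void recordLatencyMillis(long millis) {
                  window[(int) (count % window.length)] = millis;
                  count++;
                }

                synchronized long thresholdMillis(double percentile) {
                  int n = (int) Math.min(count, window.length);
                  if (n == 0) {
                    return 10;  // fall back to a small fixed delay until samples arrive
                  }
                  long[] copy = Arrays.copyOf(window, n);
                  Arrays.sort(copy);
                  int idx = Math.min(n - 1, (int) Math.floor(percentile * n));
                  return copy[idx];
                }
              }

          A pread would then start its hedged request only after roughly thresholdMillis(0.95) milliseconds without a response.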

          Jing mentioned some issues that look fairly easy to address.

          In the old impl, the refetchToken/refetchEncryptionKey are shared by all nodes from chooseDataNode once key/token exception happened. that means if the first node consumed this retry quota, then if the second or third node hit the key/token exception, clearDataEncryptionKey/fetchBlockAt opeerations will not be called, it's a little unfair

          Liang Xie That makes sense, thanks for the clarification.

          sureshms Suresh Srinivas added a comment -

          Could we create another JIRA to track those disagreement? I have said more than three times: the default pool size is 0, so no hurt for all of existing applications by default.

          The fact that the issue is brought up many times means that there is an issue that needs to be discussed and resolved.

          I guess it's possible cost one week, one month even one year to argue them...

          If takes more time, so be it. There are many committers who have spent time reviewing and commenting. I understand this is an important feature and the need to get it done sooner. But the core issues must be solved in this jira instead of pushing it to another jira.

          cmccabe Colin P. McCabe added a comment -

          Arpit Agarwal : if I understand your comments correctly, you are concerned that hedged reads may spawn too many threads. But that's why dfs.client.hedged.read.threadpool.size exists. The DFSClient will not create more threads than this.

          We do not check other configuration settings to see if they are "reasonable." For example, if someone wants to set dfs.balancer.dispatcherThreads, dfs.balancer.moverThreads, or dfs.datanode.max.transfer.threads to a zillion, we don't complain. If we tried to set hard limits everywhere, people with different needs would have to recompile hadoop to meet those needs.

          Please remember that, if the client wants to, he/she can sit in a loop and call new Thread(...). It's not like by giving users the ability to control the number of threads they use, we are opening up some new world of security vulnerabilities. The ability for the client to create any number of threads already exists. And it only inconveniences one person: the client themselves.

          Suresh Srinivas: I agree that we should figure out the configuration issues here rather than changing the configuration in an incompatible way later. Jing suggested adding "an Allow-Hedged-Reads configuration" boolean. That certainly seems to solve the problem of having different threads use different settings. Is there any objection, besides the inelegance of having two configs rather than one?

          sureshms Suresh Srinivas added a comment -

          We do not check other configuration settings to see if they are "reasonable."

          Colin P. McCabe, I agree with the points you have made. Checking for reasonable value for the new config does not seem necessary.

          stack stack added a comment -

          Thanks lads. We are almost there.

          Liang Xie It is better if we work through the issues here before the patch goes in, especially while you have the attention of quality reviewers. From your POV, I'm sure it is a little frustrating trying to drive the patch home between differing opinions (the time difference doesn't help either – smile). Try to salve any annoyance with the thought that, though it may appear otherwise, folks here are trying to work together to help get the best patch in. Good on you Liang.

          Liang Xie I'd agree with the last few Jing Zhao review comments. What do you think?

          Arpit Agarwal Do you buy Colin P. McCabe's argument? It is good by me. If you agree, let's shift the focus to v10 and leave the v9 style behind.

          Good stuff

          arpitagarwal Arpit Agarwal added a comment -

          I've stated my concerns, but if there is broad consensus that we don't need caps, I won't hold up the checkin.

          xieliang007 Liang Xie added a comment -

          we do not check threadpool in enableHedgedReads. This makes it possible that isHedgedReadsEnabled() returns true while hedged read is actually not enabled.

          I can change it to something like this if you guys want:

           return allowHedgedReads && (HEDGED_READ_THREAD_POOL != null) && HEDGED_READ_THREAD_POOL.getMaximumPoolSize() > 0;
          

          What do you think?

          DFSClient#setThreadsNumForHedgedReads allows users to keep changing the size of the thread pool.

          we definitely need the ability to modify the pool size on the fly, especially for HBase ops.
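
          For what it is worth, a plain java.util.concurrent.ThreadPoolExecutor can be resized after construction, so an ops-facing hook could be as small as this (illustrative only; it assumes the hedged pool keeps corePoolSize at 0, and it is not what the current patch does):

              // Sketch: resize an existing pool in place, no client restart needed.
              static void resizeHedgedReadPool(ThreadPoolExecutor pool, int newSize) {
                if (pool != null && newSize > 0 && newSize != pool.getMaximumPoolSize()) {
                  pool.setMaximumPoolSize(newSize);  // applies to subsequently submitted reads
                }
              }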

          Read the thread pool size configuration only when initializing the thread pool, and the size should be >0 and cannot be changed

          This is the same disagreement as before. If you all still insist on making the pool size read-only, I can upload a new patch. From my previous operational experience, a read-only setting is quite inconvenient for a system ops/admin.

          xieliang007 Liang Xie added a comment -

          I think a better way is to add this check in chooseDataNode: if chooseDataNode finds that this is for seeking the second DN (if ignored is not null), and it could not immediately/easily find a DN, the chooseDataNode should skip retrying and we may want to fall back to the normal read.

          Yeah, that sounds reasonable; I will look into it once I get a chance.
          P.S. I am taking an 8+ day holiday (China Spring Festival) and probably cannot reply or post a patch promptly, sorry.

          Happy Holiday to all guys, thanks for looking at this JIRA !!!

          stack stack added a comment -

          what do you think ?

          That looks good to me Liang Xie

          ...making the pool size readonly, i can reupload a new patch.

          We can add back the flexibility in a later issue – i.e. being able to adjust the pool size on the fly. I suggest posting a patch where the pool size is read from the configuration and is read-only post construction. It would address the above reviewer's concern and, I believe, all outstanding concerns.

          Base your revision on v10 if you don't mind.

          xieliang007 Liang Xie added a comment -

          Attached v11:
          1) modified isHedgedReadsEnabled() to consider the pool size as well
          2) made setThreadsNumForHedgedReads "private" so the thread number cannot be changed dynamically from the client side, and removed "synchronized" as well.

          hadoopqa Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12625869/HDFS-5776-v11.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          -1 findbugs. The patch appears to introduce 1 new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.hdfs.qjournal.client.TestQuorumJournalManager

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5977//testReport/
          Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/5977//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-hdfs.html
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5977//console

          This message is automatically generated.

          stack stack added a comment -

          Address the findbugs warning.

          Jing Zhao Does this patch address your concerns? (Thanks for the review)

          hadoopqa Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12625915/HDFS-5776-v12.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.hdfs.server.namenode.TestAuditLogs

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5979//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5979//console

          This message is automatically generated.

          stack stack added a comment -

          Failure seems unrelated. Let me try again to be sure.

          jingzhao Jing Zhao added a comment -

          Thanks for updating the patch, Liang Xie and stack.

          stack, so the latest patch changes setThreadsNumForHedgedReads to private and aims to make users unable to "change the thread number from client side dynamically". However, users can still create their own configuration object, change the thread pool size configuration, create a DFSClient instance, and change the thread number? So I think we may want to make this cleaner. Specifically,

          1. The first DFSClient that tries to enable hedged reads should initialize the thread pool (in the DFSClient constructor or in the enable method), so that enabling actually takes effect
          2. Changing the thread pool size (if it is necessary at all) should still go through a setThreadsNumForHedgedReads method (instead of the DFSClient constructor), so that a client cannot silently change the size of the thread pool

          Besides, the current patch has not addressed the comment for enoughNodesForHedgedRead/chooseDataNode.

          stack stack added a comment -

          Jing Zhao Thanks for the new input. Please help me better understand what you mean by making more clean so we can adjust the patch accordingly.

          Hedged reads are set on or off in the client configuration xml and per DFSClient instance can be enabled/disabled as you go. Yes, you could read code and figure that it is possible to do some heavyweight gymnastics creating your own Configuration – expensive – and a new DFSClient – ditto – if you wanted to work around whatever is out in the configuration xml. That seems fine by me especially as there is no real means of shutting down this access route.

          Pardon me but I do not follow what you are asking for in 1. Maybe you are referring to a 'hole' where if the thread count is <= 0 on construction, the enable will have no effect – and you want it to have an 'effect' post construction?

          For 2., are you suggesting that setThreadsNumForHedgedReads not be private but instead be an available API for the DFSClient to toggle as it sees fit?

          I'll let @liang xie address your enoughNodesForHedgedRead comment.

          Thanks for checking back.

          hadoopqa Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12625990/HDFS-5776-v12.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.hdfs.server.namenode.TestNameNodeHttpServer

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5984//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5984//console

          This message is automatically generated.

          jingzhao Jing Zhao added a comment -

          Thanks for the feedback stack.

          My first question is, how will HBase use these enable/disable/setThreadsNumForHedgedReads APIs defined in DFSClient? DFSClient's interface audience is private, and DistributedFileSystem#getClient is also private in HDFS. I have not seen these APIs defined in DistributedFileSystem/FileContext in the current patch, which means they will be added in a separate jira? In that case, we could actually remove all these APIs from the current patch and discuss how to define them in that new jira?

          the enable will have no effect

          Yes, if the size of the thread pool is still 0 after enableHedgedRead is called, hedged reads will not really be enabled, right? This makes this API really confusing. Or we can add a javadoc for this method saying "note: this method may not really enable the hedged read, you still need to check the size of the thread pool..."?

          do some heavyweight gymnastics creating your own Configuration – expensive – and a new DFSClient – ditto

          I assume we can have multiple DFSClient instances here since we want to do enable/disable per DFSClient instance? And calling the Configuration#set method to programmatically change the setting of the thread pool size may not be some heavyweight gymnastics. Thus while we aim to disallow users to change the thread number from client side dynamically, users can easily change the thread pool setting in an existing configuration object and use it when creating the next DFSClient instance?

          For 2, actually I do not quite understand the necessity of changing the thread pool size on the fly. I think we should rename setThreadsNumForHedgedReads to initializeThreadPoolForHedgedReads, and remove the "else" section from that method. But if it is really necessary to support this functionality, let's define a clear setThreadsNumForHedgedReads method instead of silently changing the thread pool size in the constructor of DFSClient.

          stack stack added a comment -

          Jing Zhao Thanks for taking the time to look and the great feedback. On the APIs, you make a good point. I can imagine that the notion is that clients such as HBase would selectively enable this feature given there is an associated 'cost'. An approach where we'd enable it on creation only by jiggering the Configuration we pass seems fine for a first cut at least but would imply unhinging threads == 0 as an indicator of enabledness. Liang Xie What you reckon boss? I could cast a patch this way if you are busy. Would it work for your case? Thanks.
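
          Concretely, enabling on creation from the client side would just mean setting the keys before the FileSystem (and its DFSClient) is constructed; a sketch, assuming the property names used elsewhere in this issue:

              // Sketch: a pool size > 0 turns the feature on for this client.
              Configuration conf = new Configuration();
              conf.setInt("dfs.client.hedged.read.threadpool.size", 20);
              conf.setLong("dfs.client.hedged.read.threshold.millis", 10);  // wait before hedging
              FileSystem fs = FileSystem.get(conf);  // the DFSClient underneath picks these up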

          xieliang007 Liang Xie added a comment -

          Stack, yeah, I need your help, thanks! It is really inconvenient for me to make a new patch in the next two or three days. For the first cut, this should be OK for us on the HBase side.

          stack stack added a comment -

          Addressing Jing Zhao's concerns: removed all the setters, redid enoughNodesForHedgedReads, and fixed an issue with the countdown latch. Not done yet (a test in TestPread fails) and looking to add more....

          hadoopqa Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12627092/HDFS-5776-v13.wip.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          -1 javac. The patch appears to cause the build to fail.

          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/6036//console

          This message is automatically generated.

          xieliang007 Liang Xie added a comment -

          I am back from holiday now; I will try to catch up on the context above first and then put up a new patch shortly.

          stack stack added a comment -

          I've been a slacker here Liang Xie. Here is what I have rebased. Not done yet. Needs more work. Let me try and finish it up...

          hadoopqa Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12627780/HDFS-5776-v14.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          -1 javac. The patch appears to cause the build to fail.

          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/6087//console

          This message is automatically generated.

          xieliang007 Liang Xie added a comment -

          Thank you, Michael!

          xieliang007 Liang Xie added a comment -

          We need an NPE check here, since chosenNode is re-initialized in every iteration of the while loop:

                  // We got here if exception.  Ignore this node on next go around.
                  if (chosenNode != null) {
                    ignored.add(chosenNode.info);
                  }
          

          After this change, the TestPread case passes locally.
          stack, I see you already addressed Jing Zhao's previous comment about "enoughNodesForHedgedReads", right? Thanks!
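
          For context, here is a minimal, standalone sketch of the loop shape in question (the Node, chooseNode, and readFrom names are illustrative placeholders, not the actual DFSInputStream code): chosenNode is re-initialized to null at the top of each iteration, so an exception thrown before a node has been picked would otherwise trip an NPE when we add the node to the ignored list.

            import java.io.IOException;
            import java.util.ArrayList;
            import java.util.List;

            public class HedgedReadRetrySketch {
              static class Node {
                final String info;
                Node(String info) { this.info = info; }
              }

              // Illustrative retry loop: a failed read marks the chosen node as ignored
              // so the next iteration picks a different replica.
              static byte[] readWithRetries(List<Node> nodes) throws IOException {
                List<String> ignored = new ArrayList<String>();
                while (true) {
                  Node chosenNode = null;                    // re-initialized every iteration
                  try {
                    chosenNode = chooseNode(nodes, ignored); // may throw before a node is picked
                    return readFrom(chosenNode);
                  } catch (IOException e) {
                    // We got here if exception. Ignore this node on next go around.
                    if (chosenNode != null) {                // NPE guard: chooseNode may have thrown
                      ignored.add(chosenNode.info);
                    } else {
                      throw e;                               // no node could be chosen; give up
                    }
                  }
                }
              }

              static Node chooseNode(List<Node> nodes, List<String> ignored) throws IOException {
                for (Node n : nodes) {
                  if (!ignored.contains(n.info)) {
                    return n;
                  }
                }
                throw new IOException("no live replicas left to try");
              }

              static byte[] readFrom(Node n) throws IOException {
                return new byte[0];                          // placeholder for the actual block read
              }
            }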

          stack stack added a comment -

          Here is how far I've gotten so far (I added your suggested NPE check and added the missing file). I'm not done yet. Yes, I'm in the middle of addressing Jing Zhao's comments, Liang Xie. Let me finish up....

          hadoopqa Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12627920/HDFS-5776-v15.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. There were no new javadoc warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.hdfs.server.namenode.ha.TestHASafeMode

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/6092//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/6092//console

          This message is automatically generated.

          stack stack added a comment -

          Addresses the nice feedback from Jing Zhao. Removed the ability to enable/disable/resize the hedged read pool after construction of DFSClient, and added handling for the case where the pipeline member count could change under us while doing hedged reads because of node death. Tests pass locally. Will try it on a cluster now, but this posting should be good for review (thanks in advance).

          hadoopqa Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12628606/HDFS-5776-v17.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. There were no new javadoc warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.hdfs.server.balancer.TestBalancerWithNodeGroup

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/6131//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/6131//console

          This message is automatically generated.

          stack stack added a comment -

          Looks like unrelated failure. Let me resubmit.

          hadoopqa Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12628665/HDFS-5776-v17.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. There were no new javadoc warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/6137//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/6137//console

          This message is automatically generated.

          jingzhao Jing Zhao added a comment -

          Thanks for updating the patch, stack. The latest patch looks great to me. Just a couple of minor comments (see the sketch below):

          1. In DFSClient#initThreadNumForHedgedReads, do we need to check whether HEDGED_READ_THREAD_POOL has already been created?
          2. nit: we may need an @Override annotation here?
            +        new ThreadPoolExecutor.CallerRunsPolicy() {
            +      public void rejectedExecution(Runnable runnable,
            +          ThreadPoolExecutor e) {
            +        LOG.info("Execution rejected, Executing in current thread");
            +        HEDGED_READ_METRIC.incHedgedReadOpsInCurThread();
            +        // will run in the current thread
            +        super.rejectedExecution(runnable, e);
            +      }
            

          +1 after addressing the comments.
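
          For illustration, here is a minimal, self-contained sketch of the two points above, loosely modeled on the DFSClient#initThreadNumForHedgedReads and HEDGED_READ_THREAD_POOL names referenced in this review (the body is only a sketch, not the actual DFSClient code, and the metric call is stood in for by a plain counter): the init method returns early if the pool has already been created, and the rejection hook carries @Override.

            import java.util.concurrent.SynchronousQueue;
            import java.util.concurrent.ThreadPoolExecutor;
            import java.util.concurrent.TimeUnit;
            import java.util.concurrent.atomic.AtomicLong;

            public class HedgedReadPoolSketch {
              private static ThreadPoolExecutor HEDGED_READ_THREAD_POOL;
              private static final AtomicLong REJECTED_TO_CALLER_THREAD = new AtomicLong();

              // Point 1: do not re-create the shared static pool if an earlier
              // client instance has already set it up.
              static synchronized void initThreadPoolForHedgedReads(int num) {
                if (HEDGED_READ_THREAD_POOL != null) {
                  return;
                }
                HEDGED_READ_THREAD_POOL = new ThreadPoolExecutor(0, num, 60, TimeUnit.SECONDS,
                    new SynchronousQueue<Runnable>(),
                    // Point 2: @Override on the overridden rejection hook.
                    new ThreadPoolExecutor.CallerRunsPolicy() {
                      @Override
                      public void rejectedExecution(Runnable runnable, ThreadPoolExecutor e) {
                        REJECTED_TO_CALLER_THREAD.incrementAndGet(); // stands in for the metric bump
                        super.rejectedExecution(runnable, e);        // runs the task in the caller's thread
                      }
                    });
                HEDGED_READ_THREAD_POOL.allowCoreThreadTimeOut(true);
              }
            }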

          stack stack added a comment -

          Made minimal changes to address Jing Zhao's review comments (the first one is a good catch).

          hadoopqa Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12629137/HDFS-5776v18.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. There were no new javadoc warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.hdfs.server.namenode.TestCacheDirectives

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/6160//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/6160//console

          This message is automatically generated.

          stack stack added a comment -

          This patch has a few small differences that come of some time spent testing:

          1. Adds DEBUG level logging of the one-time setup of the hedged reads pool.
          2. Gives the hedged read pool threads a 'hedged' prefix.
          3. Changes the 'cancel' behavior so it does NOT cancel ongoing reads.

          3. is the biggest change. What I've found is that HDFS reads do not take kindly to being interrupted. The exceptions that bubble up come in a few varieties – InterruptedIOException, ClosedByInterruptException, and IOEs whose cause is an InterruptedException – but I also encountered complaints coming out of protobuf message decoding, likely because the read was cancelled partway through. Then there was a bunch of logging noise – WARN-level logging – because of the interrupt exceptions and the fact that on interrupt, the node we were reading against would get added to the dead node list.

          I had a patch that went further in dealing w/ the interrupt exceptions and reworking the WARNs, but it was getting very involved and I was coming to rely on an untrodden path, that of interrupted reads... so I let it go for now.

          This patch lets outstanding reads finish.

          Let me chat w/ Liang Xie to possibly get production numbers on benefit of patch as is.
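
          For the "do not cancel ongoing reads" behavior described in point 3, here is a rough, standalone sketch of the pattern using a plain CompletionService with placeholder read tasks (this is not the DFSInputStream implementation): the first read gets a head start, a hedge is submitted only if the first read has not returned within the threshold, and whichever result arrives first is used while the losing task is simply left to finish on its own rather than being interrupted.

            import java.util.concurrent.Callable;
            import java.util.concurrent.CompletionService;
            import java.util.concurrent.ExecutionException;
            import java.util.concurrent.ExecutorCompletionService;
            import java.util.concurrent.ExecutorService;
            import java.util.concurrent.Future;
            import java.util.concurrent.TimeUnit;

            public class HedgedReadSketch {
              // Returns the first result to arrive. The slower task is NOT interrupted;
              // it finishes on its own and its result is simply discarded.
              static byte[] hedgedPread(ExecutorService pool, Callable<byte[]> primary,
                  Callable<byte[]> hedge, long thresholdMillis)
                  throws InterruptedException, ExecutionException {
                CompletionService<byte[]> cs = new ExecutorCompletionService<byte[]>(pool);
                cs.submit(primary);
                // Give the primary read a head start of thresholdMillis.
                Future<byte[]> first = cs.poll(thresholdMillis, TimeUnit.MILLISECONDS);
                if (first != null) {
                  return first.get();            // primary came back within the threshold
                }
                cs.submit(hedge);                // start a read against a different replica
                // Take whichever read completes first; do not cancel the other one.
                return cs.take().get();
              }
            }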

          hadoopqa Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12629871/HDFS-5776v21.txt
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 1 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. There were no new javadoc warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/6178//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/6178//console

          This message is automatically generated.

          xieliang007 Liang Xie added a comment -

          Huge thanks, Stack! Here are the YCSB test results w/ the latest v21 patch:
          My test env, settings, and data are the same as in the previous runs.

          hedged read thread pool size = 0:

          [OVERALL], Throughput(ops/sec), 248.69758063844773
          [READ], AverageLatency(us), 40152.776944654586
          [READ], 50thPercentileLatency(us), 21023
          [READ], 95thPercentileLatency(us), 151130
          [READ], 99thPercentileLatency(us), 255973
          [READ], 99.9thPercentileLatency(us), 420971  
          

          hedged read thread pool size = 50, hedged read timeout = 250ms :

          [OVERALL], Throughput(ops/sec), 231.90953643421446 
          [READ], AverageLatency(us), 43094.41323982726 
          [READ], 50thPercentileLatency(us), 23280
          [READ], 95thPercentileLatency(us), 160024
          [READ], 99thPercentileLatency(us), 258110
          [READ], 99.9thPercentileLatency(us), 278328
          

          hedged read thread pool size = 50, hedged read timeout = 150ms :

          [OVERALL], Throughput(ops/sec), 240.76033437917079
          [READ], AverageLatency(us), 41507.364070480435
          [READ], 50thPercentileLatency(us), 23316
          [READ], 95thPercentileLatency(us), 158907
          [READ], 99thPercentileLatency(us), 168504
          [READ], 99.9thPercentileLatency(us), 218926
          

          hedged read thread pool size = 50, hedged read timeout = 100ms :

          [OVERALL], Throughput(ops/sec), 271.0941444451295
          [READ], AverageLatency(us), 36863.66854547243
          [READ], 50thPercentileLatency(us), 21371
          [READ], 95thPercentileLatency(us), 114943
          [READ], 99thPercentileLatency(us), 121673
          [READ], 99.9thPercentileLatency(us), 195467
          
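          For reference, the knobs varied in the runs above correspond to two client-side configuration properties; a minimal usage sketch follows (the 50-thread / 100 ms values are just the last case above, and the file path is a placeholder):

            import org.apache.hadoop.conf.Configuration;
            import org.apache.hadoop.fs.FileSystem;
            import org.apache.hadoop.fs.Path;

            public class HedgedReadConfigExample {
              public static void main(String[] args) throws Exception {
                Configuration conf = new Configuration();
                // Number of threads dedicated to hedged reads in this client (0 disables the feature).
                conf.setInt("dfs.client.hedged.read.threadpool.size", 50);
                // How long to wait for the first read before starting a hedged read.
                conf.setLong("dfs.client.hedged.read.threshold.millis", 100);
                FileSystem fs = FileSystem.get(conf);
                // Positional reads (preads) issued through this client may now be hedged.
                fs.open(new Path("/some/file")).close();
              }
            }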
          stack stack added a comment -

          Thanks for the numbers Liang Xie. I ran some loadings yesterday and saw little discernible overall difference, in spite of my flushing the file system cache with regularity (good news: no errors). Today I was going to try and set up measurement of the 99th percentile, etc., but you did the work. Thanks.

          Hopefully the +1s still stand (if anything, this final patch is more conservative than the one that got the original +1s). I intend to commit this tomorrow unless there is an objection. I will then backport to branch-2.

          xieliang007 Liang Xie added a comment -

          little discernible overall difference in spite of my flushing file system cache

          You need a test data set much larger than physical memory, so that lots of HBase reads go to disk. If the disk contention is big enough (e.g. await from iostat reaching tens or even hundreds of ms), then the slow disks will make the difference obvious. That's why I set up my test env with only one SATA disk per DN instance; that way less test data needs to be loaded to observe a difference.

          hudson Hudson added a comment -

          SUCCESS: Integrated in Hadoop-trunk-Commit #5216 (See https://builds.apache.org/job/Hadoop-trunk-Commit/5216/)
          HDFS-5776 Support 'hedged' reads in DFSClient (stack: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1571467)

          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSHedgedReadMetrics.java

          HDFS-5776 Support 'hedged' reads in DFSClient (stack: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1571466)
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClient.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClientFaultInjector.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSConfigKeys.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSInputStream.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestPread.java
          stack stack added a comment -

          What I applied to branch-2 (TestPread needed a little jiggering).

          stack stack added a comment -

          Committed to trunk, branch-2 and branch-2.4. Thanks for the sweet feature Liang Xie

          hudson Hudson added a comment -

          SUCCESS: Integrated in Hadoop-Yarn-trunk #492 (See https://builds.apache.org/job/Hadoop-Yarn-trunk/492/)
          HDFS-5776 Support 'hedged' reads in DFSClient (stack: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1571467)

          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSHedgedReadMetrics.java

          HDFS-5776 Support 'hedged' reads in DFSClient (stack: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1571466)
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClient.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClientFaultInjector.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSConfigKeys.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSInputStream.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestPread.java
          hudson Hudson added a comment -

          SUCCESS: Integrated in Hadoop-Hdfs-trunk #1684 (See https://builds.apache.org/job/Hadoop-Hdfs-trunk/1684/)
          HDFS-5776 Support 'hedged' reads in DFSClient (stack: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1571467)

          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSHedgedReadMetrics.java

          HDFS-5776 Support 'hedged' reads in DFSClient (stack: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1571466)
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClient.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClientFaultInjector.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSConfigKeys.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSInputStream.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestPread.java
          hudson Hudson added a comment -

          SUCCESS: Integrated in Hadoop-Mapreduce-trunk #1709 (See https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1709/)
          HDFS-5776 Support 'hedged' reads in DFSClient (stack: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1571467)

          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSHedgedReadMetrics.java

          HDFS-5776 Support 'hedged' reads in DFSClient (stack: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1571466)
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClient.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClientFaultInjector.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSConfigKeys.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSInputStream.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestPread.java
          cnauroth Chris Nauroth added a comment -

          FYI, I've discovered that the DFSClient can hang indefinitely when using hedged reads if all eligible datanodes die. This bug is present in 2.4.0. I've posted a patch on HDFS-6231 to fix it, hopefully for inclusion in 2.4.1.


            People

            • Assignee:
              xieliang007 Liang Xie
              Reporter:
              xieliang007 Liang Xie
            • Votes:
              0
              Watchers:
              28
