[HDFS-7915] The DataNode can sometimes allocate a ShortCircuitShm slot and fail to tell the DFSClient about it because of a network error - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 2.7.0
Fix Version/s: 2.7.0, 2.6.1, 3.0.0-alpha1
Component/s: None
Labels: 2.6.1-candidate
Target Version/s: 2.7.0

Description

The DataNode can sometimes allocate a ShortCircuitShm slot and then fail to tell the DFSClient about it because of a network error. In DataXceiver#requestShortCircuitFds, the DataNode can succeed at the first step (marking the slot as used) and fail at the second (telling the DFSClient what it did). The "try" block that unregisters the slot covers only a failure in the first step, not the second. As a result, the DFSClient's and the DataNode's views of which slots are allocated can diverge.
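
The failure pattern described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the actual patch: SlotRegistry, ResponseSender, requestFdsBuggy, and requestFdsFixed are hypothetical stand-ins, and the real DataXceiver#requestShortCircuitFds code is more involved. The sketch only shows why cleanup must also cover the step that sends the response.

```java
import java.io.IOException;
import java.util.HashSet;
import java.util.Set;

final class SlotCleanupSketch {
    /** Hypothetical stand-in for the DataNode's record of allocated slots. */
    static final class SlotRegistry {
        private final Set<Integer> used = new HashSet<>();

        synchronized void register(int slotId) {
            if (!used.add(slotId)) {
                throw new IllegalStateException("slot already in use: " + slotId);
            }
        }

        synchronized void unregister(int slotId) {
            used.remove(slotId);
        }

        synchronized boolean isRegistered(int slotId) {
            return used.contains(slotId);
        }
    }

    /** Hypothetical stand-in for writing the response back to the client. */
    interface ResponseSender {
        void send(int slotId) throws IOException;
    }

    /** Buggy shape: a failed send leaves the slot registered on the server only. */
    static void requestFdsBuggy(SlotRegistry registry, int slotId, ResponseSender sender)
            throws IOException {
        registry.register(slotId); // step 1: mark the slot as used
        sender.send(slotId);       // step 2: no cleanup if this throws
    }

    /** Fixed shape: the cleanup covers the send as well. */
    static void requestFdsFixed(SlotRegistry registry, int slotId, ResponseSender sender)
            throws IOException {
        boolean success = false;
        registry.register(slotId);
        try {
            sender.send(slotId);
            success = true;
        } finally {
            if (!success) {
                // The client never learned about the slot, so release it.
                registry.unregister(slotId);
            }
        }
    }

    public static void main(String[] args) {
        // Simulates a network error while telling the client about the slot.
        ResponseSender failing = id -> { throw new IOException("simulated network error"); };

        SlotRegistry buggy = new SlotRegistry();
        try {
            requestFdsBuggy(buggy, 1, failing);
        } catch (IOException expected) {
            // The send failed; the slot is still marked as used.
        }
        System.out.println("buggy leaked slot: " + buggy.isRegistered(1));

        SlotRegistry fixed = new SlotRegistry();
        try {
            requestFdsFixed(fixed, 1, failing);
        } catch (IOException expected) {
            // The send failed; the finally block released the slot.
        }
        System.out.println("fixed leaked slot: " + fixed.isRegistered(1));
    }
}
```

In the buggy shape, the server keeps the slot marked as used even though the client never heard about it, which is the divergence the description refers to. In the fixed shape, the slot is released whenever the response does not go out.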

Attachments

- HDFS-7915.001.patch, 11/Mar/15 10:01, 13 kB, Colin McCabe
- HDFS-7915.002.patch, 12/Mar/15 00:08, 13 kB, Colin McCabe
- HDFS-7915.004.patch, 12/Mar/15 23:47, 21 kB, Colin McCabe
- HDFS-7915.005.patch, 14/Mar/15 01:28, 21 kB, Colin McCabe
- HDFS-7915.006.patch, 14/Mar/15 02:51, 21 kB, Colin McCabe
- HDFS-7915.branch-2.6.patch, 14/Aug/15 04:19, 21 kB, Akira Ajisaka

Issue Links

- breaks: HDFS-8070 Pre-HDFS-7915 DFSClient cannot use short circuit on post-HDFS-7915 DataNode (Closed)
- is broken by: HDFS-9466 TestShortCircuitCache#testDataXceiverCleansUpSlotsOnFailure is flaky (Resolved)
- is related to: HADOOP-11802 DomainSocketWatcher thread terminates sometimes after there is an I/O error during requestShortCircuitShm (Closed)

People

Assignee: Colin McCabe
Reporter: Colin McCabe
Votes: 0
Watchers: 16

Dates

Created: 11/Mar/15 06:09
Updated: 30/Jun/17 13:36
Resolved: 15/Mar/15 08:42
