[HDFS-6229] Race condition in failover can cause RetryCache fail to work - ASF JIRA

Voters

Watch issue

Watchers

Create sub-task

Link

Clone

Update Comment Author

Replace String in Comment

Update Comment Visibility

Delete Comments

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 2.1.0-beta
Fix Version/s: 2.4.1
Component/s: ha
Labels:
None

Hadoop Flags:

Reviewed

Description

Currently when NN failover happens, the old SBN first sets its state to active, then starts the active services (including tailing all the remaining editlog and building a complete retry cache based on the editlog). If a retry request, which has already succeeded in the old ANN (but the client fails to receive the response), comes in between, this retry may still get served by the new ANN but miss the retry cache.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HDFS-6229.000.patch
10/Apr/14 20:09
5 kB
Jing Zhao
retrycache-race.patch
10/Apr/14 18:22
2 kB
Jing Zhao

Activity

Comment

This comment will be Viewable by All Users Viewable by All Users

Cancel

People

Assignee:: Jing Zhao

Reporter:: Jing Zhao

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 10/Apr/14 18:05

Updated:: 30/Jun/14 08:18

Resolved:: 11/Apr/14 16:48

Agile

View on Board

Race condition in failover can cause RetryCache fail to work

Details

Description

Attachments

Attachments

Activity

People

Dates

Agile

Slack

Issue deployment