[HDFS-5446] Consider supporting a mechanism to allow datanodes to drain outstanding work during rolling upgrade

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Won't Fix
Affects Version/s: 2.2.0
Fix Version/s: None
Component/s: datanode
Labels: None

Description

Rebuilding write pipelines is expensive, and it can happen many times during a rolling restart of datanodes (for example, during a rolling upgrade). It might help if datanodes could be told to drain their current work while rejecting new requests, possibly with a new response indicating that the node is temporarily unavailable (it's not broken, it's just going through a maintenance phase in which it shouldn't accept new work).

Waiting just a few seconds is normally enough to let a good percentage of the open requests complete without error, which reduces the overhead of restarting lots of datanodes in rapid succession.

This would obviously need a timeout to make sure the datanode doesn't wait forever.
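
The proposal can be illustrated with a small, language-neutral sketch. This is not HDFS code and not an existing datanode API: the class `DrainableNode`, its methods, and the `TEMPORARILY_UNAVAILABLE` response are all hypothetical names chosen for illustration. It assumes a node that counts in-flight requests, refuses new ones once draining starts, and gives up waiting after a timeout.

```python
import threading
import time
from enum import Enum


class Response(Enum):
    ACCEPTED = "accepted"
    # Hypothetical status: the node is healthy but in a maintenance phase.
    TEMPORARILY_UNAVAILABLE = "temporarily_unavailable"


class DrainableNode:
    """Illustrative stand-in for a datanode; not an HDFS class."""

    def __init__(self):
        self._cond = threading.Condition()
        self._in_flight = 0
        self._draining = False

    def begin_request(self):
        """Admit a request unless the node is draining."""
        with self._cond:
            if self._draining:
                return Response.TEMPORARILY_UNAVAILABLE
            self._in_flight += 1
            return Response.ACCEPTED

    def end_request(self):
        """Mark a previously admitted request as finished."""
        with self._cond:
            self._in_flight -= 1
            if self._in_flight == 0:
                self._cond.notify_all()

    def drain(self, timeout_seconds):
        """Stop admitting work and wait for in-flight requests to finish.

        Returns the number of requests still outstanding when the wait
        ends: 0 if everything drained, more if the timeout expired first.
        """
        deadline = time.monotonic() + timeout_seconds
        with self._cond:
            self._draining = True
            while self._in_flight > 0:
                remaining = deadline - time.monotonic()
                if remaining <= 0:
                    break  # timed out; caller restarts anyway
                self._cond.wait(remaining)
            return self._in_flight
```

In this sketch, a restart script would call `drain()` with a timeout of a few seconds and then restart the node regardless of the return value; a non-zero result only indicates how many requests would still have to be recovered the expensive way.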

People

Assignee: Unassigned

Reporter: Nathan Roberts

Votes: 1

Watchers: 15

Dates

Created: 31/Oct/13 17:53

Updated: 13/Feb/14 13:50

Resolved: 07/Feb/14 23:09