Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: QuorumJournalManager (HDFS-3077)
    • Component/s: None
    • Labels: None

      Description

      This is a potential optimization we can add to the JournalNode: when one of the nodes is lagging behind the others (e.g., because its local disk is slower or there was a network blip), it receives edits after they've already been committed to a majority. The node can tell this because the committed txid included in the request info is higher than the highest txid in the actual batch to be written. In that case, we know the batch has already been fsynced to a quorum of nodes, so we can skip the fsync() on the laggy node, helping it catch back up.
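
      A minimal sketch of the check described above, with hypothetical method and parameter names (the actual change is in the attached hdfs-3885.txt):

        // Sketch only: decide whether a JournalNode must fsync a batch of edits.
        class LaggingWriteCheck {
          /**
           * @param committedTxId highest txid the writer reports as committed on a
           *                      quorum (carried in the request info)
           * @param firstTxnId    first txid in this batch
           * @param numTxns       number of txns in this batch
           * @return false when the committed txid is higher than the batch's highest
           *         txid, i.e. the batch is already durable on a quorum and the
           *         local fdatasync() can be skipped
           */
          static boolean shouldFsync(long committedTxId, long firstTxnId, int numTxns) {
            long lastTxnId = firstTxnId + numTxns - 1;
            boolean alreadyDurableOnQuorum = lastTxnId < committedTxId;
            return !alreadyDurableOnQuorum;
          }
        }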

      Attachments

      1. hdfs-3885.txt (11 kB, Todd Lipcon)

        Activity

        stepinto Chao Shi added a comment -

        A similar optimization to save network latency: sync logs to a lagging node in larger batches. I'd guess a batch of 512K or 1MB should be much more efficient.

        Note that this can also work for uncommitted transactions. Imagine this with 3 JNs: Tx1 is committed by JN1 and JN2, QJM is writing Tx2, and JN3 is lagging, so we have Tx1 and Tx2 in its queue. We can send them to JN3 in one batch.

        Implementing this would require more changes to the current code structure, which simply uses a single-threaded executor as the queue; a rough sketch follows.
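
        A rough sketch of the coalescing idea, with hypothetical names (BatchCoalescer, MAX_BATCH_BYTES, and the Queue<byte[]> shape are illustrative, not the current QJM code):

          import java.io.ByteArrayOutputStream;
          import java.util.Queue;

          // Sketch only: drain queued edit batches into one payload under a size cap.
          class BatchCoalescer {
            private static final int MAX_BATCH_BYTES = 1 << 20; // 1 MB cap, per the suggestion above

            static byte[] coalesce(Queue<byte[]> pending) {
              ByteArrayOutputStream out = new ByteArrayOutputStream();
              // Always take at least one batch, even if it alone exceeds the cap.
              if (!pending.isEmpty()) {
                byte[] first = pending.poll();
                out.write(first, 0, first.length);
              }
              // Keep appending whole batches while they still fit under the cap.
              while (!pending.isEmpty()
                  && out.size() + pending.peek().length <= MAX_BATCH_BYTES) {
                byte[] next = pending.poll();
                out.write(next, 0, next.length);
              }
              return out.toByteArray();
            }
          }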

        tlipcon Todd Lipcon added a comment -

        Another good idea, Chao.

        I'm going to quickly implement the simpler idea described in the "description" field of this JIRA, since it's trivial to do so. We can then open another JIRA to actually aggregate larger batches "in the queue" to avoid the round-trip times.

        tlipcon Todd Lipcon added a comment -

        It wasn't easy to figure out how to write a unit test for this change, but I verified as follows:

        • Started a 3-node QJM cluster
        • Ran strace -efdatasync,write -f -p <pid of one JN>
        • Wrote lots of txns to the NN; this showed a lot of fdatasync and write calls, mostly alternating (write a chunk, fsync, write a chunk, fsync, etc.)
        • kill -STOPped that JN for 10-15 seconds
        • kill -CONT'd that JN
        • Saw a bunch of write() calls with no fdatasync calls while it was still catching up; after it caught up, it started syncing again

        I also verified that it caught up much faster with this change in place.
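
        For reference, the manual check above boils down to the following shell steps (the JN pid is a placeholder; strace flags as in the comment, with -p added to attach to the running process):

          # attach to one JournalNode and trace only write/fdatasync syscalls
          strace -f -e trace=write,fdatasync -p <JN_PID>

          # in another shell: pause the JN so it falls behind the quorum, then resume it
          kill -STOP <JN_PID>; sleep 15; kill -CONT <JN_PID>

          # while the JN catches up, the trace shows write() calls with no
          # interleaved fdatasync(); once caught up, fdatasync() resumes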

        atm Aaron T. Myers added a comment -

        +1, patch looks great, and the testing you did sounds good.

        tlipcon Todd Lipcon added a comment -

        Committed to branch, thanks for the review.


          People

          • Assignee: tlipcon Todd Lipcon
          • Reporter: tlipcon Todd Lipcon
          • Votes: 0
          • Watchers: 4
