Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: QuorumJournalManager (HDFS-3077)
    • Component/s: ha
    • Labels:
      None

      Description

      Currently, if a logger misses an RPC in the middle of a log segment, or misses the startLogSegment RPC (e.g. it was down or the network was disconnected during that period), it will throw an exception on every subsequent journal() call in that segment, since it knows that it missed some edits in the middle.

      We should change this exception to a specific IOE subclass, and have the client side of QJM detect the situation and stop sending IPCs until the next startLogSegment call.

      This isn't critical for correctness but will help reduce log spew on both sides.
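
      For illustration only, here is a minimal sketch of roughly what such an IOE subclass and the client-side reaction could look like. Only the exception name, JournalOutOfSyncException, is taken from this issue; the enclosing class, field, and method names below are hypothetical and are not the committed patch.

      import java.io.IOException;

      /**
       * Illustrative sketch, not the committed patch: a dedicated IOException
       * subclass plus the client-side reaction described above. Only the
       * exception name comes from this issue; the rest is hypothetical.
       */
      public class OutOfSyncSketch {

        /** Thrown by a JournalNode that has missed edits in the current segment. */
        public static class JournalOutOfSyncException extends IOException {
          private static final long serialVersionUID = 1L;
          public JournalOutOfSyncException(String msg) {
            super(msg);
          }
        }

        /** True once this logger is known to have missed edits in the segment. */
        private boolean outOfSync = false;

        /** Called when a journal() RPC to this logger fails. */
        public void onJournalRpcFailure(IOException e) {
          if (e instanceof JournalOutOfSyncException) {
            // No point sending further edits this segment; they would all fail.
            outOfSync = true;
          }
        }

        /** Checked before each journal() RPC; skip the call while out of sync. */
        public boolean shouldSendEdits() {
          return !outOfSync;
        }

        /** The next segment boundary lets the logger re-join. */
        public void startLogSegment(long txid) {
          outOfSync = false;
          // ... send the startLogSegment RPC to the JournalNode as usual ...
        }
      }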

      Attachments

      1. amend.txt
        2 kB
        Todd Lipcon
      2. hdfs-3726.txt
        13 kB
        Todd Lipcon
      3. hdfs-3726.txt
        15 kB
        Todd Lipcon

        Activity

        Todd Lipcon added a comment -

        Attached patch introduces the improvement as described above.

        There is a new unit test, and I also tested manually as follows:

        • Start cluster configured to write to QJM
        • Start 10 threads performing HDFS transactions
        • Restart one JN

        This used to cause incessant log spew on the console of the restarted JN. With the patch, it resulted in the following on the server side:

        12/09/03 20:55:55 INFO ipc.Server: IPC Server handler 0 on 13001, call org.apache.hadoop.hdfs.qjournal.protocol.QJournalProtocol.journal from 127.0.0.1:47669: error: org.apache.hadoop.hdfs.qjournal.protocol.JournalOutOfSyncException: Can't write, no segment open
        org.apache.hadoop.hdfs.qjournal.protocol.JournalOutOfSyncException: Can't write, no segment open
                at org.apache.hadoop.hdfs.qjournal.server.Journal.checkSync(Journal.java:384)
                at org.apache.hadoop.hdfs.qjournal.server.Journal.journal(Journal.java:278)
                at org.apache.hadoop.hdfs.qjournal.server.JournalNodeRpcServer.journal(JournalNodeRpcServer.java:121)
                at org.apache.hadoop.hdfs.qjournal.protocolPB.QJournalProtocolServerSideTranslatorPB.journal(QJournalProtocolServerSideTranslatorPB.java:111)
                at org.apache.hadoop.hdfs.qjournal.protocol.QJournalProtocolProtos$QJournalProtocolService$2.callBlockingMethod(QJournalProtocolProtos.java:12442)
        

        and on the NN:

        12/09/03 20:55:55 WARN client.QuorumJournalManager: Remote journal Channel to journal node localhost/127.0.0.1:13001 is not in sync. Will retry on next roll.
        

        The web UI noted: "Written txid 33668659 (120608 behind) (will re-join on next segment)"

        Upon the next roll, it logged:

        12/09/03 20:59:09 INFO namenode.FSEditLog: Rolling edit logs.
        12/09/03 20:59:09 INFO namenode.FSEditLog: Ending log segment 33668125
        12/09/03 20:59:09 INFO namenode.FSEditLog: Number of transactions: 133332 Total time for transactions(ms): 2171Number of transactions batched in Syncs: 102034 Number of syncs: 31297 SyncTimes(ms): 31840 9550 7644 
        12/09/03 20:59:09 INFO namenode.FSEditLog: Number of transactions: 133332 Total time for transactions(ms): 2171Number of transactions batched in Syncs: 102034 Number of syncs: 31298 SyncTimes(ms): 31844 9550 7644 
        12/09/03 20:59:11 INFO namenode.FileJournalManager: Finalizing edits file /tmp/name1-name/current/edits_inprogress_0000000000033668125 -> /tmp/name1-name/current/edits_0000000000033668125-0000000000033801456
        12/09/03 20:59:11 INFO namenode.FileJournalManager: Finalizing edits file /tmp/name1-name2/current/edits_inprogress_0000000000033668125 -> /tmp/name1-name2/current/edits_0000000000033668125-0000000000033801456
        12/09/03 20:59:11 INFO namenode.FSEditLog: Starting log segment at 33801457
        12/09/03 20:59:11 INFO client.QuorumJournalManager: Retrying Channel to journal node localhost/127.0.0.1:13001 in new segment starting at txid 33801457
        

        and the restarted JN was up-to-date again.

        Todd Lipcon added a comment -

        Been thinking about this a bit, and I think it might actually make sense to set the "outOfSync" flag on any exception. For example, if the JN has gone down, the client will receive generic IOExceptions. As soon as we miss any edit in the segment due to such an IOE, there is no sense sending more, since they will just result in JournalOutOfSync errors.

        If folks agree, I will make that change.

        I also want to make another small change at the same time: any time one of the JNs throws an exception, we should log it, even if a majority succeeded. That will help cluster administrators diagnose the case when one of the JNs has gone down or is having disk issues. Currently, these types of errors are silent on the client side, which is not so good.
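
        For illustration, a minimal sketch of both tweaks, with hypothetical class, field, and method names (this is not the actual QJM client code):

        import java.io.IOException;
        import java.util.logging.Logger;

        /**
         * Sketch of the two proposed tweaks, with assumed names: treat any RPC
         * failure as out-of-sync, and log every per-JournalNode failure even
         * when a quorum of journals succeeded.
         */
        public class LoggerFailureSketch {
          private static final Logger LOG =
              Logger.getLogger(LoggerFailureSketch.class.getName());

          private boolean outOfSync = false;

          /** Called whenever a journal() RPC to one JournalNode fails. */
          public void onJournalRpcFailure(String journalNodeAddr, IOException e) {
            // Any failure means this logger may have missed an edit in the
            // current segment, so further journal() calls would only produce
            // JournalOutOfSync errors.
            outOfSync = true;

            // Log the failure even if a majority of journals succeeded, so an
            // administrator can tell that one JN is down or has disk trouble.
            LOG.warning("Remote journal " + journalNodeAddr
                + " failed to write edits: " + e + ". Will retry on next roll.");
          }
        }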

        Eli Collins added a comment -

        Setting outOfSync on any exception makes sense to me.

        Todd Lipcon added a comment -

        Here's an updated patch. I ran all the QJM tests and they passed.

        Eli Collins added a comment -

        +1 looks great

        Nit: "excpetion"

        Todd Lipcon added a comment -

        Changed the comment to read:

          /**
           * If this logger misses some edits, or restarts in the middle of
           * a segment, the writer won't be able to write any more edits until
           * the beginning of the next segment. Upon detecting this situation,
           * the writer sets this flag to true to avoid sending useless RPCs.
           */
        

        (which is more accurate given the change made above)

        Will commit momentarily

        Todd Lipcon added a comment -

        Committed to branch, thanks for the review.

        Eli Collins added a comment -

        Oops, the patch needs to include JournalOutOfSyncException, sorry I missed that!

        Todd Lipcon added a comment -

        Woops, here's the patch to add JournalOutOfSyncException.java. Sorry I forgot to git add!

        Eli Collins added a comment -

        +1 thanks!

        Todd Lipcon added a comment -

        Committed the new file to branch as well. Thanks


          People

          • Assignee: Todd Lipcon
          • Reporter: Todd Lipcon
          • Votes: 0
          • Watchers: 3
