[HDFS-4025] QJM: Sychronize past log segments to JNs that missed them - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: QuorumJournalManager (HDFS-3077)
Fix Version/s: QuorumJournalManager (HDFS-3077), 3.0.0-alpha4
Component/s: ha
Labels:
None

Hadoop Flags:

Reviewed

Description

Currently, if a JournalManager crashes and misses some segment of logs, and then comes back, it will be re-added as a valid part of the quorum on the next log roll. However, it will not have a complete history of log segments (i.e any individual JN may have gaps in its transaction history). This mirrors the behavior of the NameNode when there are multiple local directories specified.

However, it would be better if a background thread noticed these gaps and "filled them in" by grabbing the segments from other JournalNodes. This increases the resilience of the system when JournalNodes get reformatted or otherwise lose their local disk.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HDFS-4025.000.patch
27/Aug/16 01:41
24 kB
Hanisha Koneru
HDFS-4025.001.patch
31/Aug/16 01:21
34 kB
Hanisha Koneru
HDFS-4025.002.patch
01/Sep/16 17:56
35 kB
Hanisha Koneru
HDFS-4025.003.patch
01/Sep/16 20:52
34 kB
Hanisha Koneru
HDFS-4025.004.patch
21/Dec/16 22:37
38 kB
Hanisha Koneru
HDFS-4025.005.patch
22/Dec/16 01:23
39 kB
Hanisha Koneru
HDFS-4025.006.patch
10/Jan/17 21:06
34 kB
Hanisha Koneru
HDFS-4025.007.patch
17/Jan/17 03:12
38 kB
Hanisha Koneru
HDFS-4025.008.patch
26/Jan/17 05:11
46 kB
Hanisha Koneru
HDFS-4025.009.patch
01/Feb/17 01:33
47 kB
Hanisha Koneru
HDFS-4025.010.patch
01/Feb/17 23:29
47 kB
Hanisha Koneru
HDFS-4025.011.patch
03/Feb/17 19:21
47 kB
Hanisha Koneru

Issue Links

blocks

HDFS-10659 Namenode crashes after Journalnode re-installation in an HA cluster due to missing paxos directory

Resolved

is blocked by

HDFS-11273 Move TransferFsImage#doGetUrl function to a Util class

Resolved

is related to

HDFS-12358 Handle IOException when transferring edit log to Journal current dir through JN sync

Resolved

HDFS-12356 Unit test for JournalNode sync during Rolling Upgrade

Resolved

HDFS-14942 Change Log Level to debug in JournalNodeSyncer#syncWithJournalAtIndex

Resolved

relates to

HDFS-11448 JN log segment syncing should support HA upgrade

Resolved

HDFS-11866 JournalNode Sync should be off by default in hdfs-default.xml

Resolved

HDFS-14140 JournalNodeSyncer authentication is failing in secure cluster

Resolved

HDFS-12376 Enable JournalNode Sync by default

Resolved

(4 relates to)

Activity

People

Assignee:: Hanisha Koneru

Reporter:: Todd Lipcon

Votes:: 0 Vote for this issue

Watchers:: 21 Start watching this issue

Dates

Created:: 10/Oct/12 17:48

Updated:: 30/Oct/19 07:33

Resolved:: 23/Feb/17 00:37