[HDFS-4238] [HA] Standby namenode should not do purging of shared storage edits. - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 2.0.2-alpha, 3.0.0-alpha1
Fix Version/s: 2.0.3-alpha
Component/s: ha
Labels:
None

Hadoop Flags:

Reviewed

Description

This happened in our cluster,

>> Standby NN was keep doing checkpoint every one hour and uploading to Active NN was continuously failing due to some kerberos issue and nobody noticed this, since Active was servicing properly.

>> Active NN was up for long time with fsimage having very least transaction.

>> Standby NN has saved the checkpoint in its name dir and purged the txns > 1000000 from shared storage ( includes edits which are not present in Active NN's fsimage)

>> After some time Active NN is restarted and StandBy NN switched to Active.

Now current Standby not able to load any edits from shared storage, as expected edits are not present in shared storage. Its keep running idle.

So editLog.purgeLogsOlderThan(purgeLogsFrom); always should be called from Active NameNode.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

hdfs-4238.txt
04/Dec/12 00:23
7 kB
Todd Lipcon
hdfs-4238.txt
04/Dec/12 01:19
6 kB
Todd Lipcon

Activity

People

Assignee:: Todd Lipcon

Reporter:: Vinayakumar B

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 29/Nov/12 10:52

Updated:: 12/May/16 18:12

Resolved:: 05/Dec/12 21:19