[HBASE-22784] OldWALs not cleared in a replication slave cluster (cyclic replication bw 2 clusters) - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Blocker
Resolution: Fixed
Affects Version/s: 1.4.9, 1.4.10
Fix Version/s: 1.5.0, 1.4.11
Component/s: regionserver, Replication
Labels:
None

Hadoop Flags:

Reviewed

Description

When a cluster is passive (receiving edits only via replication) in a cyclic replication setup of 2 clusters, OldWALs size keeps on growing. On analysing, we observed the following behaviour.

New entry is added to WAL (Edit replicated from other cluster).
ReplicationSourceWALReaderThread(RSWALRT) reads and applies the configured filters (due to cyclic replication setup, ClusterMarkingEntryFilter discards new entry from other cluster).
Entry is null, RSWALRT neither updates the batch stats (WALEntryBatch.lastWalPosition) nor puts it in the entryBatchQueue.
ReplicationSource thread is blocked in entryBachQueue.take().
So ReplicationSource#updateLogPosition has never invoked and WAL file is never cleared from ReplicationQueue.
Hence LogCleaner on the master, doesn't deletes the oldWAL files from hadoop.

NOTE: When a new edit is added via hbase-client, ReplicationSource thread process and clears the oldWAL files from replication queues and hence master cleans up the WALs

Please provide us a solution

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HBASE-22784.branch-1.004.patch
09/Aug/19 10:02
10 kB
Wellington Chevreuil
HBASE-22784.branch-1.003.patch
08/Aug/19 23:40
10 kB
Wellington Chevreuil
HBASE-22784.branch-1.002.patch
08/Aug/19 14:52
7 kB
Wellington Chevreuil
HBASE-22784.branch-1.001.patch
08/Aug/19 11:20
5 kB
Wellington Chevreuil

Issue Links

is related to

HBASE-23169 Random region server aborts while clearing Old Wals

Open

relates to

HBASE-23205 Correctly update the position of WALs currently being replicated.

Resolved

Activity

People

Assignee:: Wellington Chevreuil

Reporter:: Solvannan R M

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 02/Aug/19 16:15

Updated:: 23/Oct/19 12:03

Resolved:: 14/Oct/19 03:38