[HELIX-683] Clean monitoring cache upon helix controller enable monitoring - ASF JIRA

Details

Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels: None

Description

We found a bug in cluster status reporting: the partition masterless duration can be reported incorrectly.

The root cause is that the duration is calculated from the controller cache, and this cache is currently not cleared when controller leadership changes. As a result, if controller A starts a mastership handoff but is interrupted, the start time stays in the cache until the next mastership handoff on the same partition. The duration of that later handoff is then calculated from the stale start time, so the reported value can be extremely large.

To fix it, we might consider clearing the cache when leadership changes.
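
A minimal sketch of the problem and the proposed fix follows. It is not the actual Helix code: the class `MasterlessDurationTracker` and its methods are hypothetical names made up for illustration, and the cache is modeled as a simple map from partition name to handoff start time. It assumes the cache keeps an existing start time when a new handoff begins, which is what lets a stale entry skew the next measurement.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical illustration only; not a class from the Helix code base. */
public class MasterlessDurationTracker {
    // Partition name -> time (ms) at which the partition became masterless.
    private final Map<String, Long> handoffStartMs = new HashMap<>();

    /** Records the start of a handoff; an already cached start time is kept. */
    public void onHandoffStart(String partition, long nowMs) {
        handoffStartMs.putIfAbsent(partition, nowMs);
    }

    /** Returns the masterless duration in ms, or -1 if no start time is cached. */
    public long onHandoffComplete(String partition, long nowMs) {
        Long start = handoffStartMs.remove(partition);
        return start == null ? -1L : nowMs - start;
    }

    /** Proposed fix: drop all cached start times when controller leadership changes. */
    public void onLeadershipChange() {
        handoffStartMs.clear();
    }

    public static void main(String[] args) {
        MasterlessDurationTracker tracker = new MasterlessDurationTracker();
        tracker.onHandoffStart("partition_0", 1_000L);   // interrupted, never completes
        tracker.onLeadershipChange();                    // stale start time is dropped
        tracker.onHandoffStart("partition_0", 60_000L);  // a later, unrelated handoff
        System.out.println(tracker.onHandoffComplete("partition_0", 60_200L)); // prints 200
    }
}
```

Without the call to `onLeadershipChange()`, the same sequence would report 59200 ms, because the duration would be measured from the interrupted handoff's start time.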

Issue Links

links to

GitHub Pull Request #162

People

Assignee: Unassigned

Reporter: Harry Zhang

Votes: 0

Watchers: 3

Dates

Created: 26/Mar/18 19:10

Updated: 26/Mar/18 22:53