Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
During test of ozone make one of DN down, its observed,
- HeartBeat timeout and retry every second
- Close ratis pipeline, it retry every 10 millisecond even follower node not present
On Failure path, there should not be aggressive retry,
Performance Impact on Leader DN where one of DN is down, this will overload the DN with very frequent retry and pilling of netty objects.
Attachments
Attachments
Issue Links
- causes
-
HDDS-8299 Disk full situation on a leader DN may result in followers getting stuck in a retry loop
- Resolved
- is duplicated by
-
HDDS-8297 Do not close open pipelines and open containers when a node becomes stale
- Resolved
- relates to
-
RATIS-1945 INCONSISTENCY replies should be counted as errors.
- Resolved
- links to