Details
-
Bug
-
Status: Resolved
-
Blocker
-
Resolution: Fixed
-
None
-
None
Description
We should start scrubbing pipelines after SCM is out of safe mode.
Reasons to do this:
- Right now, we do scrub pipeline as part of triggerPipelineCreation, now when we scrub pipelines in allocated state for more than "ozone.scm.pipeline.allocated.timeout", we might close some pipelines and with this, we might not be able to come out of safeMode. As in SafeModeRules, we get pipeline count from pipelineDB during initialization.
Example scenario:
- Stop 3 Datanodes.
- Restart SCM.
- Start Datanode after 6 mts. We shall never come out of safe mode, as pipeline in allocated state will meet scrubber time out condition.
To not to be in these kinds of scenarios, better thing to be done here is scrub pipelines after SCM out of the safe mode
Attachments
Issue Links
- links to