Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Done
-
None
Description
Currently redundant task manager check in DeclarativeSlotManager only compare free slots with required redundant slots.
when there are no enough resources in YARN/Kubernetes, this mechanism will always try to request new task manager.
there are two way to address this.
1. maintain the state of redundant workers to avoid request twice
2. only try to request redundant workers when there is no pending worker
The first way will make the logic of redundant worker too complicated, I would like to choose the second way
Looking forward to any suggestion.
Attachments
Attachments
Issue Links
- links to