Details
Description
Recently, we have observed serious event blocking in RM event dispatcher queue. After analysis of RM event monitoring data and RM event processing logic, we found that
1) environment: a cluster with thousands of nodes
2) RMNodeStatusEvent dominates 90% time consumption of RM event scheduler
3) Meanwhile, RM event processing is in a single-thread mode, and It results in the low headroom of RM event scheduler, thus performance of RM.
So we proposed a RM multi-thread event processing mechanism to improve RM performance.
Attachments
Attachments
Issue Links
- Blocked
-
YARN-10784 Optimize RMNodeStatusEvent process performance
- Resolved
- duplicates
-
YARN-9926 RM multi-thread event processing mechanism
- Resolved