Details
- New Feature
- Status: Resolved
- Major
- Resolution: Fixed
- 2.8.0
- None
- None
Description
The new JVM pause monitor has been written with its own start/stop lifecycle, which has already proven brittle to the ordering of operations and, even after HADOOP-12313, is not thread safe (both start and stop are potentially re-entrant).
It also requires every class which supports the monitor to add another field and perform the monitor's lifecycle operations within its own lifecycle, which, for all YARN services, is the YARN app lifecycle (as implemented in Hadoop common).
Making the monitor a subclass of AbstractService and moving the init/start and stop operations into the serviceInit(), serviceStart() and serviceStop() methods will fix the concurrency and state model issues. It will also make the monitor trivial to add as a child of any YARN service which subclasses CompositeService: most of the NM and RM apps will be able to hook up the monitor simply by creating one in the constructor and adding it as a child.
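The proposal above can be sketched in plain Java. This is a minimal illustration, not the actual patch: SketchService, SketchCompositeService, SketchPauseMonitor and SketchNodeManager are hypothetical stand-ins that only mimic the shape of Hadoop's AbstractService and CompositeService, so that the example compiles without Hadoop on the classpath. It assumes that lifecycle transitions are serialized by a lock in the base class and that repeated start() or stop() calls should be no-ops.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for a service base class; not Hadoop's AbstractService. */
class SketchService {
    enum State { NOTINITED, INITED, STARTED, STOPPED }
    private State state = State.NOTINITED;

    // Transitions are synchronized, so subclasses need no locking of their own.
    public final synchronized void init() {
        if (state != State.NOTINITED) return;
        serviceInit();
        state = State.INITED;
    }
    public final synchronized void start() {
        if (state == State.STARTED) return;            // repeated start is a no-op
        if (state != State.INITED) throw new IllegalStateException("cannot start from " + state);
        serviceStart();
        state = State.STARTED;
    }
    public final synchronized void stop() {
        if (state == State.STOPPED) return;            // repeated stop is a no-op
        state = State.STOPPED;
        serviceStop();                                 // may run even if never started
    }
    public final synchronized State getState() { return state; }

    protected void serviceInit() {}
    protected void serviceStart() {}
    protected void serviceStop() {}
}

/** Hypothetical monitor: the thread handling lives only in the service hooks. */
class SketchPauseMonitor extends SketchService {
    private Thread monitorThread;
    private volatile boolean shouldRun;

    @Override protected void serviceStart() {
        shouldRun = true;
        monitorThread = new Thread(() -> {
            while (shouldRun) {
                try { Thread.sleep(50); } catch (InterruptedException e) { return; }
            }
        }, "sketch-pause-monitor");
        monitorThread.setDaemon(true);
        monitorThread.start();
    }
    @Override protected void serviceStop() {
        shouldRun = false;
        if (monitorThread != null) {                   // tolerate stop() before start()
            monitorThread.interrupt();
            try { monitorThread.join(); } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
    }
    boolean isRunning() { return monitorThread != null && monitorThread.isAlive(); }
}

/** Hypothetical parent that drives its children through the same lifecycle. */
class SketchCompositeService extends SketchService {
    private final List<SketchService> children = new ArrayList<>();
    protected void addService(SketchService child) { children.add(child); }

    @Override protected void serviceInit() { for (SketchService c : children) c.init(); }
    @Override protected void serviceStart() { for (SketchService c : children) c.start(); }
    @Override protected void serviceStop() {
        // Stop children in reverse order of registration.
        for (int i = children.size() - 1; i >= 0; i--) children.get(i).stop();
    }
}

/** Hypothetical daemon: hooking up the monitor is one line in the constructor. */
class SketchNodeManager extends SketchCompositeService {
    final SketchPauseMonitor pauseMonitor = new SketchPauseMonitor();
    SketchNodeManager() { addService(pauseMonitor); }
}
```

With this shape, the parent never calls the monitor's start or stop logic directly. Calling init(), start() and stop() on the SketchNodeManager walks the child through the same transitions, and a stray second stop() or a stop() before start() is absorbed by the base class and the null check rather than by every caller.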
Attachments
Issue Links
- contains
  - HDFS-8947 NameNode, DataNode and NFS gateway to support JvmPauseMonitor as a service (Resolved)
  - MAPREDUCE-6462 JobHistoryServer to support JvmPauseMonitor as a service (Resolved)
- is depended upon by
  - HDFS-8947 NameNode, DataNode and NFS gateway to support JvmPauseMonitor as a service (Resolved)
  - MAPREDUCE-6462 JobHistoryServer to support JvmPauseMonitor as a service (Resolved)
  - YARN-4072 ApplicationHistoryServer, WebAppProxyServer, NodeManager and ResourceManager to support JvmPauseMonitor as a service (Resolved)
- is related to
  - HADOOP-12313 NPE in JvmPauseMonitor when calling stop() before start() (Resolved)
  - HIVE-14150 Hive does not compile against Hadoop-2.9.0-SNAPSHOT (Resolved)