Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
2.8.0
-
None
-
Reviewed
Description
EntityGroupFSTimelineStore now depends on an RM being up and running; the configuration pointing to it. This is a new change, and impacts testing where you have historically been able to test without an RM running.
The sole purpose of the probe is to automatically determine if an app is running; it falls back to "unknown" if not. If the RM connection was optional, the "unknown" codepath could be called directly, relying on age of file as a metric of completion
Options
- add a flag to disable RM connect
- skip automatically if RM not defined/set to 0.0.0.0
- disable retries on yarn client IPC; if it fails, tag app as unknown.
Attachments
Attachments
Issue Links
- contains
-
YARN-4772 Overloaded leveljb can crash the ATS "pthread lock: Invalid argument"
- Open
- incorporates
-
YARN-4695 EntityGroupFSTimelineStore to not log errors during shutdown
- Resolved
-
YARN-4716 TimelineClient to implement Flushable; propagate to writer
- Resolved
- is related to
-
YARN-4705 ATS 1.5 parse pipeline to consider handling open() events recoverably
- Open