Hadoop YARN / YARN-5078

[Umbrella] NodeManager health checker improvements


Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: nodemanager

    Description

      There have been a bunch of NodeManager health checker improvement requests in the past.

      Initially, I expect we just need to add some base functionality. The most obvious parts are:

      • Finding appropriate measurements of health
      • Storing measurements as metrics. This should allow easy comparison of good nodes and bad nodes. This should eventually lead to threshold blacklisting/whitelisting.
      • Adding metrics to the NodeManager UI
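
      As a rough illustration of the threshold idea in the list above, the sketch below compares per-node measurements against upper limits and labels a node healthy or unhealthy. It is only a sketch under stated assumptions: the class NodeHealthSketch, the Verdict enum, the classify method, and the metric names and limits are all hypothetical and are not part of the NodeManager or any existing YARN API.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch only; none of these names exist in YARN. */
class NodeHealthSketch {

    enum Verdict { HEALTHY, UNHEALTHY }

    /**
     * Returns UNHEALTHY if any recorded metric exceeds its upper limit.
     * Metrics without a configured limit are ignored, and a limit with no
     * recorded measurement is treated as "no evidence of a problem".
     */
    static Verdict classify(Map<String, Double> metrics,
                            Map<String, Double> maxAllowed) {
        for (Map.Entry<String, Double> limit : maxAllowed.entrySet()) {
            Double value = metrics.get(limit.getKey());
            if (value != null && value > limit.getValue()) {
                return Verdict.UNHEALTHY;
            }
        }
        return Verdict.HEALTHY;
    }

    public static void main(String[] args) {
        // Assumed metric names and limits, chosen purely for illustration.
        Map<String, Double> limits = new HashMap<>();
        limits.put("diskUsedPercent", 90.0);
        limits.put("containerLaunchFailuresPerHour", 5.0);

        Map<String, Double> goodNode = new HashMap<>();
        goodNode.put("diskUsedPercent", 42.0);
        goodNode.put("containerLaunchFailuresPerHour", 0.0);

        Map<String, Double> badNode = new HashMap<>();
        badNode.put("diskUsedPercent", 97.5);
        badNode.put("containerLaunchFailuresPerHour", 1.0);

        System.out.println("good node: " + classify(goodNode, limits));
        System.out.println("bad node: " + classify(badNode, limits));
    }
}
```

      Keeping the limits in a map separate from the measurements means the same stored metrics can be compared across good and bad nodes first, and the thresholds tuned afterwards.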

      After this basic functionality is added, we can start considering enhanced forms of NodeManager health status conditions.

          People

            Assignee: Ray Chiang (rchiang)
            Reporter: Ray Chiang (rchiang)
