Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-1686

NodeManager.resyncWithRM() does not handle exception which cause NodeManger to Hang.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 2.3.0
    • 2.4.0
    • nodemanager
    • None
    • Reviewed

    Description

      During start of NodeManager,if registration with resourcemanager throw exception then nodemager shutdown happens.

      Consider case where NM-1 is registered with RM. RM issued Resync to NM. If any exception thrown in "resyncWithRM" (starts new thread which does not handle exception) during RESYNC evet, then this thread is lost. NodeManger enters hanged state.

      Attachments

        1. YARN-1686.1.patch
          5 kB
          Rohith Sharma K S
        2. YARN-1686.2.patch
          5 kB
          Rohith Sharma K S
        3. YARN-1686.3.patch
          5 kB
          Vinod Kumar Vavilapalli

        Activity

          People

            rohithsharma Rohith Sharma K S
            rohithsharma Rohith Sharma K S
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: