  1. Hadoop YARN
  2. YARN-1686

NodeManager.resyncWithRM() does not handle exceptions, which causes the NodeManager to hang.

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.3.0
    • Fix Version/s: 2.4.0
    • Component/s: nodemanager
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      During NodeManager startup, if registration with the ResourceManager throws an exception, the NodeManager shuts down.

      Consider the case where NM-1 is registered with the RM and the RM issues a Resync to the NM. If any exception is thrown in "resyncWithRM" (which starts a new thread that does not handle exceptions) during the RESYNC event, that thread is lost and the NodeManager hangs.
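
The failure mode described above can be illustrated with a minimal sketch in plain Java, with no Hadoop dependencies. All names here (ResyncSketch, reRegister, resyncUnhandled, resyncHandled, waitFor) are hypothetical stand-ins and not the actual NodeManager code; the attached patches may handle the exception differently.

```java
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical stand-in for the NodeManager; none of these names are Hadoop APIs.
class ResyncSketch {
    // Set when the node decides to shut itself down.
    private final AtomicBoolean shutDownRequested = new AtomicBoolean(false);

    // Stand-in for re-registering with the RM; throws when 'fail' is true.
    private void reRegister(boolean fail) {
        if (fail) {
            throw new IllegalStateException("registration with RM failed");
        }
    }

    // Problematic pattern: the exception escapes run(), the thread dies,
    // and nothing else in the process reacts to the failed resync.
    Thread resyncUnhandled(boolean fail) {
        Thread t = new Thread(() -> reRegister(fail));
        t.start();
        return t;
    }

    // Defensive pattern: catch the failure inside the thread and request shutdown.
    Thread resyncHandled(boolean fail) {
        Thread t = new Thread(() -> {
            try {
                reRegister(fail);
            } catch (RuntimeException e) {
                System.err.println("Resync failed, shutting down: " + e.getMessage());
                shutDownRequested.set(true);
            }
        });
        t.start();
        return t;
    }

    boolean isShutDownRequested() {
        return shutDownRequested.get();
    }

    // Waits for a resync thread to finish without exposing a checked exception.
    static void waitFor(Thread t) {
        try {
            t.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
    }
}
```

In this sketch, a failed resyncUnhandled call leaves the shutdown flag unset, so the process stays up without a working registration, while resyncHandled turns the same failure into an explicit shutdown request.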

        Attachments

        1. YARN-1686.3.patch
          5 kB
          Vinod Kumar Vavilapalli
        2. YARN-1686.2.patch
          5 kB
          Rohith Sharma K S
        3. YARN-1686.1.patch
          5 kB
          Rohith Sharma K S

            People

            • Assignee:
              rohithsharma Rohith Sharma K S
              Reporter:
              rohithsharma Rohith Sharma K S
            • Votes:
              0
              Watchers:
              7
