Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-8667

Cleanup symlinks when container restarted by NM to solve issue "find: File system loop detected;" for tar ball artifacts.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • None
    • 3.2.0, 3.1.2
    • None
    • None

    Description

      Service is launched with TAR BALL artifacts. If a container is exited due to any reasons, container relaunch policy try to relaunch the container on same node with same container work space. As a result, container relaunch is keep on failing.

      If container relaunch max-retry policy is disabled, then container never launched in any other nodes also rather it keep on retrying on same node manager which never succeeds.

      Relaunching Container container_e05_1533635581781_0001_01_000002. Remaining retry attempts(after relaunch) : -4816.
      

      There are two issues

      1. Container relaunch is keep on failing
      2. Log message is misleading

      Attachments

        1. YARN-8667.002.patch
          8 kB
          Chandni Singh
        2. YARN-8667.001.patch
          10 kB
          Chandni Singh

        Activity

          People

            csingh Chandni Singh
            rohithsharma Rohith Sharma K S
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: