  1. Hadoop YARN
  2. YARN-8667

Clean up symlinks when a container is restarted by the NM to fix "find: File system loop detected;" for tarball artifacts.

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.2.0, 3.1.2
    • Component/s: None
    • Labels:
      None

      Description

      A service is launched with tarball artifacts. If a container exits for any reason, the container relaunch policy tries to relaunch it on the same node, reusing the same container work space. As a result, the relaunch keeps failing.
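
      The cleanup named in the issue title can be illustrated with a minimal sketch. It assumes that symlinks left in the container work space by the previous attempt are what trigger the "File system loop detected" error on relaunch. The class and method names (`SymlinkCleanupSketch`, `removeTopLevelSymlinks`) are hypothetical and this is not the code from the attached patches; it only shows one way to remove stale links before a relaunch using `java.nio.file`.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper, not part of the NodeManager code base. */
final class SymlinkCleanupSketch {

  private SymlinkCleanupSketch() {
  }

  /**
   * Deletes the symbolic links that sit directly inside workDir and
   * returns the links that were removed. Link targets are left untouched.
   */
  static List<Path> removeTopLevelSymlinks(Path workDir) throws IOException {
    List<Path> links = new ArrayList<>();

    // First pass: collect the links so nothing is deleted mid-iteration.
    try (DirectoryStream<Path> entries = Files.newDirectoryStream(workDir)) {
      for (Path entry : entries) {
        if (Files.isSymbolicLink(entry)) {
          links.add(entry);
        }
      }
    }

    // Second pass: Files.delete removes the link itself, never its target.
    for (Path link : links) {
      Files.delete(link);
    }
    return links;
  }
}
```

      Calling `SymlinkCleanupSketch.removeTopLevelSymlinks(workDir)` before the launch script recreates its links would leave regular files and the localized tarball contents in place, so the next attempt starts without stale links.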

      If the container relaunch max-retry policy is disabled, the container is never launched on any other node either; instead, the relaunch keeps being retried on the same node manager, where it never succeeds.

      Relaunching Container container_e05_1533635581781_0001_01_000002. Remaining retry attempts(after relaunch) : -4816.
      

      There are two issues:

      1. Container relaunch keeps failing
      2. The log message is misleading
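
      For the second issue, the sketch below shows one way a relaunch message could avoid printing a meaningless negative counter such as the -4816 in the log line above. It assumes the negative value appears because the max-retry limit is disabled, so the counter keeps decreasing with no lower bound. The class name, method name, `retryForever` flag and the alternative wording are all hypothetical; the actual wording chosen in the patches may differ.

```java
/** Hypothetical formatter, not part of the NodeManager code base. */
final class RelaunchLogSketch {

  private RelaunchLogSketch() {
  }

  /**
   * Builds the relaunch log message. When retries are unlimited, the
   * remaining-attempts counter carries no information, so it is omitted.
   */
  static String relaunchMessage(String containerId, int remainingRetries,
      boolean retryForever) {
    StringBuilder sb = new StringBuilder("Relaunching Container ")
        .append(containerId)
        .append('.');
    if (retryForever) {
      // Assumed wording for the unlimited case.
      sb.append(" Retry policy: retry forever.");
    } else {
      sb.append(" Remaining retry attempts(after relaunch) : ")
          .append(remainingRetries)
          .append('.');
    }
    return sb.toString();
  }
}
```

      With a finite limit the message keeps the original shape, so existing log searches for "Remaining retry attempts" would still match.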

        Attachments

        1. YARN-8667.001.patch
          10 kB
          Chandni Singh
        2. YARN-8667.002.patch
          8 kB
          Chandni Singh

            People

            • Assignee:
              csingh Chandni Singh
              Reporter:
              rohithsharma Rohith Sharma K S