Details

    • Type: New Feature
    • Status: Patch Available
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.7.1
    • Fix Version/s: None
    • Component/s: nodemanager
    • Labels:
      None
    • Target Version/s:

      Description

      An unmanaged container (also called a leaked container) is a container that is no longer managed by the NM. Consequently, it cannot be managed by YARN either.
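To make the definition concrete, here is a minimal sketch that treats an unmanaged container as one that is still running on the node but absent from the set the NM tracks. The function name `find_unmanaged` and the container IDs are hypothetical and chosen only for illustration; this is not an existing YARN API, and how the two sets would be collected on a real node is left open.

```python
def find_unmanaged(running_on_node, tracked_by_nm):
    """Return IDs of containers running on the node that the NM does not track.

    Both arguments are iterables of container ID strings (hypothetical inputs).
    """
    # Set difference: running on the node, but unknown to the NM.
    return sorted(set(running_on_node) - set(tracked_by_nm))


# Hypothetical container IDs, for illustration only.
running = ["container_01", "container_02", "container_03"]
tracked = ["container_01", "container_03"]

unmanaged = find_unmanaged(running, tracked)
print(unmanaged)  # ['container_02']
```

In this sketch, an NM that lost its local state would have an empty tracked set, so every container still running on the node would be reported as unmanaged.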

      There are many cases in which a YARN-managed container can become unmanaged, such as:

      • The NM service is disabled or removed on the node.
      • The NM is unable to start up again on the node, for example because a configuration or resource it depends on is not ready.
      • The NM local leveldb store is corrupted or lost, for example due to bad disk sectors.
      • The NM has bugs, such as wrongly marking a live container as complete.

      Note that these cases are caused, or made worse, when work-preserving NM restart is enabled; see YARN-1336.

      Bad impacts of unmanaged containers include:

      1. YARN cannot manage resources on the node:
        • This causes a YARN resource leak on the node.
        • The container cannot be killed to release its YARN resources and free them up for other urgent computations on the node.
      2. Container and App killing is not eventually consistent for the App user:
        • An App which has bugs can still have bad impacts on the outside world long after the App has been killed.
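The resource leak described in the list above can be illustrated with simple arithmetic. This is only a sketch under stated assumptions: the memory figures, function names, and the idea that schedulable memory is computed purely from managed containers are all hypothetical, not taken from the YARN code base.

```python
def believed_free_mb(node_total_mb, managed_mb):
    """Memory that appears free when only managed containers are counted."""
    return node_total_mb - sum(managed_mb)


def actual_free_mb(node_total_mb, managed_mb, unmanaged_mb):
    """Memory that is really free once unmanaged containers are counted too."""
    return node_total_mb - sum(managed_mb) - sum(unmanaged_mb)


# Hypothetical node: 64 GB total, two managed containers, one unmanaged container.
node_total = 65536
managed = [8192, 8192]
unmanaged = [16384]

believed = believed_free_mb(node_total, managed)
actual = actual_free_mb(node_total, managed, unmanaged)
leaked = believed - actual

print(believed, actual, leaked)  # 49152 32768 16384
```

Under these assumptions, 16384 MB is occupied by a container that can neither be accounted for nor killed through YARN, which is the leak the first impact refers to.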

              People

              • Assignee:
                yqwang Yuqi Wang
                Reporter:
                yqwang Yuqi Wang
              • Votes:
                0
                Watchers:
                13
