Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 0.23.3, 2.0.0-alpha, 3.0.0
    • Fix Version/s: 0.23.3, 2.0.2-alpha
    • Component/s: None
    • Labels:
      None

      Description

      The MR AM always assumes that it is being killed by the RM when it receives a kill signal before it has finished processing. In reality, the RM sends a kill signal only when the client cannot communicate directly with the AM, which probably means that the AM is already in a bad state. The much more common case is that the node has been marked as unhealthy or decommissioned.

      I propose that, in the short term, the AM clean up only if one of the following holds:

      1. The process has been asked by the client to exit (kill).
      2. The job has finished cleanly and the process is already exiting.
      3. This is the last of the AM's retries.

      The downside is that, in some cases, a kill from the RM will leak the .staging directory and the job will not show up in the history server. This would remain the case at least until the full set of AM cleanup issues can be addressed, probably as part of MAPREDUCE-4428.
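
      The three proposed conditions can be sketched as a single predicate. This is only an illustration of the decision described above, not the code in the attached patch: the class name AmCleanupPolicy, the method shouldCleanUp, and its three boolean parameters are all hypothetical.

```java
// Hypothetical sketch of the proposed short-term cleanup rule.
// None of these names come from the Hadoop code base or the attached patch.
public class AmCleanupPolicy {

    /**
     * Decides whether the AM should clean up (for example, delete the
     * .staging directory) when it is shutting down.
     *
     * @param killedByClient     the client explicitly asked the job to exit
     * @param jobFinishedCleanly the job completed and the AM is already exiting
     * @param isLastRetry        this AM attempt is the last allowed retry
     */
    public static boolean shouldCleanUp(boolean killedByClient,
                                        boolean jobFinishedCleanly,
                                        boolean isLastRetry) {
        // Any one of the three conditions is sufficient. A bare kill signal
        // on an earlier attempt (e.g. an unhealthy or decommissioned node)
        // matches none of them, so state is kept for the next attempt.
        return killedByClient || jobFinishedCleanly || isLastRetry;
    }

    public static void main(String[] args) {
        // Kill signal on an earlier attempt, no client request: do not clean up.
        System.out.println(shouldCleanUp(false, false, false)); // false
        // Client asked for the kill: clean up.
        System.out.println(shouldCleanUp(true, false, false));  // true
        // Job finished cleanly: clean up.
        System.out.println(shouldCleanUp(false, true, false));  // true
        // Kill signal on the last retry: clean up.
        System.out.println(shouldCleanUp(false, false, true));  // true
    }
}
```

      Under this sketch, a kill signal that arrives on an earlier attempt without a client request leaves the staging data in place, so that a later attempt can still use it.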

      1. MR-4611.txt
        14 kB
        Robert Joseph Evans

        Activity

        Robert Joseph Evans created issue -
        Robert Joseph Evans made changes -
        Field Original Value New Value
        Attachment MR-4611.txt [ 12543142 ]
        Robert Joseph Evans made changes -
        Status Open [ 1 ] Patch Available [ 10002 ]
        Robert Joseph Evans made changes -
        Priority Major [ 3 ] Critical [ 2 ]
        Thomas Graves made changes -
        Status Patch Available [ 10002 ] Resolved [ 5 ]
        Fix Version/s 0.23.3 [ 12320060 ]
        Fix Version/s 3.0.0 [ 12320355 ]
        Fix Version/s 2.2.0-alpha [ 12322471 ]
        Resolution Fixed [ 1 ]
        Arun C Murthy made changes -
        Fix Version/s 3.0.0 [ 12320355 ]
        Arun C Murthy made changes -
        Status Resolved [ 5 ] Closed [ 6 ]

          People

          • Assignee:
            Robert Joseph Evans
            Reporter:
            Robert Joseph Evans
          • Votes:
            0
            Watchers:
            6

            Dates

            • Created:
              Updated:
              Resolved: