  Slider / SLIDER-1233

Lost nodes should not contribute to container failures

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: Slider 1.0.0
    • Component/s: core
    • Labels:
      None

      Description

      If a container completes because its NodeManager (NM) was lost, the exit should not count towards the container failures that can eventually cause the AM to fail the application. We already use a ContainerOutcome of Completed (rather than Failed) for this type of container exit, so only the failure counting for that case needs to change. The other exit types associated with Completed are killed by the AM, killed by the RM, and killed after app completion; none of these needs to contribute to container failures either.
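
      The counting rule described above could be sketched roughly as follows. This is an illustration only, not the code from the attached patches: the class FailureCountingSketch, the ExitReason and Outcome enums, and the threshold semantics are all hypothetical names and assumptions made up for this example, and they do not mirror Slider's actual classes.

```java
import java.util.EnumMap;
import java.util.Map;

/** Hypothetical sketch: only exits whose outcome is FAILED count as failures. */
class FailureCountingSketch {

    /** Assumed outcome categories, loosely following the description. */
    enum Outcome { COMPLETED, FAILED }

    /** Assumed exit reasons; the names are illustrative, not Slider's. */
    enum ExitReason {
        NODE_LOST(Outcome.COMPLETED),
        KILLED_BY_AM(Outcome.COMPLETED),
        KILLED_BY_RM(Outcome.COMPLETED),
        KILLED_AFTER_APP_COMPLETION(Outcome.COMPLETED),
        NONZERO_EXIT(Outcome.FAILED);

        final Outcome outcome;

        ExitReason(Outcome outcome) {
            this.outcome = outcome;
        }
    }

    private final int failureThreshold;
    private int failures;
    // Every exit is still recorded per reason, so lost nodes remain visible.
    private final Map<ExitReason, Integer> exitsByReason = new EnumMap<>(ExitReason.class);

    FailureCountingSketch(int failureThreshold) {
        this.failureThreshold = failureThreshold;
    }

    /**
     * Records a container exit.
     *
     * @return true once the failure count exceeds the threshold, which is the
     *         point where this sketch assumes the application would be failed
     */
    boolean onContainerExit(ExitReason reason) {
        exitsByReason.merge(reason, 1, Integer::sum);
        if (reason.outcome == Outcome.FAILED) {
            failures++;
        }
        return failures > failureThreshold;
    }

    int failureCount() {
        return failures;
    }

    int exitCount(ExitReason reason) {
        return exitsByReason.getOrDefault(reason, 0);
    }

    public static void main(String[] args) {
        FailureCountingSketch counter = new FailureCountingSketch(1);
        counter.onContainerExit(ExitReason.NODE_LOST);
        counter.onContainerExit(ExitReason.NONZERO_EXIT);
        System.out.println("failures counted: " + counter.failureCount());
        System.out.println("node-lost exits seen: " + counter.exitCount(ExitReason.NODE_LOST));
    }
}
```

      Keeping a per-reason tally separate from the failure counter is one possible design choice here: lost-node exits stay observable for diagnostics without pushing the application towards the failure threshold.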

        Attachments

        1. SLIDER-1233.001.patch
          3 kB
          Billie Rinaldi
        2. SLIDER-1233.002.patch
          3 kB
          Billie Rinaldi

              People

              • Assignee:
                Billie Rinaldi
              • Reporter:
                Billie Rinaldi