Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-4217

Failed AM attempt retries on same failed host

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 2.7.1
    • Fix Version/s: None
    • Component/s: applications
    • Labels:
      None

      Description

      This happens when the cluster is maxed out. One node is going bad, so everything that happens on it fails, so the bad node is never busy. Since the cluster is maxed out, when the RM looks for a node with available resources, it will always find the almost bad one because nothing can run on it so it has available resources.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                epayne Eric Payne
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: