Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-209

Capacity scheduler doesn't trigger app-activation after adding nodes

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 2.1.0-beta
    • capacityscheduler
    • None
    • Reviewed

    Description

      Say application A is submitted but at that time it does not meet the bar for activation because of resource limit settings for applications. After that if more hardware is added to the system and the application becomes valid it still remains in pending state, likely forever.
      This might be rare to hit in real life because enough NM's heartbeat to the RM before applications can get submitted. But a change in settings or heartbeat interval might make it easier to repro. In RM restart scenarios, this will likely hit more if its implemented by re-playing events and re-submitting applications to the scheduler before the RPC to NM's is activated.

      Attachments

        1. YARN-209-test.patch
          2 kB
          Bikas Saha
        2. YARN-209.4.patch
          7 kB
          Zhijie Shen
        3. YARN-209.3.patch
          8 kB
          Zhijie Shen
        4. YARN-209.2.patch
          3 kB
          Zhijie Shen
        5. YARN-209.1.patch
          1 kB
          Bikas Saha

        Issue Links

          Activity

            People

              zjshen Zhijie Shen
              bikassaha Bikas Saha
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: