Hadoop YARN / YARN-201

CapacityScheduler can take a very long time to schedule containers if requests are off cluster

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 0.23.3, 2.0.1-alpha
    • Fix Version/s: 2.0.3-alpha, 0.23.5
    • Component/s: capacityscheduler
    • Labels:
      None

      Description

      When a job reads a large input file that lives on another cluster, it can create many splits whose preferred nodes are not part of the current cluster, so no computation can ever be placed on them. The off-switch delay logic in LeafQueue can then cause the ResourceManager to allocate containers for the job very slowly. In one case the job received only one container every 23 seconds even though the queue had plenty of spare capacity.
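
      To illustrate why an off-switch delay can throttle such a job, here is a minimal sketch of a delay-scheduling gate. It is a simplified model, not the actual LeafQueue code: the class name, method names, and the exact formula are assumptions made for illustration. The sketch assumes that an off-switch container is handed out only after the application has missed more scheduling opportunities than a threshold derived from its outstanding requests, and that the threshold grows with the number of requested locations relative to the cluster size.

```java
// Hypothetical, simplified model of an off-switch delay gate.
// None of these names are taken from the Hadoop source.
final class OffSwitchDelaySketch {

    /** Ratio of requested locations to cluster nodes, capped at 1 (assumed formula). */
    static float localityWaitFactor(int requestedLocations, int clusterNodes) {
        return Math.min((float) requestedLocations / clusterNodes, 1.0f);
    }

    /** The gate opens only once enough scheduling opportunities have been missed. */
    static boolean canAssignOffSwitch(long missedOpportunities,
                                      long requiredContainers,
                                      float waitFactor) {
        return requiredContainers * waitFactor < missedOpportunities;
    }

    /** Counts how many opportunities must be missed before the gate opens. */
    static long opportunitiesUntilOffSwitch(long requiredContainers,
                                            int requestedLocations,
                                            int clusterNodes) {
        float factor = localityWaitFactor(requestedLocations, clusterNodes);
        long missed = 0;
        while (!canAssignOffSwitch(missed, requiredContainers, factor)) {
            missed++; // a node offered resources, but no local match existed
        }
        return missed;
    }
}
```

      Under these assumptions, a job with 1000 pending containers whose requested locations outnumber the cluster's nodes must skip 1001 scheduling opportunities before one off-switch container is granted. If the missed-opportunity counter is reset after each allocation (also an assumption here), every container pays that wait again, which is consistent with the slow allocation rate described above.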

      1. YARN-201.patch
        4 kB
        Jason Lowe
      2. YARN-201.patch
        4 kB
        Jason Lowe


          People

          • Assignee:
            Jason Lowe
          • Reporter:
            Jason Lowe
          • Votes:
            0
          • Watchers:
            10

            Dates

            • Created:
              Updated:
              Resolved:
