Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-541

getAllocatedContainers() is not returning all the allocated containers

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Fixed
    • 2.0.3-alpha
    • 2.1.0-beta
    • resourcemanager
    • None
    • Redhat Linux 64-bit

    • Reviewed

    Description

      I am running an application that was written and working well with the hadoop-2.0.0-alpha but when I am running the same against 2.0.3-alpha, the getAllocatedContainers() method called on AMResponse is not returning all the containers allocated sometimes. For example, I request for 10 containers and this method gives me only 9 containers sometimes, and when I looked at the log of Resource Manager, the 10th container is also allocated. It happens only sometimes randomly and works fine all other times. If I send one more request for the remaining container to RM after it failed to give them the first time(and before releasing already acquired ones), it could allocate that container. I am running only one application at a time, but 1000s of them one after another.

      My main worry is, even though the RM's log is saying that all 10 requested containers are allocated, the getAllocatedContainers() method is not returning me all of them, it returned only 9 surprisingly. I never saw this kind of issue in the previous version, i.e. hadoop-2.0.0-alpha.

      Thanks,
      Kishore

      Attachments

        1. YARN-541.1.patch
          1 kB
          Bikas Saha
        2. yarn-dsadm-resourcemanager-isredeng.out
          587 kB
          Krishna Kishore Bonagiri
        3. yarn-dsadm-nodemanager-isredeng.out
          1.11 MB
          Krishna Kishore Bonagiri
        4. AppMaster.stdout
          100 kB
          Krishna Kishore Bonagiri

        Activity

          People

            bikassaha Bikas Saha
            write2kishore Krishna Kishore Bonagiri
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: