  1. Hadoop YARN
  2. YARN-9370

Better logging in recoverAssignedGpus in class GpuResourceAllocator

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.3.0
    • Component/s: None

    Description

      The last line of org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.gpu.GpuResourceAllocator#recoverAssignedGpus is this:

      usedDevices.put(gpuDevice, containerId);
      

      We should add an info-level log (or, failing that, at least a debug-level log) that records which container each GPU device is assigned to during recovery.
      Please also review the rest of the recovery-related code; there may be room for further logging improvements.
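
The request above could be sketched roughly as follows. This is a minimal illustration, not the actual Hadoop code or the committed patch: the class `GpuRecoverySketch`, its method signature, the use of a plain integer as the GPU device, and the message wording are all assumptions, and `java.util.logging` stands in for whatever logging framework the NodeManager uses so that the example stays self-contained.

```java
import java.util.Map;
import java.util.TreeMap;
import java.util.logging.Logger;

// Hypothetical stand-in for the allocator; not the real GpuResourceAllocator.
class GpuRecoverySketch {
  private static final Logger LOG =
      Logger.getLogger(GpuRecoverySketch.class.getName());

  // Maps a GPU device (represented here only by its index) to the
  // ID of the container that uses it.
  private final Map<Integer, String> usedDevices = new TreeMap<>();

  // Records the recovered assignment and logs it at info level.
  // Returns the logged message so that callers can inspect it.
  String recoverAssignedGpu(int gpuIndex, String containerId) {
    usedDevices.put(gpuIndex, containerId);
    String message = String.format(
        "Recovered assignment: GPU device %d is used by container %s",
        gpuIndex, containerId);
    LOG.info(message);
    return message;
  }

  Map<Integer, String> getUsedDevices() {
    return usedDevices;
  }
}
```

Logging immediately after the `put` call keeps the message next to the state change it describes, so the NodeManager log shows each device-to-container mapping restored during recovery.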

      Attachments

        1. YARN-9370.006.patch
          2 kB
          Yesha Vora
        2. YARN-9370.005.patch
          2 kB
          Yesha Vora
        3. YARN-9370.004.patch
          2 kB
          Yesha Vora
        4. YARN-9370.003.patch
          2 kB
          Yesha Vora
        5. YARN-9370.002.patch
          2 kB
          Yesha Vora
        6. YARN-9370.001.patch
          2 kB
          Yesha Vora

          People

            Assignee: Yesha Vora (yeshavora)
            Reporter: Szilard Nemeth (snemeth)
