Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-9370

Better logging in recoverAssignedGpus in class GpuResourceAllocator

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.3.0
    • Component/s: None
    • Labels:

      Description

      The last line of org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.gpu.GpuResourceAllocator#recoverAssignedGpus is this:

      usedDevices.put(gpuDevice, containerId);
      

      We should have an info (or if not info, at least a debug) level log to indicate that a container is allocated to a GPU device during recovery.
      Please also check recovery related code, maybe there can be some room for improvement around logging.

        Attachments

        1. YARN-9370.001.patch
          2 kB
          Yesha Vora
        2. YARN-9370.002.patch
          2 kB
          Yesha Vora
        3. YARN-9370.003.patch
          2 kB
          Yesha Vora
        4. YARN-9370.004.patch
          2 kB
          Yesha Vora
        5. YARN-9370.005.patch
          2 kB
          Yesha Vora
        6. YARN-9370.006.patch
          2 kB
          Yesha Vora

          Activity

            People

            • Assignee:
              yeshavora Yesha Vora
              Reporter:
              snemeth Szilard Nemeth
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: