Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-9235

If linux container executor is not set for a GPU cluster GpuResourceHandlerImpl is not initialized and NPE is thrown

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.0.0, 3.1.0
    • Fix Version/s: 3.3.0, 3.2.1, 3.1.3
    • Component/s: yarn
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      If GPU plugin is enabled for the NodeManager, it is possible to run jobs with GPU.

      However, if LinuxContainerExecutor is not configured, an NPE is thrown when calling 

      GpuResourcePlugin.getNMResourceInfo

      Also, there are no warns in the log if GPU is misconfigured like this. 

        Attachments

        1. YARN-9235.001.patch
          3 kB
          Antal Bálint Steinbach
        2. YARN-9235.002.patch
          6 kB
          Antal Bálint Steinbach
        3. YARN-9235.003.patch
          6 kB
          Antal Bálint Steinbach
        4. YARN-9235.004.patch
          6 kB
          Szilard Nemeth
        5. YARN-9235.004.patch
          6 kB
          Antal Bálint Steinbach
        6. YARN-9235.branch-3.1.001.patch
          6 kB
          Szilard Nemeth
        7. YARN-9235.branch-3.2.001.patch
          6 kB
          Szilard Nemeth

        Issue Links

          Activity

            People

            • Assignee:
              adam.antal Adam Antal
              Reporter:
              bsteinbach Antal Bálint Steinbach

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment