Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-11457

NodeManager Resource Leak when handling a container log with colon

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.3.3
    • None
    • nodemanager
    • None

    Description

      NodeManager Leaks the resources when handling a container log with colon. The Illegal file name is not handled and leads to resource leak at NodeManager side.

       

      2023-03-14 11:03:53,390 WARN org.apache.hadoop.util.concurrent.ExecutorHelper (ContainersLauncher #2683): Caught exception in thread ContainersLauncher #2683: 
      java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative path in absolute URI: taskmanager.log.2023-03-14 09:44-1
      	at org.apache.hadoop.fs.Path.initialize(Path.java:263)
      	at org.apache.hadoop.fs.Path.<init>(Path.java:221)
      	at org.apache.hadoop.fs.Path.<init>(Path.java:129)
      	at org.apache.hadoop.fs.Globber.doGlob(Globber.java:270)
      	at org.apache.hadoop.fs.Globber.glob(Globber.java:149)
      	at org.apache.hadoop.fs.FileSystem.globStatus(FileSystem.java:2096)
      	at org.apache.hadoop.fs.FileSystem.globStatus(FileSystem.java:2078)
      	at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.handleContainerExitWithFailure(ContainerLaunch.java:653)
      	at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.handleContainerExitCode(ContainerLaunch.java:593)
      	at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:337)
      	at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:101)
      	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
      	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
      	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
      	at java.lang.Thread.run(Thread.java:750)
      Caused by: java.net.URISyntaxException: Relative path in absolute URI: taskmanager.log.2023-03-14 09:44-1
      	at java.net.URI.checkPath(URI.java:1823)
      	at java.net.URI.<init>(URI.java:745)
      	at org.apache.hadoop.fs.Path.initialize(Path.java:260)
      	... 14 more 
      

      NodeManager status details shows Application stuck in FINISHING_CONTAINER_WAIT, Containers stuck in KILLING state.

      Attachments

        1. Screenshot 2023-03-16 at 1.02.45 PM.png
          130 kB
          Prabhu Joseph
        2. Screenshot 2023-03-16 at 1.02.57 PM.png
          114 kB
          Prabhu Joseph
        3. Screenshot 2023-03-16 at 1.02.22 PM.png
          356 kB
          Prabhu Joseph

        Activity

          People

            vineethNaroju Vineeth Naroju
            prabhujoseph Prabhu Joseph
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: