  1. Hadoop YARN
  2. YARN-1274

LCE fails to run containers that don't have resources to localize

Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 2.1.1-beta
    • Fix Version/s: 2.2.0
    • Component/s: nodemanager
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      LCE container launch assumes that the usercache/USER directory exists and is owned by the user running the container process.

      However, the LCE localization command creates that directory only when there are resources to localize. If there are no resources to localize, LCE localization never runs, and the launch fails with exit code 255. The NM logs show something like:

      2013-10-04 14:07:56,425 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: main : command provided 1
      2013-10-04 14:07:56,425 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: main : user is llama
      2013-10-04 14:07:56,425 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: Can't create directory llama in /yarn/nm/usercache/llama/appcache/application_1380853306301_0004/container_1380853306301_0004_01_000004 - Permission denied
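
      The failure mode described above can be illustrated with a small toy model. This is only a sketch and not the NodeManager or container-executor code, nor the attached patch: the function names (localize, launch, ensure_user_dir) are hypothetical, and the model raises FileNotFoundError where the real LCE reports "Permission denied". It assumes one plausible remedy, which is to make sure usercache/USER exists before launch regardless of whether anything was localized.

```python
import os
import tempfile


def localize(nm_root, user, resources):
    """Toy localization step (hypothetical): mirrors the reported behavior,
    where usercache/<user> is created only if there is something to localize."""
    if not resources:
        return  # nothing to localize, so the step is skipped entirely
    os.makedirs(os.path.join(nm_root, "usercache", user, "appcache"), exist_ok=True)


def ensure_user_dir(nm_root, user):
    """Assumed remedy (not the actual patch): create usercache/<user>
    unconditionally so that launch never depends on localization."""
    user_dir = os.path.join(nm_root, "usercache", user)
    os.makedirs(os.path.join(user_dir, "appcache"), exist_ok=True)
    return user_dir


def launch(nm_root, user, app_id, container_id):
    """Toy launch step (hypothetical): assumes usercache/<user> already exists
    and creates the container work directory underneath it."""
    user_dir = os.path.join(nm_root, "usercache", user)
    if not os.path.isdir(user_dir):
        # The real LCE fails here with "Permission denied" and exit code 255.
        raise FileNotFoundError("missing user directory: " + user_dir)
    work_dir = os.path.join(user_dir, "appcache", app_id, container_id)
    os.makedirs(work_dir)
    return work_dir
```

      In this model, calling localize with an empty resource list and then launch raises an error, while calling ensure_user_dir first lets launch succeed. A real fix would also have to set the ownership of the directory to the container user, which this sketch does not attempt.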
      

      Attachments

        1. YARN-1274.1.txt
          4 kB
          Siddharth Seth
        2. YARN-1274.trunk.1.txt
          4 kB
          Siddharth Seth
        3. YARN-1274.trunk.2.txt
          4 kB
          Vinod Kumar Vavilapalli

            People

              Assignee: Siddharth Seth (sseth)
              Reporter: Alejandro Abdelnur (tucu00)
