Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-543 [Umbrella] NodeManager localization related issues
  3. YARN-467

Jobs fail during resource localization when public distributed-cache hits unix directory limits

VotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 2.0.0-alpha, 3.0.0-alpha1
    • 2.1.0-beta
    • nodemanager
    • None
    • Reviewed

    Description

      If we have multiple jobs which uses distributed cache with small size of files, the directory limit reaches before reaching the cache size and fails to create any directories in file cache (PUBLIC). The jobs start failing with the below exception.

      java.io.IOException: mkdir of /tmp/nm-local-dir/filecache/3901886847734194975 failed
      at org.apache.hadoop.fs.FileSystem.primitiveMkdir(FileSystem.java:909)
      at org.apache.hadoop.fs.DelegateToFileSystem.mkdir(DelegateToFileSystem.java:143)
      at org.apache.hadoop.fs.FilterFs.mkdir(FilterFs.java:189)
      at org.apache.hadoop.fs.FileContext$4.next(FileContext.java:706)
      at org.apache.hadoop.fs.FileContext$4.next(FileContext.java:703)
      at org.apache.hadoop.fs.FileContext$FSLinkResolver.resolve(FileContext.java:2325)
      at org.apache.hadoop.fs.FileContext.mkdir(FileContext.java:703)
      at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:147)
      at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:49)
      at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
      at java.util.concurrent.FutureTask.run(FutureTask.java:138)
      at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
      at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
      at java.util.concurrent.FutureTask.run(FutureTask.java:138)
      at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
      at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
      at java.lang.Thread.run(Thread.java:662)

      we need to have a mechanism where in we can create directory hierarchy and limit number of files per directory.

      Attachments

        1. yarn-467-testCode.tar
          17 kB
          Omkar Vinit Joshi
        2. yarn-467-20130402.2.patch
          41 kB
          Omkar Vinit Joshi
        3. yarn-467-20130402.1.patch
          35 kB
          Omkar Vinit Joshi
        4. yarn-467-20130402.patch
          36 kB
          Omkar Vinit Joshi
        5. yarn-467-20130401.patch
          34 kB
          Omkar Vinit Joshi
        6. yarn-467-20130328.patch
          31 kB
          Omkar Vinit Joshi
        7. yarn-467-20130325.1.patch
          31 kB
          Omkar Vinit Joshi
        8. yarn-467-20130325.path
          32 kB
          Omkar Vinit Joshi
        9. yarn-467-20130322.3.patch
          30 kB
          Omkar Vinit Joshi
        10. yarn-467-20130322.2.patch
          30 kB
          Omkar Vinit Joshi
        11. yarn-467-20130322.1.patch
          30 kB
          Omkar Vinit Joshi
        12. yarn-467-20130322.patch
          30 kB
          Omkar Vinit Joshi

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            ojoshi Omkar Vinit Joshi
            ojoshi Omkar Vinit Joshi
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment