  1. Hadoop YARN
  2. YARN-4697

NM aggregation thread pool is not bound by limits

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.9.0, 3.0.0-alpha1
    • Component/s: nodemanager
    • Labels: None

      Description

      In LogAggregationService.java, we create a thread pool to upload logs from the NodeManager to HDFS when log aggregation is turned on. This is a cached thread pool which, according to the Javadoc, is an unbounded pool of threads.
      If log aggregation has been failing, this can cause a problem on restart: the number of threads created at that point could be huge, putting a heavy load on the NameNode and, in the worst case, even bringing it down due to file descriptor issues.
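
The difference between an unbounded and a bounded pool can be illustrated with a minimal, self-contained sketch using only `java.util.concurrent`. This is not the code from the attached patches: the class name `BoundedAggregationPool`, the cap `MAX_THREADS`, and the helper `peakThreads` are all hypothetical, and the blocking tasks merely stand in for log uploads that pile up at restart.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

public class BoundedAggregationPool {
    // Hypothetical cap chosen for illustration; a real deployment would tune this.
    public static final int MAX_THREADS = 4;

    /**
     * Submits taskCount tasks that all block until released, then returns
     * the largest number of threads the pool held at the same time.
     */
    public static int peakThreads(ExecutorService pool, int taskCount) {
        CountDownLatch release = new CountDownLatch(1);
        for (int i = 0; i < taskCount; i++) {
            pool.execute(() -> {
                try {
                    release.await(); // stand-in for a slow upload
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
        }
        // Both factory methods used below return a ThreadPoolExecutor.
        int peak = ((ThreadPoolExecutor) pool).getLargestPoolSize();
        release.countDown();
        pool.shutdown();
        try {
            pool.awaitTermination(10, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return peak;
    }

    public static void main(String[] args) {
        // Cached pool: one new thread per task while all existing threads are busy.
        System.out.println("cached pool peak threads: "
            + peakThreads(Executors.newCachedThreadPool(), 50));
        // Fixed pool: at most MAX_THREADS threads; remaining tasks wait in a queue.
        System.out.println("fixed pool peak threads:  "
            + peakThreads(Executors.newFixedThreadPool(MAX_THREADS), 50));
    }
}
```

With 50 simultaneously blocked tasks, the cached pool grows to 50 threads while the fixed pool stays at 4 and queues the rest. The trade-off is that a bounded pool delays work instead of multiplying threads, so the queue absorbs the backlog rather than the NameNode.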

        Attachments

        1. yarn4697.001.patch
          7 kB
          Haibo Chen
        2. yarn4697.002.patch
          8 kB
          Haibo Chen
        3. yarn4697.003.patch
          9 kB
          Haibo Chen
        4. yarn4697.004.patch
          12 kB
          Haibo Chen

            People

            • Assignee: Haibo Chen (haibochen)
            • Reporter: Haibo Chen (haibochen)
