Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-4697

NM aggregation thread pool is not bound by limits

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • None
    • 2.9.0, 3.0.0-alpha1
    • nodemanager
    • None

    Description

      In the LogAggregationService.java we create a threadpool to upload logs from the nodemanager to HDFS if log aggregation is turned on. This is a cached threadpool which based on the javadoc is an ulimited pool of threads.
      In the case that we have had a problem with log aggregation this could cause a problem on restart. The number of threads created at that point could be huge and will put a large load on the NameNode and in worse case could even bring it down due to file descriptor issues.

      Attachments

        1. yarn4697.001.patch
          7 kB
          Haibo Chen
        2. yarn4697.002.patch
          8 kB
          Haibo Chen
        3. yarn4697.003.patch
          9 kB
          Haibo Chen
        4. yarn4697.004.patch
          12 kB
          Haibo Chen

        Issue Links

          Activity

            People

              haibochen Haibo Chen
              haibochen Haibo Chen
              Votes:
              0 Vote for this issue
              Watchers:
              12 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: