  1. Hadoop YARN
  2. YARN-4697

NM aggregation thread pool is not bound by limits

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.9.0, 3.0.0-alpha1
    • Component/s: nodemanager
    • Labels:
      None

      Description

      In LogAggregationService.java we create a thread pool to upload logs from the NodeManager to HDFS when log aggregation is turned on. This is a cached thread pool which, according to the Javadoc, places no limit on the number of threads it creates.
      If log aggregation has had a problem, this can cause trouble on restart: the number of threads created at that point could be huge, putting a large load on the NameNode and, in the worst case, even bringing it down due to file descriptor issues.
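
      To illustrate the difference described above, here is a minimal sketch using only java.util.concurrent. It is not the code from the attached patches: the class name, the MAX_UPLOAD_THREADS constant, and the sleep that stands in for an HDFS upload are all assumptions made for the example. It submits the same burst of tasks to a fixed-size pool and to a cached pool and records the peak number running at once.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

public class BoundedAggregationPool {
    // Hypothetical cap on concurrent uploads; not a real YARN setting.
    public static final int MAX_UPLOAD_THREADS = 4;

    /**
     * Submits `tasks` simulated uploads to `pool`, waits for all of them,
     * shuts the pool down, and returns the peak number that ran concurrently.
     */
    public static int peakConcurrency(ExecutorService pool, int tasks)
            throws InterruptedException {
        AtomicInteger running = new AtomicInteger();
        AtomicInteger peak = new AtomicInteger();
        CountDownLatch done = new CountDownLatch(tasks);
        for (int i = 0; i < tasks; i++) {
            pool.execute(() -> {
                int now = running.incrementAndGet();
                peak.accumulateAndGet(now, Math::max);
                try {
                    Thread.sleep(50); // stand-in for uploading one app's logs
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                } finally {
                    running.decrementAndGet();
                    done.countDown();
                }
            });
        }
        done.await();
        pool.shutdown();
        pool.awaitTermination(10, TimeUnit.SECONDS);
        return peak.get();
    }

    public static void main(String[] args) throws InterruptedException {
        // Fixed pool: extra tasks wait in the queue instead of spawning threads.
        int bounded = peakConcurrency(
                Executors.newFixedThreadPool(MAX_UPLOAD_THREADS), 40);
        // Cached pool: a new thread is created whenever none is idle.
        int unbounded = peakConcurrency(Executors.newCachedThreadPool(), 40);
        System.out.println("fixed pool peak:  " + bounded);
        System.out.println("cached pool peak: " + unbounded);
    }
}
```

      With the fixed pool the peak can never exceed MAX_UPLOAD_THREADS, so a backlog of pending aggregations after a restart is worked through gradually. The cached pool may start one thread per queued task.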

        Attachments

        1. yarn4697.001.patch
          7 kB
          Haibo Chen
        2. yarn4697.002.patch
          8 kB
          Haibo Chen
        3. yarn4697.003.patch
          9 kB
          Haibo Chen
        4. yarn4697.004.patch
          12 kB
          Haibo Chen

              People

              • Assignee:
                haibochen Haibo Chen
                Reporter:
                haibochen Haibo Chen
              • Votes:
                0
                Watchers:
                13
