Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-10720

YARN WebAppProxyServlet should support connection timeout to prevent proxy server from hanging

    XMLWordPrintableJSON

Details

    • Reviewed

    Description

      Following is proxy server show, too many connections from one client, this caused the proxy server hang, and the yarn web can't jump to web proxy.

      Following is the AM which is abnormal, but proxy server don't know it is abnormal already, so the connections can't be closed, we should add time out support in proxy server to prevent this. And one abnormal AM may cause hundreds even thousands of connections, it is very heavy.

       

      After i kill the abnormal AM, the proxy server become healthy. This case happened many times in our production clusters, our clusters are huge, and the abnormal AM will be existed in a regular case.

       

      I will add timeout supported in web proxy server in this jira.

       

      cc  pbacsko ebadger Jim_Brennan  ztang  epayne gandras  bteke

       

      Attachments

        1. image-2021-03-29-14-04-33-776.png
          95 kB
          Qi Zhu
        2. image-2021-03-29-14-05-32-708.png
          182 kB
          Qi Zhu
        3. YARN-10720.001.patch
          11 kB
          Qi Zhu
        4. YARN-10720.002.patch
          12 kB
          Qi Zhu
        5. YARN-10720.003.patch
          12 kB
          Qi Zhu
        6. YARN-10720.004.patch
          11 kB
          Qi Zhu
        7. YARN-10720.005.patch
          11 kB
          Qi Zhu
        8. YARN-10720.006.patch
          11 kB
          Qi Zhu

        Issue Links

          Activity

            People

              zhuqi Qi Zhu
              zhuqi Qi Zhu
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1h
                  1h