  1. Hadoop YARN
  2. YARN-9802 YARN Timeline Service v2 (post GA features)
  3. YARN-10068

TimelineV2Client may leak file descriptors creating ClientResponse objects.

Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 3.0.0
    • Fix Version/s: 3.3.0
    • Component/s: ATSv2
    • Labels: None
    • Environment: HDP version 3.1.4, Ambari version 2.7.4.0

    Description

      Hi team,

      A code walkthrough comparing v1 and v2 of the TimelineClient API revealed that the v2 method TimelineV2ClientImpl#putObjects does not close the ClientResponse object when the Timeline Server returns a success status. The ClientResponse is closed only when the server returns an error response, through the call to ClientResponse#getEntity.

      We also noticed that TimelineClient (v1) closes the ClientResponse object in TimelineWriter#putEntities by calling ClientResponse#getEntity on both the success and error paths, which avoids this file descriptor leak.
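      To illustrate the difference between the two code paths, here is a minimal, self-contained sketch. It is not the actual Hadoop or Jersey code: FakeResponse is a hypothetical stand-in for ClientResponse, the method names putObjectsLeaky and putObjectsFixed are made up for illustration, and the check against status 200 and the use of IllegalStateException are assumptions. The try/finally shown in the second method is one possible shape of a fix, not necessarily what the attached patches do.

```java
/** Sketch of the leak pattern described above, using a hypothetical stand-in response type. */
public class ResponseCloseSketch {

  /** Hypothetical stand-in for a client response that holds a connection until closed. */
  public static final class FakeResponse {
    private final int status;
    private final String body;
    private boolean closed;

    public FakeResponse(int status, String body) {
      this.status = status;
      this.body = body;
    }

    public int getStatus() {
      return status;
    }

    /** Reading the entity consumes the body and releases the underlying connection. */
    public String getEntity() {
      closed = true;
      return body;
    }

    /** Idempotent explicit release of the underlying connection. */
    public void close() {
      closed = true;
    }

    public boolean isClosed() {
      return closed;
    }
  }

  /** Pattern reported for v2: the response is released only on the error path. */
  public static void putObjectsLeaky(FakeResponse resp) {
    if (resp.getStatus() != 200) { // assumed success code
      throw new IllegalStateException(
          "Timeline server returned " + resp.getStatus() + ": " + resp.getEntity());
    }
    // Success path: returns without reading the entity or closing the response.
  }

  /** One possible fix: release the response on every path. */
  public static void putObjectsFixed(FakeResponse resp) {
    try {
      if (resp.getStatus() != 200) { // assumed success code
        throw new IllegalStateException(
            "Timeline server returned " + resp.getStatus() + ": " + resp.getEntity());
      }
    } finally {
      resp.close();
    }
  }

  public static void main(String[] args) {
    FakeResponse a = new FakeResponse(200, "");
    putObjectsLeaky(a);
    System.out.println("leaky, closed on success: " + a.isClosed()); // false

    FakeResponse b = new FakeResponse(200, "");
    putObjectsFixed(b);
    System.out.println("fixed, closed on success: " + b.isClosed()); // true
  }
}
```

      With a real HTTP client, each response left open on the success path can keep its socket and file descriptor allocated, which matches the symptom reported here.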

      The customer's original symptom was that the NodeManager went down with a 'too many open files' condition; many sockets in the CLOSE_WAIT state were observed between the timeline client (in the NM) and the Timeline Server hosts.
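      As a rough way to watch for this kind of leak from inside a JVM, the sketch below reads the process's open file descriptor count through the JDK's com.sun.management.UnixOperatingSystemMXBean. The class name OpenFdProbe is hypothetical, and the probe only reports on the JVM it runs in, so this assumes it is executed inside the process of interest (or that the same MXBean is read remotely over JMX). It returns -1 on platforms where the Unix-specific bean is not available.

```java
import com.sun.management.UnixOperatingSystemMXBean;
import java.lang.management.ManagementFactory;
import java.lang.management.OperatingSystemMXBean;

/** Hypothetical probe that reports file descriptor usage of the current JVM. */
public class OpenFdProbe {

  /** Returns the number of open file descriptors, or -1 if the JVM does not expose it. */
  public static long openFileDescriptors() {
    OperatingSystemMXBean os = ManagementFactory.getOperatingSystemMXBean();
    if (os instanceof UnixOperatingSystemMXBean) {
      return ((UnixOperatingSystemMXBean) os).getOpenFileDescriptorCount();
    }
    return -1L;
  }

  /** Returns the file descriptor limit for this process, or -1 if unavailable. */
  public static long maxFileDescriptors() {
    OperatingSystemMXBean os = ManagementFactory.getOperatingSystemMXBean();
    if (os instanceof UnixOperatingSystemMXBean) {
      return ((UnixOperatingSystemMXBean) os).getMaxFileDescriptorCount();
    }
    return -1L;
  }

  public static void main(String[] args) {
    System.out.println("open fds: " + openFileDescriptors()
        + ", limit: " + maxFileDescriptors());
  }
}
```

      A count that climbs steadily toward the limit while the workload stays constant would be consistent with responses not being closed.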

      Could you please help resolve this issue? Thanks.

      Attachments

        1. image-2020-01-02-14-58-12-773.png
          4.40 MB
          Anand Srinivasan
        2. YARN-10068.001.patch
          3 kB
          Anand Srinivasan
        3. YARN-10068.002.patch
          3 kB
          Anand Srinivasan
        4. YARN-10068.003.patch
          3 kB
          Anand Srinivasan

          People

            Assignee: Anand Srinivasan (anand.srinivasan)
            Reporter: Anand Srinivasan (anand.srinivasan)