Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-9898

Set SO_KEEPALIVE on all our sockets

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 3.0.0-alpha1
    • Fix Version/s: 2.3.0
    • Component/s: ipc, net
    • Labels:
      None

      Description

      We recently saw an issue where network issues between slaves and the NN caused ESTABLISHED TCP connections to pile up and leak on the NN side. It looks like the RST packets were getting dropped, which meant that the client thought the connections were closed, while they hung open forever on the server.

      Setting the SO_KEEPALIVE option on our sockets would prevent this kind of leak from going unchecked.

        Attachments

        1. hadoop-9898.txt
          2 kB
          Todd Lipcon

        Issue Links

          Activity

            People

            • Assignee:
              tlipcon Todd Lipcon
              Reporter:
              tlipcon Todd Lipcon

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment