Hadoop Common / HADOOP-9898

Set SO_KEEPALIVE on all our sockets

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 3.0.0-alpha1
    • Fix Version/s: 2.3.0
    • Component/s: ipc, net
    • Labels: None

      Description

      We recently saw a case where network problems between slaves and the NN caused ESTABLISHED TCP connections to pile up and leak on the NN side. It looks like the RST packets were getting dropped, so the clients considered the connections closed while they stayed open forever on the server.

      Setting the SO_KEEPALIVE option on our sockets would prevent this kind of leak from going unchecked.
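
      For illustration only, here is a minimal sketch of what enabling SO_KEEPALIVE looks like with the plain java.net API. It is not the attached patch: the class and method names (KeepAliveSketch, connectWithKeepAlive, acceptWithKeepAlive) are hypothetical, and the real change would go wherever the IPC client and server create or accept their sockets.

```java
import java.io.IOException;
import java.net.InetAddress;
import java.net.InetSocketAddress;
import java.net.ServerSocket;
import java.net.Socket;

// Hypothetical sketch, not the HADOOP-9898 patch.
public class KeepAliveSketch {

    /** Client side: connect, then ask the kernel to send keepalive probes. */
    static Socket connectWithKeepAlive(InetSocketAddress addr, int timeoutMs)
            throws IOException {
        Socket s = new Socket();
        s.connect(addr, timeoutMs);
        s.setKeepAlive(true); // SO_KEEPALIVE
        return s;
    }

    /** Server side: enable keepalive on every accepted connection. */
    static Socket acceptWithKeepAlive(ServerSocket server) throws IOException {
        Socket s = server.accept();
        s.setKeepAlive(true); // lets the kernel eventually notice a dead peer
        return s;
    }

    public static void main(String[] args) throws IOException {
        // Bind to an ephemeral port on loopback so the demo is self-contained.
        try (ServerSocket server =
                 new ServerSocket(0, 50, InetAddress.getLoopbackAddress())) {
            InetSocketAddress addr = new InetSocketAddress(
                server.getInetAddress(), server.getLocalPort());
            try (Socket client = connectWithKeepAlive(addr, 5000);
                 Socket accepted = acceptWithKeepAlive(server)) {
                System.out.println("client SO_KEEPALIVE: " + client.getKeepAlive());
                System.out.println("server SO_KEEPALIVE: " + accepted.getKeepAlive());
            }
        }
    }
}
```

      The server-side call is the one that matters for the leak described above: with keepalive on, the kernel probes idle connections and closes those whose peer no longer answers. How quickly that happens depends on OS settings rather than on this code; on Linux the idle time before the first probe comes from net.ipv4.tcp_keepalive_time, which defaults to 7200 seconds.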

        Attachments

        1. hadoop-9898.txt
          2 kB
          Todd Lipcon

              People

              • Assignee: Todd Lipcon (tlipcon)
              • Reporter: Todd Lipcon (tlipcon)
              • Votes: 0
              • Watchers: 11