  1. Solr
  2. SOLR-3280

Too many / sometimes stale CLOSE_WAIT connections from SnapPuller during / after replication


Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Incomplete
    • Affects Version/s: 3.5, 3.6, 4.0-ALPHA
    • None
    • None
    • None

    Description

      Sometimes too many CLOSE_WAIT connections, some of them stale, are left over on the SLAVE server during or after replication.
      Normally GC should clean these up, but this is not always the case.
      Also, if a CLOSE_WAIT connection hangs, the new replication won't load.

      The dirty workaround so far is to fake a TCP connection as root to that connection and close it.
      After that the new replication loads, the old index and searcher are released, and the system
      returns to normal operation.

      Background:
      The SnapPuller uses Apache HttpClient 3.x with the MultiThreadedHttpConnectionManager.
      After a connection has been used, the manager keeps it in the pool for further requests, where it can sit in CLOSE_WAIT.
      This happens when releaseConnection is called. But if such a connection is stuck, it is no longer usable and a new
      connection from the pool is used instead.

      Solution:
      After calling releaseConnection, clean up with closeIdleConnections(0).
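
      The proposed cleanup can be sketched roughly as follows. This is not the attached patch and not the SnapPuller code. It uses a small hypothetical stand-in pool (SimpleConnectionManager, PooledConnection and ReplicationFetch are made-up names) so that it runs without HttpClient on the classpath. In HttpClient 3.x the corresponding calls would be HttpMethod.releaseConnection() and HttpConnectionManager.closeIdleConnections(long).

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.Iterator;

// Hypothetical stand-in for a pooled connection; not an HttpClient class.
final class PooledConnection {
    private boolean open = true;
    boolean isOpen() { return open; }
    void close() { open = false; }
}

// Hypothetical, much simplified model of a pooling connection manager.
final class SimpleConnectionManager {
    private static final class IdleEntry {
        final PooledConnection connection;
        final long idleSince;
        IdleEntry(PooledConnection connection, long idleSince) {
            this.connection = connection;
            this.idleSince = idleSince;
        }
    }

    private final Deque<IdleEntry> idle = new ArrayDeque<>();

    // Reuse an idle connection if one is pooled, otherwise create a new one.
    synchronized PooledConnection getConnection() {
        IdleEntry entry = idle.pollFirst();
        return entry != null ? entry.connection : new PooledConnection();
    }

    // Return the connection to the pool; it stays open for later reuse.
    synchronized void releaseConnection(PooledConnection connection) {
        idle.addLast(new IdleEntry(connection, System.currentTimeMillis()));
    }

    // Close pooled connections idle for at least idleTimeoutMillis.
    // With 0, every currently idle connection is closed.
    synchronized void closeIdleConnections(long idleTimeoutMillis) {
        long cutoff = System.currentTimeMillis() - idleTimeoutMillis;
        Iterator<IdleEntry> it = idle.iterator();
        while (it.hasNext()) {
            IdleEntry entry = it.next();
            if (entry.idleSince <= cutoff) {
                entry.connection.close();
                it.remove();
            }
        }
    }

    synchronized int idleCount() { return idle.size(); }
}

final class ReplicationFetch {
    // One fetch: always release, then immediately close whatever is idle.
    static void fetchOnce(SimpleConnectionManager manager) {
        PooledConnection connection = manager.getConnection();
        try {
            // ... transfer index files over the connection ...
        } finally {
            manager.releaseConnection(connection);
            manager.closeIdleConnections(0);
        }
    }
}
```

      The trade-off in this sketch is that no connection is reused between fetches, so each replication run pays for a fresh connection in exchange for not leaving idle sockets behind.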

      Attachments

        1. SOLR-3280.patch
          0.8 kB
          Bernd Fehling


          People

            Assignee: Robert Muir (rcmuir)
            Reporter: Bernd Fehling (befehl)
            Votes: 0
            Watchers: 4
