Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-11142

Taking snapshots can leave sockets on the master stuck in CLOSE_WAIT state

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Cannot Reproduce
    • 0.94.2, 0.99.0, 0.96.1.1, 0.98.2
    • None
    • None
    • None

    Description

      As reported by Hansi Klose on user@.

      we use a script to take on a regular basis snapshot's and delete old one's.
      We recognizes that the web interface of the hbase master was not working any more because of too many open files.
      The master reaches his number of open file limit of 32768
      When I run lsof I saw that there where a lot of TCP CLOSE_WAIT handles open with the regionserver as target.
      On the regionserver there is just one connection to the hbase master.
      I can see that the count of the CLOSE_WAIT handles grow each time
      i take a snapshot. When i delete on nothing changes.
      Each time i take a snapshot there are 20 - 30 new CLOSE_WAIT handles.

      Attachments

        Activity

          People

            Unassigned Unassigned
            apurtell Andrew Kyle Purtell
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: