Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-3142

If a master dies and comes back up before his znode expires, the RS heartbeat can lock up

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Duplicate
    • 0.89.20100924, 0.90.0
    • 0.90.0
    • master, regionserver
    • None

    Description

      During a rolling restart, we ran into a case where a master was shutdown and then brought back up before the znode expired.

      On the RS side, while the master was down, it was getting ConnectionRefused exceptions trying to heartbeat to what it thinks is the active master.

      Once the master process comes back up, the next heartbeat done by all the RSs just blocks indefinitely.

      This is somewhat related to HBASE-3141

      Attachments

        Activity

          People

            ryanobjc ryan rawson
            streamy Jonathan Gray
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: