Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-14000

Region server failed to report to Master and was stuck in reportForDuty retry loop

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.2.0, 1.1.2, 1.3.0, 2.0.0
    • regionserver
    • None
    • Reviewed

    Description

      In a HA cluster, region server got stuck in reportForDuty retry loop if the active master is restarting and later on master switch happens before it reports successfully.

      Root cause is same as HBASE-13317, but the region server tried to connect master when it was starting, so rssStub reset didnt happen as

        if (ioe instanceof ServerNotRunningYetException) {
      	LOG.debug("Master is not running yet");
        }
      

      When master starts, master switch happened. So RS always tried to connect to standby master.

      Attachments

        1. HBASE-14000.patch
          0.8 kB
          Pankaj Kumar
        2. HM_RS-Log_snippet.txt
          25 kB
          Pankaj Kumar

        Activity

          People

            pankaj2461 Pankaj Kumar
            pankaj2461 Pankaj Kumar
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: