Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-17902

Backport HBASE-16367 "Race between master and region server initialization may lead to premature server abort" to 1.3

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.3.2
    • None
    • None
    • Reviewed

    Description

      This is to fix the case where hbase master always dies shortly after start.

      It turned out that master initialization thread was racing with HRegionServer#preRegistrationInitialization() (initializeZooKeeper, actually) since HMaster extends HRegionServer.
      Through additional logging in master:

          this.oldLogDir = createInitialFileSystemLayout();
          HFileSystem.addLocationsOrderInterceptor(conf);
          LOG.info("creating splitLogManager");
      

      I found that execution didn't reach the last log line before region server declared cluster Id being null.

      branch-1.3 has been in quiet mode leading up to the release of 1.3.1
      Once 1.3.1 is released, the fix can go into branch-1.3

      Attachments

        1. 17902.branch-1.3.txt
          4 kB
          Ted Yu

        Issue Links

          Activity

            People

              yuzhihong@gmail.com Ted Yu
              yuzhihong@gmail.com Ted Yu
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: