Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-5849

On first cluster startup, RS aborts if root znode is not available

    XMLWordPrintableJSON

Details

    • Reviewed
    • Rather than exit, the regionserver will now wait even though the root directory in zookeeper has yet to be created.

    Description

      When launching a fresh new cluster, the master has to be started first, which might create race conditions for starting master and rs at the same time.

      Master startup code is smt like this:

      • establish zk connection
      • create root znodes in zk (/hbase)
      • create ephemeral node for master /hbase/master,

      Region server start up code is smt like this:

      • establish zk connection
      • check whether the root znode (/hbase) is there. If not, shutdown.
      • wait for the master to create znodes /hbase/master

      So, the problem is on the very first launch of the cluster, RS aborts to start since /hbase znode might not have been created yet (only the master creates it if needed). Since /hbase/ is not deleted on cluster shutdown, on subsequent cluster starts, it does not matter which order the servers are started. So this affects only first launchs.

      Attachments

        1. 5849v3.txt
          5 kB
          Michael Stack
        2. HBASE-5849_v1.patch
          1.0 kB
          Enis Soztutar
        3. HBASE-5849_v2.patch
          5 kB
          Enis Soztutar
        4. HBASE-5849_v4.patch
          5 kB
          Enis Soztutar
        5. HBASE-5849_v4.patch
          5 kB
          Enis Soztutar
        6. HBASE-5849_v4.patch
          5 kB
          Enis Soztutar
        7. HBASE-5849_v4-0.92.patch
          5 kB
          Enis Soztutar

        Activity

          People

            enis Enis Soztutar
            enis Enis Soztutar
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: