HBASE-5875

Process RIT and Master restart may remove an online server considering it as a dead server

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.92.1
    • Fix Version/s: 0.94.1, 0.95.0
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      If the master, on restart, finds ROOT/META in RIT (region in transition) state, it tries to assign the ROOT region through ProcessRIT.

      The master triggers the assignment and then tries to verify the root region location.
      Verification checks whether the RS has the region in its online list.
      If the RS has not yet completed the master-triggered assignment, verification of the root region location fails.
      Because it failed, the master calls

      splitLogAndExpireIfOnline(currentRootServer);
      

      which splits the log and also removes the server from the online server list. Ideally there is nothing to split here, since no region server was restarted.

      So even though the server is online, the master invalidates the region server.
      In the special case of a cluster with only one RS, the cluster becomes inoperative.
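
      The sequence above can be modelled with a small, self-contained Java sketch. This is a toy, not HBase code: RegionServer, Master, restartNaive and restartGuarded are hypothetical names invented for illustration, and the guard shown is only one plausible way to avoid the problem, not necessarily what the attached patches do.

```java
import java.util.HashSet;
import java.util.Set;

/** Toy model of the restart race described in this issue; NOT HBase code. */
public class RitRestartSketch {

    /** Hypothetical stand-in for a region server. */
    static final class RegionServer {
        final String name;
        boolean rootOnline = false; // would become true once the RS finishes opening ROOT
        RegionServer(String name) { this.name = name; }
    }

    /** Hypothetical stand-in for the master's view of the cluster. */
    static final class Master {
        final Set<String> onlineServers = new HashSet<>();
        boolean rootInTransition = false;
        int logSplits = 0;

        /** Only triggers the assignment; the RS has not opened the region yet. */
        void assignRoot(RegionServer rs) {
            rootInTransition = true;
        }

        /** Verification as described: is the region in the RS's online list? */
        boolean verifyRootLocation(RegionServer rs) {
            return rs.rootOnline;
        }

        /** Models the described effect: split the log and drop the server. */
        void splitLogAndExpire(RegionServer rs) {
            logSplits++;
            onlineServers.remove(rs.name);
        }

        /** Reported behaviour: any failed verification expires the server. */
        void restartNaive(RegionServer rs) {
            assignRoot(rs);
            if (!verifyRootLocation(rs)) {
                splitLogAndExpire(rs);
            }
        }

        /** Assumed guard: skip expiry while ROOT is in transition on a live server. */
        void restartGuarded(RegionServer rs) {
            assignRoot(rs);
            boolean pendingOnLiveServer =
                rootInTransition && onlineServers.contains(rs.name);
            if (!verifyRootLocation(rs) && !pendingOnLiveServer) {
                splitLogAndExpire(rs);
            }
        }
    }

    /** Builds a master that already considers the given server online. */
    static Master newMasterWith(RegionServer rs) {
        Master m = new Master();
        m.onlineServers.add(rs.name);
        return m;
    }

    public static void main(String[] args) {
        RegionServer rs = new RegionServer("rs1");

        Master naive = newMasterWith(rs);
        naive.restartNaive(rs);
        System.out.println("naive online servers:   " + naive.onlineServers);

        Master guarded = newMasterWith(rs);
        guarded.restartGuarded(rs);
        System.out.println("guarded online servers: " + guarded.onlineServers);
    }
}
```

      In the naive path the single live server is dropped even though nothing was restarted, which is the single-RS failure mode reported here. The guarded path leaves the server in place and still expires a server that the master does not consider online.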

      1. HBASE-5875.patch
        2 kB
        ramkrishna.s.vasudevan
      2. HBASE-5875_0.94.patch
        17 kB
        ramkrishna.s.vasudevan
      3. HBASE-5875_0.94_1.patch
        17 kB
        ramkrishna.s.vasudevan
      4. HBASE-5875v2.patch
        3 kB
        chunhui shen
      5. HBASE-5875_trunk.patch
        3 kB
        rajeshbabu
      6. HBASE-5875_trunk.patch
        3 kB
        rajeshbabu
      7. HBASE-5875_0.94_2.patch
        4 kB
        ramkrishna.s.vasudevan
      8. HBASE-5875_trunk_1.patch
        3 kB
        ramkrishna.s.vasudevan

          People

          • Assignee:
            ramkrishna.s.vasudevan
            Reporter:
            ramkrishna.s.vasudevan
          • Votes:
            0
            Watchers:
            12
