Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-20728

Failure and recovery of all RSes in a RSgroup requires master restart for region assignments

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Incomplete
    • None
    • None
    • master, rsgroup
    • None

    Description

      If all the RSes in a RSgroup hosting user tables fail and recover, master still looks for old RSes (with old timestamp in the RS identifier) to assign regions. i.e. Regions are left in transition making the tables in the RSGroup unavailable. User need to restart master or manually assign the regions to make the tables available. Steps to recreate the scenario in a local cluster

      • Add required properties to site.xml to enable rsgroup and start hbase
      • Bring up multiple region servers using local-regionservers.sh start
      • Create a rsgroup and move a subset of  regionservers to the group
      • Create a table, move it to the group and put some data
      • Stop the regionservers in the group and restart them
      • From the master UI, we can see that the region for the table in transition and the RS name in the RIT message has the old timestamp.

      Attachments

        Activity

          People

            Unassigned Unassigned
            gsbiju Biju Nair
            Votes:
            0 Vote for this issue
            Watchers:
            9 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: