  1. HBase
  2. HBASE-19144

[RSgroups] Retry assignments in FAILED_OPEN state when servers (re)join the cluster

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.4.0, 2.0.0
    • Component/s: rsgroup
    • Labels: None
    • Hadoop Flags: Reviewed
    • Release Note: When regionserver placement groups (RSGroups) are active, the Master will attempt to reassign regions in FAILED_OPEN state as servers join the cluster.

    Description

      After all servers in an RSGroup go down, the regions of that group cannot be opened anywhere and transition rapidly into FAILED_OPEN state.

      2017-10-31 21:06:25,449 INFO [ProcedureExecutor-13] master.RegionStates: Transition {c6c8150c9f4b8df25ba32073f25a5143 state=OFFLINE, ts=1509483985448, server=node-5.cluster,16020,1509482700768} to {c6c8150c9f4b8df25ba32073f25a5143 state=FAILED_OPEN, ts=1509483985449, server=node-5.cluster,16020,1509482700768}

      2017-10-31 21:06:25,449 WARN [ProcedureExecutor-13] master.RegionStates: Failed to open/close d4e2f173e31ffad6aac140f4bd7b02bc on node-5.cluster,16020,1509482700768, set to FAILED_OPEN

      Any region in FAILED_OPEN state has to be manually reassigned. Alternatively, restarting the master also causes assignment to be reattempted for every region in FAILED_OPEN state. This behavior is not unexpected, but it is an operational headache. It would be better if the RSGroupInfoManager could automatically kick off reassignment of regions in FAILED_OPEN state when servers rejoin the cluster.
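
      The requested behavior can be illustrated with a minimal, self-contained sketch. It is not the attached patch and uses no HBase classes: FailedOpenRetrySketch, its State enum, and the methods addServerToGroup, setRegion, stateOf, and serverJoined are all hypothetical names invented for this illustration. The sketch assumes the master can look up, for each region, its current state and the RSGroup it belongs to, and that "retrying" can be modeled by moving the region back to OFFLINE so that normal assignment picks it up again.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

/** Hypothetical illustration only; none of these types are HBase classes. */
final class FailedOpenRetrySketch {

    enum State { OFFLINE, OPEN, FAILED_OPEN }

    // Region encoded name -> current state (stand-in for the master's region states).
    // A TreeMap keeps iteration order deterministic for the example.
    private final Map<String, State> regionStates = new TreeMap<>();
    // Region encoded name -> name of the RSGroup the region must be hosted in.
    private final Map<String, String> regionGroup = new HashMap<>();
    // RSGroup name -> servers that belong to the group.
    private final Map<String, Set<String>> groupServers = new HashMap<>();

    void addServerToGroup(String group, String server) {
        groupServers.computeIfAbsent(group, g -> new HashSet<>()).add(server);
    }

    void setRegion(String region, String group, State state) {
        regionGroup.put(region, group);
        regionStates.put(region, state);
    }

    State stateOf(String region) {
        return regionStates.get(region);
    }

    /**
     * Called when a server (re)joins the cluster. Every FAILED_OPEN region whose
     * group contains that server is moved back to OFFLINE, which models queuing
     * a new assignment attempt. Returns the regions that were retried.
     */
    List<String> serverJoined(String server) {
        List<String> retried = new ArrayList<>();
        for (Map.Entry<String, State> e : regionStates.entrySet()) {
            if (e.getValue() != State.FAILED_OPEN) {
                continue; // only regions stuck in FAILED_OPEN need a retry
            }
            Set<String> servers = groupServers.get(regionGroup.get(e.getKey()));
            if (servers != null && servers.contains(server)) {
                e.setValue(State.OFFLINE); // value update only, safe while iterating
                retried.add(e.getKey());
            }
        }
        return retried;
    }

    public static void main(String[] args) {
        FailedOpenRetrySketch master = new FailedOpenRetrySketch();
        master.addServerToGroup("groupA", "node-5.cluster,16020");
        master.setRegion("c6c8150c9f4b8df25ba32073f25a5143", "groupA", State.FAILED_OPEN);
        master.setRegion("d4e2f173e31ffad6aac140f4bd7b02bc", "groupB", State.FAILED_OPEN);
        // Only the groupA region is retried; the groupB region has no server yet.
        System.out.println(master.serverJoined("node-5.cluster,16020"));
    }
}
```

      In this sketch a region whose group still has no live server stays in FAILED_OPEN, so a later join of a server from that group triggers the retry instead. The group names and the placement of the two region names from the log above into groups are made up for the example.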

      Attachments

        1. HBASE-19144.patch
          7 kB
          Andrew Kyle Purtell
        2. HBASE-19144.patch
          7 kB
          Andrew Kyle Purtell
        3. HBASE-19144.patch
          6 kB
          Andrew Kyle Purtell
        4. HBASE-19144-branch-1.patch
          8 kB
          Andrew Kyle Purtell
        5. HBASE-19144-branch-1.patch
          7 kB
          Andrew Kyle Purtell

          People

            Assignee: Andrew Kyle Purtell (apurtell)
            Reporter: Andrew Kyle Purtell (apurtell)
            Votes: 0
            Watchers: 4
