Details
Type: Improvement
Status: Closed
Priority: Blocker
Resolution: Fixed
Description
Internally we have a cluster of thousands of regions. We just did an HBase restart with the master on a new node. It just so happened that one of the regionservers was running extra slow (it was loaded down by other processes). That meant its portion of the assignments took a long time to come up. While these regions were stuck in deploy mode, the cluster was not usable.
We need a sort of 'safe-mode' in HBase where clients fail if they try to attach to a cluster that is not yet fully up. The UI should show when all assignments have been made successfully so an admin can at least see when they have a problematic regionserver in their midst.
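
To make the request concrete, here is a rough sketch of what such a gate could look like. It is only an illustration: SafeModeGate, ClusterNotReadyException, and all of their methods are hypothetical names, not existing HBase classes, and it assumes the master knows the total number of regions to assign at startup.

```java
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical exception a client would see while the cluster is still starting. */
final class ClusterNotReadyException extends RuntimeException {
    ClusterNotReadyException(String message) {
        super(message);
    }
}

/** Hypothetical safe-mode gate; an illustration only, not part of the HBase API. */
final class SafeModeGate {
    private final int totalRegions;
    private final AtomicInteger pending;

    SafeModeGate(int totalRegions) {
        if (totalRegions < 0) {
            throw new IllegalArgumentException("totalRegions must be >= 0");
        }
        this.totalRegions = totalRegions;
        this.pending = new AtomicInteger(totalRegions);
    }

    /** Called once per region when a regionserver reports the region open. */
    void regionOpened() {
        // Clamp at zero so a stray extra report cannot make the count negative.
        pending.updateAndGet(p -> p > 0 ? p - 1 : 0);
    }

    /** True while at least one region is still waiting to be deployed. */
    boolean inSafeMode() {
        return pending.get() > 0;
    }

    /** Fails fast for clients that attach before all assignments are made. */
    void checkClientAccess() {
        int p = pending.get();
        if (p > 0) {
            throw new ClusterNotReadyException(
                "cluster in safe mode: " + p + " regions still unassigned");
        }
    }

    /** Status line that an admin UI could display. */
    String status() {
        int p = pending.get();
        return (totalRegions - p) + "/" + totalRegions + " regions assigned"
            + (p > 0 ? " (safe mode)" : "");
    }
}
```

With something like this, a client request would call checkClientAccess() first and fail immediately instead of hanging, and the UI could print status() so a count that stops short of the total points at the slow regionserver.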