Details
Type: Bug
Status: Closed
Priority: Major
Resolution: Won't Fix
Affects Version/s: 0.94.16
Fix Version/s: None
Component/s: None
Labels: None
A corrupt or missing HFile could cause endless attempts to assign the region, with no chance of success
Description
As described in HBASE-9737, a corrupt HFile in a region can cause an assignment storm in the cluster: the Master keeps trying to assign the region to each region server in turn, and none of them can succeed.
When a region server detects this condition, it should mark the region as "RS_ZK_REGION_FAILED_ERROR" (or something to that effect) in ZooKeeper. That state would tell the Master to stop assigning the region until the error has been resolved (via an HBase shell command, probably "assign"?).
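The proposed behaviour can be illustrated with a small, self-contained toy model. This is only a sketch of the idea and not HBase code: the class `AssignmentSketch` and all of its methods are hypothetical names, an in-memory set stands in for the ZooKeeper state, and `clearFailure` stands in for whatever operator action (such as the shell "assign" command) would reset the flag.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Toy model of the proposal. None of these names are real HBase APIs. */
class AssignmentSketch {
    /** Stand-in for a ZooKeeper marker such as "RS_ZK_REGION_FAILED_ERROR". */
    private final Set<String> failedRegions = new HashSet<>();
    /** Regions whose HFiles are assumed to be corrupt or missing. */
    private final Set<String> corruptRegions = new HashSet<>();
    private final List<String> servers;
    /** Counts how many times any region server tried to open a region. */
    int openAttempts = 0;

    AssignmentSketch(List<String> servers) {
        this.servers = servers;
    }

    void corrupt(String region) { corruptRegions.add(region); }

    void repair(String region) { corruptRegions.remove(region); }

    /** Operator action that clears the failure marker after the fix. */
    void clearFailure(String region) { failedRegions.remove(region); }

    /** Region server side: try to open; on a bad HFile, record the failure. */
    boolean tryOpen(String server, String region) {
        openAttempts++;
        if (corruptRegions.contains(region)) {
            failedRegions.add(region); // tells the Master to stop retrying
            return false;
        }
        return true;
    }

    /** Master side: returns the hosting server, or null if unassigned. */
    String assign(String region) {
        for (String server : servers) {
            if (failedRegions.contains(region)) {
                return null; // marked as failed: do not try further servers
            }
            if (tryOpen(server, region)) {
                return server;
            }
        }
        return null;
    }

    public static void main(String[] args) {
        AssignmentSketch s = new AssignmentSketch(Arrays.asList("rs1", "rs2", "rs3"));
        s.corrupt("r1");
        System.out.println("first assign: " + s.assign("r1"));
        System.out.println("open attempts: " + s.openAttempts);
        s.repair("r1");
        s.clearFailure("r1");
        System.out.println("after repair: " + s.assign("r1"));
    }
}
```

In this model a corrupt region costs one failed open instead of one per region server, and every later assignment attempt is skipped until the marker is cleared.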
Attachments
Issue Links
- is related to
  - HBASE-9737 Corrupt HFile cause resource leak leading to Region Server OOM (Closed)
  - HBASE-9522 Allow region opening even if creation of some HFile Readers fail. (Closed)