Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
3.0.0-alpha-1, 2.2.0, 2.1.1, 2.0.2
-
None
-
None
Description
When testing sync replication, I found that, if I transit the remote cluster to DA, while the local cluster is still in A, the region server will hang when shutdown. As the fsOk flag only test the local cluster(which is reasonable), we will enter the waitOnAllRegionsToClose, and since the WAL is broken(the remote wal directory is gone) so we will never succeed. And this lead to an infinite wait inside waitOnAllRegionsToClose.
So I think here we should have an upper bound for the wait time in waitOnAllRegionsToClose method.
Attachments
Attachments
Issue Links
- relates to
-
HBASE-21932 Use Runtime.getRuntime().halt to terminate regionserver when abort timeout
- Resolved
- links to