Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
2.7.0
-
None
-
None
Description
Sporadic test failures occur because NameNode#stop used by the mini cluster does not properly manage the HA context's state. It directly calls HAState#exitState(context) instead of HAState#setState(context,state). The latter will properly lock the namesystem and update the ha state while locked, while the former does not. The result is that while the cluster is stopping, the lock is released and any queued rpc calls think the NN is still active and are processed while the NN is in an unstable half-stopped state.