I am not sure if this bug was already raised in JIRA.
In our test cluster we had a scenario where an RS went down and ServerShutDownHandler started processing it with splitLog.
But because HDFS was down, the waitOnSafeMode check threw an IOException.
We catch that exception,
so the HLog split itself did not happen. We encountered about 4 regions that had recently been split on the crashed RS and were lost.
Can we abort the Master in such scenarios? Please suggest.
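
To make the question concrete, here is a minimal, self-contained sketch of the two behaviours. It is not the actual HBase code: `FileSystemStub`, `MasterStub`, `handleSwallowing` and `handleAborting` are hypothetical names invented for illustration, and the "reassign" step is only my assumption about what follows the split in the handler. Only the names `splitLog` and `waitOnSafeMode` are taken from the scenario above.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical, simplified model. None of these types are real HBase classes.
public class SplitLogSketch {

    /** Stand-in for the file-system helper used during shutdown handling. */
    static class FileSystemStub {
        private final boolean hdfsUp;

        FileSystemStub(boolean hdfsUp) {
            this.hdfsUp = hdfsUp;
        }

        // Models the safe-mode check failing when HDFS is unreachable.
        void waitOnSafeMode() throws IOException {
            if (!hdfsUp) {
                throw new IOException("HDFS unavailable");
            }
        }

        // Models the log split: it only records an event if the check passes.
        void splitLog(String serverName, List<String> events) throws IOException {
            waitOnSafeMode();
            events.add("split:" + serverName);
        }
    }

    /** Stand-in for the master; abort() only records that it was requested. */
    static class MasterStub {
        boolean aborted = false;

        void abort(String why, Throwable cause) {
            aborted = true;
        }
    }

    // Reported behaviour: the IOException is caught and handling continues,
    // so the later step (assumed here to be reassignment) runs without a split.
    static List<String> handleSwallowing(FileSystemStub fs, String server) {
        List<String> events = new ArrayList<>();
        try {
            fs.splitLog(server, events);
        } catch (IOException e) {
            events.add("split-failed-ignored");
        }
        events.add("reassign:" + server);
        return events;
    }

    // Suggested behaviour: if the split cannot run, abort instead of continuing.
    static List<String> handleAborting(FileSystemStub fs, MasterStub master, String server) {
        List<String> events = new ArrayList<>();
        try {
            fs.splitLog(server, events);
        } catch (IOException e) {
            master.abort("Log split failed for " + server, e);
            return events; // nothing is reassigned without a completed split
        }
        events.add("reassign:" + server);
        return events;
    }

    public static void main(String[] args) {
        FileSystemStub hdfsDown = new FileSystemStub(false);
        MasterStub master = new MasterStub();
        System.out.println("swallowing: " + handleSwallowing(hdfsDown, "rs1"));
        System.out.println("aborting:   " + handleAborting(hdfsDown, master, "rs1"));
        System.out.println("master aborted: " + master.aborted);
    }
}
```

In the second variant nothing proceeds past a failed split, which is the behaviour I am asking about. Retrying the split until HDFS comes back might be another option, but I have not tried either.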