Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
2.1.0-beta, 3.0.0-alpha1
-
None
-
Reviewed
Description
We've seen an issue in a secure cluster where, after a failover, the new NN isn't able to properly coordinate QJM recovery. The JNs fail to fetch logs from each other due to apparently not having a Kerberos TGT. It seems that we need to add the checkTGTAndReloginFromKeytab call prior to making the HTTP connection, since the java HTTP stuff doesn't do an automatic relogin