Description
The NN and SBN do this dance during checkpoint image transfer with nested HTTP GETs via HttpURLConnection. When an admin does a -transitionToActive during this transfer, part of that is interrupting an ongoing checkpoint so we can transition immediately.
However, the thread.interrupt() in StandbyCheckpointer#stop gets swallowed by connection.getResponseCode() in TransferFsImage#doGetUrl. None of the methods in HttpURLConnection throw InterruptedException, so we need to do something else (perhaps HttpClient [1]):
Attachments
Attachments
Issue Links
- is related to
-
HDFS-6243 HA NameNode transition to active or shutdown may leave lingering image transfer thread.
- Closed