Details
- Bug
- Status: Closed
- Major
- Resolution: Fixed
- 2.7.1, 2.6.2
- None
Description
This is related to YARN-3238. We have run into more scenarios where connection errors, such as NoRouteToHostException, are retried at multiple levels. The fix for YARN-3238 was too specific. We need a more general solution that catches a wider array of connection errors, so that they are not retried at both the RPC layer and the NM proxy layer.
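As a rough illustration of the "more general solution" idea, the sketch below classifies a failure by walking its cause chain and skips the proxy-level retry when the failure is a connection error that the RPC layer is assumed to have retried already. The class and method names (`ProxyRetrySketch`, `isConnectionError`, `shouldRetryAtProxyLayer`) and the chosen set of exception types are hypothetical; this is not Hadoop code and not necessarily the change that resolved this issue.

```java
import java.io.IOException;
import java.net.ConnectException;
import java.net.NoRouteToHostException;
import java.net.SocketTimeoutException;
import java.net.UnknownHostException;

// Hypothetical sketch, not part of Hadoop.
class ProxyRetrySketch {

    // Returns true if the throwable, or anything in its cause chain, is one of
    // the connection-level errors listed here. The list is an assumption made
    // for illustration; a real policy may need more or fewer types.
    static boolean isConnectionError(Throwable t) {
        for (Throwable cur = t; cur != null; cur = cur.getCause()) {
            if (cur instanceof ConnectException
                    || cur instanceof NoRouteToHostException
                    || cur instanceof UnknownHostException
                    || cur instanceof SocketTimeoutException) {
                return true;
            }
        }
        return false;
    }

    // The outer (proxy) layer retries only failures that are not connection
    // errors, on the assumption that the inner (RPC) layer already retried those.
    static boolean shouldRetryAtProxyLayer(Throwable t) {
        return !isConnectionError(t);
    }

    public static void main(String[] args) {
        Throwable wrapped = new IOException("call failed",
                new NoRouteToHostException("no route to host"));
        System.out.println(shouldRetryAtProxyLayer(wrapped));
        System.out.println(shouldRetryAtProxyLayer(new IOException("other failure")));
    }
}
```

Checking the whole cause chain, rather than only the top-level exception type, is what keeps the check general: a connection error wrapped in a generic IOException is still recognized.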
Attachments
Issue Links
- relates to
- YARN-3238 Connection timeouts to nodemanagers are retried at multiple levels
- Closed