Details
- Type: Bug
- Status: Open
- Priority: Major
- Resolution: Unresolved
Description
TaskTracker, DataNode, and SecondaryNameNode currently wait forever if the server each one depends on is not up. Each should take a configuration parameter that tells it when to give up, with a default of many minutes or hours (or more) to cope with basic choreography issues in a cluster. Test clusters can then be configured to fail sooner rather than later.
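A minimal sketch of the requested behaviour, for illustration only: retry a connection attempt until a configurable deadline passes, then give up while keeping the last failure as the cause. The class `BoundedStartupWait`, the method `waitFor`, and the parameters `timeoutMs` and `retryIntervalMs` are hypothetical names, not existing Hadoop APIs; a real patch would presumably read the timeout from the daemon's configuration.

```java
import java.io.IOException;
import java.util.concurrent.Callable;
import java.util.concurrent.TimeUnit;

/** Hypothetical helper: retries a connection attempt until a deadline passes. */
final class BoundedStartupWait {
    private BoundedStartupWait() {}

    /**
     * Calls connect until it succeeds or timeoutMs has elapsed.
     * timeoutMs and retryIntervalMs are assumed to come from configuration.
     */
    static <T> T waitFor(Callable<T> connect, long timeoutMs, long retryIntervalMs)
            throws IOException, InterruptedException {
        long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(timeoutMs);
        IOException lastFailure = null;
        while (true) {
            try {
                return connect.call();
            } catch (IOException e) {
                // Server not reachable yet: remember why, then retry.
                lastFailure = e;
            } catch (InterruptedException e) {
                throw e;
            } catch (Exception e) {
                // Anything else is not a "server is down" condition; stop retrying.
                throw new IOException("Unexpected failure while connecting", e);
            }
            long remainingNs = deadline - System.nanoTime();
            if (remainingNs <= 0) {
                // Give up, but keep the root cause attached for diagnosis.
                throw new IOException(
                        "Server not available after " + timeoutMs + " ms", lastFailure);
            }
            long remainingMs = Math.max(1L, TimeUnit.NANOSECONDS.toMillis(remainingNs));
            Thread.sleep(Math.min(retryIntervalMs, remainingMs));
        }
    }
}
```

With a large default timeout this behaves much like the current indefinite wait on a production cluster, while a test cluster could pass a small value to fail fast.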
Issue Links
- depends upon
  - HADOOP-4659 Root cause of connection failure is being lost to code that uses it for delaying startup (Closed)
  - HADOOP-6435 Make RPC.waitForProxy with timeout public (Closed)