Details
-
Bug
-
Status: Closed
-
Critical
-
Resolution: Fixed
-
2.0.0-alpha
-
None
-
Reviewed
Description
After fixing the datanode side of keepalive to properly disconnect stale clients, (HDFS-3357), the client side has the following issue: when it connects to a DN, it first tries to use cached sockets, and will try a configurable number of sockets from the cache. If there are more cached sockets than the configured number of retries, and all of them have been closed by the datanode side, then the client will throw an exception and mark the replica node as dead.