Details
- Sub-task
- Status: Resolved
- Major
- Resolution: Fixed
- 2.7.0
- None
- Reviewed
Description
When FairCallQueue (fcq) backoff is enabled and an abusive client overflows the call queue, the server closes that client's connection, along with the connections of well-behaved clients whose calls arrive afterward. Disconnects are very disruptive, especially to multi-threaded clients with multiple outstanding requests and to clients without a retry proxy (e.g. datanodes).
Until the abusive user is downgraded to a lower-priority queue, the resulting churn of disconnects and reconnects significantly degrades performance. Server metrics look healthy even though client latency is terrible.
The fcq should disconnect IPC connections selectively, so that backoff pushback does not disconnect good clients.
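To make the proposal concrete, here is a minimal sketch of one possible selective-disconnect policy. It is not Hadoop's FairCallQueue code: the class `SelectiveBackoffQueue`, the `Outcome` enum, and the rule "disconnect only when the lowest-priority level overflows" are all assumptions made for illustration. The idea is that a rejected call from a client that has not yet been downgraded gets a retriable rejection on a connection that stays open, while only overflow in the lowest-priority level (where a downgraded abusive user would end up) triggers a disconnect.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;

/** Hypothetical sketch of selective backoff; names and policy are assumptions. */
class SelectiveBackoffQueue<T> {
    /** What the server should do with the call and its connection. */
    enum Outcome { ACCEPTED, REJECT_KEEP_OPEN, REJECT_DISCONNECT }

    // One bounded sub-queue per priority level; index 0 is the highest priority.
    private final List<BlockingQueue<T>> levels = new ArrayList<>();

    SelectiveBackoffQueue(int numLevels, int capacityPerLevel) {
        for (int i = 0; i < numLevels; i++) {
            levels.add(new ArrayBlockingQueue<>(capacityPerLevel));
        }
    }

    /** Tries to enqueue without blocking and reports how to treat the caller. */
    Outcome offer(int priority, T call) {
        if (levels.get(priority).offer(call)) {
            return Outcome.ACCEPTED;
        }
        // Assumed policy: only overflow in the lowest-priority level closes
        // the connection; other levels reject the call but keep it open.
        boolean lowestLevel = priority == levels.size() - 1;
        return lowestLevel ? Outcome.REJECT_DISCONNECT : Outcome.REJECT_KEEP_OPEN;
    }
}
```

With two levels of capacity one, a second call at priority 0 would be rejected with `REJECT_KEEP_OPEN`, so the client can retry on the same connection, whereas a second call at priority 1 would be rejected with `REJECT_DISCONNECT`.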
Attachments
Issue Links
- duplicates
- HADOOP-14440 Add metrics for connections dropped
- Resolved