Details
-
Improvement
-
Status: Closed
-
Minor
-
Resolution: Incomplete
-
None
-
None
-
None
Description
Looking at this blog post about optimizing replication throughput for LinkedIn's Kafka:
http://sna-projects.com/blog/2011/01/optimizing-tcp-socket-across-data-centers/
It seems worth testing out if HBase replication connections can also benefit from increasing the socket receive buffer size on (expected to be) high-latency connections.
To this end, we would add a new configuration property for receive buffer size for replication connection and do some benchmarking to evaluate throughput with different values, verifying that making this configurable would have significant impact. For the moment, it seems best to scope the configuration setting to replication connections only, in order to avoid also impacting (negatively) intra-cluster communications.