Description
To distribute the hash cache to the region servers, the master node sends an `AddServerCacheRequest` RPC to each region server, so with N region servers it makes N of these RPCs. It generates a separate serialized RPC message for each region server and sends them concurrently, so it holds N serialized copies at once and uses O(N) memory on the master.
For example, if the `AddServerCacheRequest` RPC message is 100 MB and the cluster has 100 nodes, the master would use 10 GB of memory, potentially resulting in an "OutOfMemory" exception.
It would be better if the master could use O(1) memory for the RPC.
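To make the difference concrete, here is a minimal sketch of the two strategies. It does not use Phoenix's actual classes: `serialize`, `perServerCopies`, and `sharedPayload` are hypothetical names, and a plain byte array stands in for the serialized `AddServerCacheRequest`. It only counts how many payload bytes are allocated when every region server gets its own serialized copy, compared with serializing once and handing each server a read-only view of the same bytes.

```java
import java.nio.ByteBuffer;
import java.util.ArrayList;
import java.util.List;

final class SharedPayloadSketch {

    // Hypothetical stand-in for serializing the request; records how many
    // payload bytes were allocated in allocated[0].
    static byte[] serialize(int sizeBytes, long[] allocated) {
        allocated[0] += sizeBytes;
        return new byte[sizeBytes];
    }

    // One serialized copy per region server, all held at once because the
    // sends are concurrent: O(N) payload memory.
    static long perServerCopies(int servers, int sizeBytes) {
        long[] allocated = {0};
        List<byte[]> inFlight = new ArrayList<>();
        for (int i = 0; i < servers; i++) {
            inFlight.add(serialize(sizeBytes, allocated));
        }
        return allocated[0];
    }

    // Serialize once and give each region server a read-only view that
    // shares the same backing bytes: O(1) payload memory.
    static long sharedPayload(int servers, int sizeBytes) {
        long[] allocated = {0};
        ByteBuffer payload =
            ByteBuffer.wrap(serialize(sizeBytes, allocated)).asReadOnlyBuffer();
        List<ByteBuffer> inFlight = new ArrayList<>();
        for (int i = 0; i < servers; i++) {
            // duplicate() creates an independent position/limit but
            // does not copy the underlying bytes.
            inFlight.add(payload.duplicate());
        }
        return allocated[0];
    }

    public static void main(String[] args) {
        // 100 servers with a 1 KiB payload, scaled down from the 100 MB example.
        System.out.println(perServerCopies(100, 1024)); // 102400
        System.out.println(sharedPayload(100, 1024));   // 1024
    }
}
```

Whether the real RPC layer can send from a shared buffer like this is an assumption; the sketch only illustrates the O(N) versus O(1) allocation pattern described above.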
I observed this behavior in Phoenix 4.14.1.