Details
Type: Improvement
Status: Patch Available
Priority: Major
Resolution: Unresolved
Description
When a cluster contains millions of DNs, GetDatanodeReport is expensive and puts heavy GC pressure on the NameNode.
When multiple NSs share those DNs through federation and the Router serves all of the NSs, the problem becomes more serious.
All of the NSs will go into GC at the same time.
RBF should cache the datanode report information and provide an option to disable the cache.
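
To make the proposal concrete, here is a minimal sketch of a time-based cache with a switch to turn it off. It is not the attached patch and does not use any Hadoop API: the class name `CachedReport`, the `ttlMillis` parameter, and the injected clock are all hypothetical, and the `Supplier` simply stands in for whatever call the Router makes to collect the datanode report from the NSs.

```java
import java.util.function.LongSupplier;
import java.util.function.Supplier;

/**
 * Hypothetical sketch: caches the result of an expensive report fetch
 * for a fixed time. A ttlMillis of zero or less disables caching.
 */
final class CachedReport<T> {
  private final Supplier<T> loader;   // stands in for the real report fetch
  private final long ttlMillis;       // <= 0 means "cache disabled"
  private final LongSupplier clock;   // injectable so expiry can be tested

  private T value;                    // last fetched report
  private long loadedAt;              // clock reading when value was fetched
  private boolean loaded;             // false until the first fetch

  CachedReport(Supplier<T> loader, long ttlMillis, LongSupplier clock) {
    this.loader = loader;
    this.ttlMillis = ttlMillis;
    this.clock = clock;
  }

  T get() {
    if (ttlMillis <= 0) {
      // Cache disabled: every call goes through to the loader.
      return loader.get();
    }
    synchronized (this) {
      long now = clock.getAsLong();
      if (!loaded || now - loadedAt >= ttlMillis) {
        // First call or entry expired: fetch once and remember the result.
        value = loader.get();
        loadedAt = now;
        loaded = true;
      }
      return value;
    }
  }
}
```

With a positive `ttlMillis`, repeated client requests within the window reuse one fetched report instead of each triggering a new fetch on every NS. Passing `0` restores the current uncached behavior. A real implementation would probably refresh in the background rather than block callers under a lock, but that choice is outside the scope of this sketch.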