Details
-
Sub-task
-
Status: Patch Available
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
I wrote a stress test with DFSNetworkTopology to verify its correctness under huge number of datanode changes e.g., data node insert/delete, storage addition/removal etc. The goal is to show that the topology maintains the correct counters all time. The test is written that, unless manually terminated, it will keep randomly performing the operations nonstop. (and because of this, the test is ignored in the patch).
My local test lasted 40 min before I stopped it, it involved more than one million datanode changes, and no error happened. We believe this should be sufficient to show the correctness of DFSNetworkTopology