Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
2.0.0
-
None
-
Spark cluster with 8 r3.8xl EC2 worker instances
Description
While debugging the performance of a large GraphX connected components computation, I found several places in the network-common and network-shuffle code bases where trace or debug log messages are constructed even if the respective log level is disabled. Refactoring the respective code to avoid these constructions except where necessary led to a modest but measurable reduction in task time, GC time and the ratio thereof.