Description
I ended up running some experiments integrating Nutch and Celeborn (Incubating) and it got me thinking about runtime timings. After some investigation I came across common-lang3's StopWatch Class which provides a convenient API for timings.
Seeing as we already declare the commons-lang3 dependency, I think StopWatch could help us clean up some timing logic in Nutch. Specifically, it would reduce redundancy in terms of duplicated code and logic. It would also open the door to introduce timing splits if anyone is so inclined to dig deeper into runtime timings.
A cursory search for "long start = System.currentTimeMillis();" returns hits for 32 files so it's fair to say that timing already affects lots of aspects of the Nutch execution workflow.
Attachments
Issue Links
- links to