Include but not limited to
1. Clean-up time
2. Time to complete one clean up loop Time.
3. Disk usage by logs before cleanup and After cleanup loop. ( Just like GC.?)
5. Search request Cnt: By category - Archived/non-archived
6. Search Request - Response time
7. Search Request - 0 result Cnt
8. Search Result - open files
9. File partial read count
10. File Download request Cnt/ And Size served
11. Disk IO by logviewer
12. CPU usage ( for unzipping files)
- Topology stormjar.ser/stormconf.ser/stormser.ser file upload time.
- Scheduler related metrics would be a long list generic and specific to different strategies.
- Most if not all cluster summary can be pushed as Metrics.
- Restart cnt
- Nimbus loss of leadership
- UI not responding (https://jira.ouroath.com/browse/YSTORM-4838)
- Negative resource scheduling events (https://jira.ouroath.com/browse/YSTORM-4940)
- Excessive scheduling time