Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
We've seen a strong need for validating running samza jobs programmatically. Consider the following scenarios:
1) running samza as a service (as in SAMZA-849), which requires automated job validation when upgrading the framework for large number of jobs;
2) production monitoring, automated validation can run periodically and trigger alerts;
3) integration testing, which can use validation tool to check the job status.
So a tool is needed to provide the checkout of job health. Minimally it will check:
- job submission
- app running
- container count
It should also be able to extend and validate the job metrics through metrics system, such as JMX. The user will be able to validate against important metrics such as process_count or message_behind_high_watermark.