Details
-
New Feature
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
None
-
None
-
None
Description
Consumers of HistoryLoggingService reported counters are currently required to compute any task-level aggregations other than "sum". This is inefficient as Tez is already "scanning" over this data. Computing incremental aggregates shouldn't require additional scans by ATS consumers.
Provide an option for Task counter aggregations other than "sum". Computation of these extra counters can be turned on/off.
The option will generate "synthetic" counters at final aggregation time for reporting to HistoryLoggingService, e.g. MAX_GC_TIME_MILLIS.
Only incremental aggregations will be supported (min/max/avg). Aggregation computation will be folded into the existing "aggregation loop" beginning at VertexImpl.incrTaskCounters.
Extra aggregations will only be supported during final counter aggregation.
Aggregations will only include the "bestAttempt" for each task.
A design doc will be provided.
Because final task aggregation holds a lock, a performance report will be provided.