Minimally, calling the total number of maps scheduled "Local map tasks" is confusing
Note that the counter will be incremented only if there is a locality at any level. Its different from calling all the maps as LOCAL. I agree that we should change the counter name. The intention is to show how many tasks got picked up from the cache. So it includes node-local, rack-local, switch-local etc. I think Aggregate Locality sounds better, comments?
For which users would this be valuable information?
In general when the users have cache topology more than 2 levels.
If there are more than two levels in the task cache and the distinction is significant, how is an aggregate counter resolving the ambiguity?
Consider a case where there is no locality at the node level and also at the rack level. As per the trunk there is no way to infer whether the scheduling went correctly or not. With this aggregate counter one can check if the maps were from the cache or not. So the counter is just a count of how may maps got picked up from the task cache.