I've attached a patch with the scripts I built at Facebook. There is also a readme with an explanation of what they do. Basically there are two scripts: One that just parses the history logs and jobconfs and puts the exact same data into MySQL, creating tables of jobs, jobconf XML key-value pairs, tasks and task attempts, and then a second script that performs some joins on these tables to create a set of job summary reports with ~1 KB of data per job that can be used to run queries quickly for the purposes of visualization. You can build a visualization based on this database using your favorite tool. Unfortunately I can't open source the one used at Facebook because it depends on a lot of internal web libraries.