YARN currently has no utility that lets cluster administrators review past activity in a Hadoop YARN cluster offline.
Because ATS is not widely deployed (it is not available on older clusters) and no single file or entity collects all application- and container-related information, we concluded that our best option was to parse and process the YARN daemon log files and reconstruct the history of the cluster from them. We designed and implemented a CLI-based solution that, after parsing the log files, lets users query application- and container-related information (listing, filtering by certain properties) and search for common errors such as CE failures and error codes, AM preemption, or stack traces. The tool can be integrated into the YARN project as a sub-project.
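
To illustrate the general idea of reconstructing per-application history from daemon logs, here is a minimal Python sketch. It is not the tool's actual implementation: the function names (`index_log`, `filter_lines`) and the sample log lines are hypothetical and made up for this example. The only thing it relies on is the well-known shape of YARN identifiers (`application_<clusterTimestamp>_<sequence>` and `container_[e<epoch>_]<clusterTimestamp>_<sequence>_<attempt>_<container>`).

```python
import re
from collections import defaultdict

# YARN ID shapes; a container ID embeds the timestamp and sequence number
# of the application it belongs to.
APP_ID = re.compile(r"application_(\d+)_(\d+)")
CONTAINER_ID = re.compile(r"container_(?:e\d+_)?(\d+)_(\d+)_\d+_\d+")


def index_log(lines):
    """Group log lines by every application they mention, directly or via a container ID."""
    by_app = defaultdict(list)
    for line in lines:
        apps = {m.group(0) for m in APP_ID.finditer(line)}
        for m in CONTAINER_ID.finditer(line):
            # Derive the owning application ID from the container ID.
            apps.add(f"application_{m.group(1)}_{m.group(2)}")
        for app in apps:
            by_app[app].append(line.rstrip("\n"))
    return by_app


def filter_lines(by_app, app_id, keyword):
    """Return the lines recorded for app_id that contain keyword (case-insensitive)."""
    needle = keyword.lower()
    return [line for line in by_app.get(app_id, []) if needle in line.lower()]


# Hypothetical sample lines, invented for this sketch (not real daemon output).
SAMPLE_LOG = [
    "2024-01-01 10:00:00,000 INFO Example: application_1700000000000_0001 submitted",
    "2024-01-01 10:00:05,000 WARN Example: container_e12_1700000000000_0001_01_000002 exited with code 143",
    "2024-01-01 10:00:06,000 INFO Example: application_1700000000000_0002 submitted",
]

if __name__ == "__main__":
    index = index_log(SAMPLE_LOG)
    for app_id in sorted(index):
        print(app_id, len(index[app_id]))
    print(filter_lines(index, "application_1700000000000_0001", "exited"))
```

Indexing by application first keeps later queries (listing, filtering by a property, searching for an error keyword) as simple lookups over an in-memory map, which suits an offline, single-pass use of the log files.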