Details
-
New Feature
-
Status: Resolved
-
Major
-
Resolution: Won't Fix
-
2.6.0
-
None
-
None
-
None
Description
Turning on log aggregation allows users to easily store container logs in HDFS and subsequently view them in the YARN web UIs from a central place. Currently, there is a separate log file for each Node Manager. This can be a problem for HDFS if you have a cluster with many nodes as you’ll slowly start accumulating many (possibly small) files per YARN application. The current “solution” for this problem is to configure YARN (actually the JHS) to automatically delete these files after some amount of time.
We should improve this by compacting the per-node aggregated log files into one log file per application.
Attachments
Attachments
Issue Links
- is related to
-
MAPREDUCE-6375 Modify the JHS to be able to read the ConcatenatableAggregatedLogFormat
- Resolved
-
MAPREDUCE-6415 Create a tool to combine aggregated logs into HAR files
- Resolved
- relates to
-
YARN-2548 Find a more scalable way to handle logs for long running service
- Open