Details
Type: Bug
Status: Patch Available
Priority: Critical
Resolution: Unresolved
Description
Steps to reproduce
- Start 2 NodeManagers with NM recovery enabled
- Submit a pi job with 20 maps
- Once 5 maps have completed on NM1, stop NM1 (yarn --daemon stop nodemanager)
(The logs of all completed containers are aggregated to HDFS)
- Start NM1 again and wait for the job to complete
The newly assigned container logs on NM1 are not shown
HDFS log directory state
- When logs are aggregated to HDFS during the stop, the file is named localhost_38153
- When logs are aggregated after NM1 is restarted, the logs of the newly assigned containers are uploaded to a file named localhost_38153.tmp
In the History Server, the logs are not shown for the new task attempts
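One possible reading of the directory state above, sketched in Python: if a log reader treats the .tmp suffix as "upload still in progress" and skips such files, the logs of the newly assigned containers would stay hidden. That skipping behaviour is an assumption made for illustration and is not confirmed by this report; TMP_SUFFIX and visible_log_files are hypothetical names, not Hadoop APIs.

```python
# Assumption: files ending in ".tmp" are treated as in-progress uploads.
TMP_SUFFIX = ".tmp"


def visible_log_files(file_names):
    """Return the per-node log files a reader would show, skipping in-progress ones."""
    return [name for name in file_names if not name.endswith(TMP_SUFFIX)]


# File names taken from the HDFS log directory state described above.
app_log_dir = ["localhost_38153", "localhost_38153.tmp"]
print(visible_log_files(app_log_dir))
```

Under that assumption, only localhost_38153 is listed, which would match the symptom of missing logs for the new task attempts.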