Details
Type: Bug
Status: Closed
Priority: Blocker
Resolution: Fixed
0.15.0
None
None
Description
The JobTracker#TaskCommitQueue#run method only handles IOExceptions. Christian Kunz ran into a scenario where a job was stuck with all tasks in COMMIT_PENDING state and the stack traces showed that the "Task Commit Thread" wasn't even around.
The work-around is to model TaskCommitQueue#run along the lines of other long-running threads in the JobTracker (ExpireLaunchingTasks, ExpireTrackers etc.) to catch, log and ignore any Exception in a loop, so that a single unexpected failure cannot kill the thread.
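A minimal sketch of that catch-log-and-continue pattern follows. It is not the actual JobTracker code: the class ResilientCommitLoop, the CommitTask interface, and the counters are hypothetical names introduced only for illustration, and a plain BlockingQueue stands in for the real commit queue.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical illustration; none of these names come from Hadoop.
class ResilientCommitLoop {
    /** Hypothetical stand-in for one unit of commit work. */
    interface CommitTask {
        void commit() throws Exception;
    }

    private final BlockingQueue<CommitTask> queue = new LinkedBlockingQueue<>();
    private final AtomicInteger committed = new AtomicInteger();
    private final AtomicInteger failed = new AtomicInteger();
    private volatile boolean running = true;

    void add(CommitTask task) { queue.add(task); }
    void stop() { running = false; }
    int committedCount() { return committed.get(); }
    int failedCount() { return failed.get(); }

    /** Body of the long-running thread. */
    void run() {
        while (running) {
            try {
                CommitTask task = queue.take();   // blocks until work arrives
                task.commit();
                committed.incrementAndGet();
            } catch (InterruptedException ie) {
                // Interruption is treated as a shutdown request, not an error.
                Thread.currentThread().interrupt();
                return;
            } catch (Exception e) {
                // Catch, log and ignore: the loop keeps running, so later
                // tasks are still processed instead of staying pending.
                failed.incrementAndGet();
                System.err.println("Task Commit Thread got exception: " + e);
            }
        }
    }

    public static void main(String[] args) {
        ResilientCommitLoop loop = new ResilientCommitLoop();
        loop.add(() -> { });                                          // succeeds
        loop.add(() -> { throw new IllegalStateException("boom"); }); // unexpected failure
        loop.add(() -> { });                                          // still runs afterwards
        loop.add(loop::stop);                                         // ends the demo loop
        loop.run();
        System.out.println("committed=" + loop.committedCount()
                + " failed=" + loop.failedCount());
    }
}
```

If the catch clause only named IOException, the IllegalStateException in the second task would propagate out of run() and end the thread, leaving the remaining tasks queued forever. That mirrors the stuck COMMIT_PENDING state described above.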