Description
If the JT restarts or dies and running jobs are lost or the JT is not reachable, Oozie ActionCheckXCommand will never fail the workflow job.
There seem to be 2 issues here:
- convertException is not receiving the root cause exception anytmore, but alway HadoopAccessorException wrapping the root cause exception. We should modify the convertException to inspect the cause exception as well.
- ActionCheckXCommand does not do the handle retry logic of ActionStartXCommand.
Attachments
Attachments
Issue Links
- is depended upon by
-
OOZIE-1005 Tests from OOZIE-994 use wrong condition in waitFor
- Closed
- relates to
-
OOZIE-1011 Tests from OOZIE-994 fail when run against Hadoop trunk
- Closed