Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Not A Problem
-
1.11.3
-
None
-
None
Description
If a non-retryable error (e.g. the savepoint is corrupted or unaccessible) occurs during the initiation of the job manager, the job cluster exits with an error code. But since it does not mark the attempt as failed, it won't be count as a failed attempt, and YARN will keep retrying forever.