Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
1.0.0
-
None
Description
ApplicationState.MAX_NUM_RETRY, controlling the number of times standalone Executors can exit unsuccessfully before Master will remove the Application that the Executors are trying to run, is currently hard-coded to 10. There's no reason why this should be a single, fixed value for all standalone clusters (e.g., it should probably scale with the number of Executors), so it should be SparkConf-able.
Attachments
Issue Links
- is duplicated by
-
SPARK-16956 Make ApplicationState.MAX_NUM_RETRY configurable
- Resolved
- relates to
-
SPARK-2425 Standalone Master is too aggressive in removing Applications
- Closed
- links to