[SPARK-16956] Make ApplicationState.MAX_NUM_RETRY configurable - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.6.3, 2.0.1, 2.1.0
Component/s: Deploy
Labels:
None

Target Version/s:

1.6.3, 2.0.1

Description

The ApplicationState.MAX_NUM_RETRY setting, which controls the maximum number of back-to-back executor failures that the standalone cluster manager will tolerate before removing a faulty application, is currently a hardcoded constant (10), but there are use-cases for making it configurable (TBD in my PR). We should add a new configuration key to let users customize this.

Attachments

Issue Links

duplicates

SPARK-2424 ApplicationState.MAX_NUM_RETRY should be configurable

Resolved

links to

[Github] Pull Request #14544 (JoshRosen)

Activity

People

Assignee:: Josh Rosen

Reporter:: Josh Rosen

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 08/Aug/16 19:27

Updated:: 12/Sep/16 22:36

Resolved:: 09/Aug/16 18:25