  1. Spark
  2. SPARK-19702

Increase refuse_seconds timeout in the Mesos Spark Dispatcher

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 2.1.0
    • Fix Version/s: 2.2.0
    • Component/s: Mesos
    • Labels: None

    Description

      Due to the problem described in MESOS-6112 (https://issues.apache.org/jira/browse/MESOS-6112), running more than 5 Mesos frameworks concurrently can result in starvation. For example, running 10 dispatchers could result in 5 of them getting all the offers, even if they have no jobs to launch. We must increase the refuse_seconds timeout to solve this problem. Another option would have been to implement suppress/revive, but that can cause starvation due to the unreliability of Mesos RPC calls.
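
      To make the effect of the timeout concrete, here is a toy model (not the Mesos allocator, and not the actual patch) of how often an idle dispatcher is re-offered resources. It assumes the framework declines every offer with a refuse_seconds filter and that the allocator re-offers as soon as the filter expires. The function name, the 1-second allocation interval, and the 120-second value are illustrative assumptions; 5 seconds is the Mesos default for refuse_seconds.

```python
def offers_to_idle_framework(refuse_seconds, duration_seconds, allocation_interval=1):
    """Count offers sent to a framework that declines everything it is offered.

    Toy model only: one allocation cycle every `allocation_interval` seconds,
    and the framework is skipped while its refuse filter is still active.
    """
    offers = 0
    filtered_until = 0  # time (in seconds) until which the allocator skips us
    for now in range(0, duration_seconds, allocation_interval):
        if now >= filtered_until:
            offers += 1  # offer arrives and is declined immediately
            filtered_until = now + refuse_seconds
    return offers


if __name__ == "__main__":
    ten_minutes = 600
    # Default filter (5 s) versus a hypothetical longer one (120 s).
    print(offers_to_idle_framework(5, ten_minutes))
    print(offers_to_idle_framework(120, ten_minutes))
```

      Under these assumptions, a longer filter means an idle dispatcher holds offers far less often, which leaves more allocation cycles for frameworks that actually have jobs to launch.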

            People

              Assignee: mgummelt Michael Gummelt
              Reporter: mgummelt Michael Gummelt
              Votes: 0
              Watchers: 4
