SPARK-14180

Deadlock in CoarseGrainedExecutorBackend Shutdown


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Incomplete
    • Environment: master branch, commit d6dc12ef0146ae409834c78737c116050961f350

    Description

      I'm fairly certain that https://github.com/apache/spark/pull/11031 introduced a deadlock in executor shutdown. As a result, executor shutdown hangs indefinitely. On Mesos at least, the hang lasts until spark.mesos.coarse.shutdownTimeout (default 10s) elapses, at which point the driver stops, which force-kills the executors.
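
      One way to tell a lock-cycle deadlock apart from a merely slow shutdown is to ask the JVM directly, either with a jstack thread dump of the hung executor or programmatically through ThreadMXBean. The following is a minimal, generic Java sketch of the programmatic check. It is not Spark code: the class DeadlockProbe, its method names, and the thread names are hypothetical, and it provokes its own two-thread deadlock purely so there is something to detect.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadMXBean;
import java.util.concurrent.CountDownLatch;

// Generic illustration only; not Spark code. All names are hypothetical.
class DeadlockProbe {

    // Starts a daemon thread that takes `first`, waits until both threads
    // hold their first lock, then blocks forever trying to take `second`.
    private static void startLocker(String name, Object first, Object second,
                                    CountDownLatch bothHoldFirst) {
        Thread t = new Thread(() -> {
            synchronized (first) {
                bothHoldFirst.countDown();
                try {
                    bothHoldFirst.await();
                } catch (InterruptedException e) {
                    return;
                }
                synchronized (second) {
                    // Never reached: the other thread already holds `second`.
                }
            }
        }, name);
        t.setDaemon(true); // let the JVM exit even though the threads are stuck
        t.start();
    }

    // Provokes a lock-ordering deadlock between two threads and returns how
    // many threads the JVM reports as deadlocked (0 if none within ~5 s).
    static int provokeAndCount() throws InterruptedException {
        Object lockA = new Object();
        Object lockB = new Object();
        CountDownLatch bothHoldFirst = new CountDownLatch(2);
        startLocker("locker-1", lockA, lockB, bothHoldFirst);
        startLocker("locker-2", lockB, lockA, bothHoldFirst);

        ThreadMXBean mx = ManagementFactory.getThreadMXBean();
        for (int i = 0; i < 50; i++) {
            long[] ids = mx.findDeadlockedThreads(); // null when no deadlock
            if (ids != null) {
                return ids.length;
            }
            Thread.sleep(100);
        }
        return 0;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("deadlocked threads: " + provokeAndCount());
    }
}
```

      Note that findDeadlockedThreads() only reports cycles over monitors and ownable synchronizers. If the hang is instead a thread waiting on a future or on a shutdown hook that never completes, this call returns null and a full thread dump is the more useful tool.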

      The deadlock is as follows:

          People

            Assignee: Unassigned
            Reporter: Michael Gummelt (mgummelt)
            Votes: 0
            Watchers: 3
