Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-23626

DAGScheduler blocked due to JobSubmitted event

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.2.1, 2.3.3, 2.4.3, 3.0.0
    • 3.1.3, 3.0.4, 3.2.1, 3.3.0
    • Scheduler, Spark Core
    • None

    Description

      DAGScheduler becomes a bottleneck in cluster when multiple JobSubmitted events has to be processed as DAGSchedulerEventProcessLoop is single threaded and it will block other tasks in queue like TaskCompletion.

      The JobSubmitted event is time consuming depending on the nature of the job (Example: calculating parent stage dependencies, shuffle dependencies, partitions) and thus it blocks all the events to be processed.

       

      I see multiple JIRA referring to this behavior

      https://issues.apache.org/jira/browse/SPARK-2647

      https://issues.apache.org/jira/browse/SPARK-4961

       

      Similarly in my cluster some jobs partition calculation is time consuming (Similar to stack at SPARK-2647) hence it slows down the spark DAGSchedulerEventProcessLoop which results in user jobs to slowdown, even if its tasks are finished within seconds, as TaskCompletion Events are processed at a slower rate due to blockage.

      Attachments

        Issue Links

          Activity

            People

              joshrosen Josh Rosen
              ajithshetty Ajith S
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: