Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-21171

Speculate task scheduling block dirve handle normal task when a job task number more than one hundred thousand

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Invalid
    • 2.0.0
    • None
    • Block Manager, Spark Core
    • None
    • We have more than two hundred high-performance machine to handle more than 2T data by one query

    Description

      If a job have more then one hundred thousand tasks and spark.speculation is true, when speculable tasks start, choosing a speculable will waste lots of time and block other tasks. We do a ad-hoc query for data analyse, we can't tolerate one job wasting time even it is a large job

      Attachments

        Activity

          People

            Unassigned Unassigned
            tygxk wangminfeng
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: