Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-29994

Add WILDCARD task location

    XMLWordPrintableJSON

Details

    • Task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.0.0
    • None
    • Spark Core
    • None

    Description

      We have a "locality wait time" configuration that allows a task set to downgrade locality requirement after a certain time has passed. Yet, tasks all differ in penalty of locality miss. We propose a WILDCARD task location for tasks that do have a preferred location but would rather be assigned sooner than wait if the desired location is not available.

      This requirement comes from LocalShuffledRowRDD. When the number of initial mappers of LocalShuffledRDD is smaller than the number of worker nodes, it can cause serious regressions if short-running tasks all wait on their preferred locations while they could have otherwise finished quickly on non-preferred locations too.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              maryannxue Wei Xue
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated: