Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-24992

spark should randomize yarn local dir selection

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 2.3.1
    • 2.4.0
    • Spark Core
    • None

    Description

      Utils.getLocalDir is used to get path of a temporary directory. However, it always returns the the same directory, which is the first element in the array localRootDirs. When running on YARN, this might causes the case that we always write to one disk, which makes it busy while other disks are free. We should randomize the selection to spread out the loads. 

      Attachments

        Activity

          People

            hthuynh2 Hieu Tri Huynh
            hthuynh2 Hieu Tri Huynh
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: