Spark / SPARK-3570

Shuffle write time does not include time to open shuffle files

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.9.2, 1.0.2, 1.1.0
    • Fix Version/s: 1.3.1, 1.4.0
    • Component/s: Spark Core
    • Labels: None

    Description

      Currently, the reported shuffle write time does not include the time to open the shuffle files. This time can be very significant when the disk is highly utilized and many shuffle files exist on the machine. (I'm not sure how severe this is in 1.0 onward: since shuffle files are automatically deleted, this may be less of an issue because there are fewer old files sitting around.) In experiments I did, in extreme cases, adding the time to open files increased the shuffle write time from 5 ms (of a 2-second task) to 1 second. We should fix this for better performance debugging.

      Thanks shivaram for helping to diagnose this problem. cc pwendell
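
      To illustrate the gap described above, here is a minimal sketch (not Spark's actual code) that times a file write two ways: from just after the file is opened, and from just before. The function name `timed_shuffle_write` and the file name are hypothetical, chosen only for this example. On an idle disk the two numbers are close; the report above is about the case where the open itself is slow.

```python
import os
import tempfile
import time


def timed_shuffle_write(path, payload):
    """Write payload to path.

    Returns (write_only_ns, open_plus_write_ns). The first value mimics a
    metric that starts timing after the file is already open; the second
    also charges the open call to the write time.
    """
    start = time.perf_counter_ns()
    f = open(path, "wb")  # open cost: invisible to a timer started below
    opened = time.perf_counter_ns()
    try:
        f.write(payload)
        f.flush()
    finally:
        f.close()
    end = time.perf_counter_ns()
    return end - opened, end - start


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        # Hypothetical file name, for illustration only.
        target = os.path.join(tmp, "shuffle_example")
        write_ns, total_ns = timed_shuffle_write(target, b"x" * 1024)
        print(f"write only: {write_ns} ns, open + write: {total_ns} ns")
```

      The difference between the two values is the time that goes unreported when only the write itself is measured.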

      Attachments

        1. 3a_1410854905_0_job_log_waterfall.pdf
          131 kB
          Kay Ousterhout
        2. 3a_1410957857_0_job_log_waterfall.pdf
          135 kB
          Kay Ousterhout

          People

            Assignee: Kay Ousterhout (kayousterhout)
            Reporter: Kay Ousterhout (kayousterhout)
            Votes: 0
            Watchers: 2
