Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-18733

Spark history server file cleaner excludes in-progress files

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • 2.0.2
    • None
    • Web UI
    • None

    Description

      When we restart history server, it does spend a lot of time to load/replay incomplete applications which mean the inprogress log files in the log folder.

      We have already enabled "spark.history.fs.cleaner.enabled" but seems like it's skipping the inprogress files.

      I checked the log folder and saw that there are many old orphan files. Probably files left over due to spark-driver failures or OOMs.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              eseyfe Ergin Seyfe
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: