Spark / SPARK-18733

Spark history server file cleaner excludes in-progress files


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 2.0.2
    • Fix Version/s: None
    • Component/s: Web UI
    • Labels: None

    Description

      When we restart the history server, it spends a lot of time loading and replaying incomplete applications, that is, the in-progress log files in the log folder.

      We have already enabled "spark.history.fs.cleaner.enabled", but the cleaner seems to skip the in-progress files.

      I checked the log folder and found many old, orphaned files. They were probably left behind by Spark driver failures or OOMs.
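
      As a rough way to confirm how many orphaned files are involved, the sketch below lists in-progress event logs that have not been modified for a given period. It is not part of Spark. It assumes the log folder is reachable as a local directory (an HDFS location would need a different client), that in-progress logs carry the ".inprogress" suffix, and the function name and age threshold are hypothetical choices for illustration.

```python
import os
import time

# Suffix Spark appends to event logs of applications that have not finished.
INPROGRESS_SUFFIX = ".inprogress"


def find_stale_inprogress(log_dir, max_age_seconds, now=None):
    """Return sorted paths of .inprogress files older than max_age_seconds.

    Hypothetical helper: it only reports candidates and deletes nothing.
    """
    now = time.time() if now is None else now
    stale = []
    for entry in os.scandir(log_dir):
        # Skip subdirectories and completed (non-.inprogress) logs.
        if not entry.is_file() or not entry.name.endswith(INPROGRESS_SUFFIX):
            continue
        # Age is measured from the last modification time.
        if now - entry.stat().st_mtime > max_age_seconds:
            stale.append(entry.path)
    return sorted(stale)
```

      For example, find_stale_inprogress("/tmp/spark-events", 7 * 24 * 3600) would report in-progress logs untouched for a week. The path is only an example; review the result before removing anything, since a long-running application may still own one of these files.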



          People

            Assignee: Unassigned
            Reporter: Ergin Seyfe (eseyfe)
            Votes: 0
            Watchers: 5

            Dates

              Created:
              Updated:
              Resolved:
