Spark / SPARK-18968

.sparkStaging quickly fills up HDFS


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 1.6.2
    • Fix Version/s: None
    • Component/s: DStreams
    • Labels: None

    Description

      We are running streaming jobs using Spark. Even though "spark.yarn.preserve.staging.files" is set to "false", HDFS quickly fills up.

      We also found a similar question on the user mailing list, but it received no further response: http://apache-spark-user-list.1001560.n3.nabble.com/HDFS-folder-sparkStaging-not-deleted-and-filled-up-HDFS-in-yarn-mode-td7851.html


            People

              Assignee: Unassigned
              Reporter: Chen He (airbots)
              Votes: 0
              Watchers: 1