  1. Spark
  2. SPARK-27254

Clean up complete but invalid output files in ManifestFileCommitProtocol when a job is aborted

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 3.0.0
    • Fix Version/s: 3.0.0
    • Component/s: Structured Streaming
    • Labels: None

    Description

      ManifestFileCommitProtocol doesn't clean up complete output files when a job is aborted, even though those files become invalid because they are never committed.

      On job abort, ManifestFileCommitProtocol performs no cleanup at all; it only maintains the metadata that lists which complete output files have been written. SPARK-27210 addressed cleanup at the task level, but nothing is cleaned up at the job level yet.
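
The job-level cleanup described above can be illustrated with a small toy model. This is only a sketch under stated assumptions, not Spark's actual implementation: the class `ManifestCommitSketch`, its methods, and the `pending_files` list are hypothetical names chosen for illustration, and the local filesystem stands in for the real output store.

```python
import os
import tempfile


class ManifestCommitSketch:
    """Toy model of a manifest-based commit protocol (hypothetical, not Spark's class)."""

    def __init__(self, output_dir):
        self.output_dir = output_dir
        self.pending_files = []  # complete files reported by committed tasks
        self.manifest = []       # files made valid by a successful job commit

    def commit_task(self, file_name, data):
        # A task finishes writing a complete file and reports its path.
        path = os.path.join(self.output_dir, file_name)
        with open(path, "w") as f:
            f.write(data)
        self.pending_files.append(path)
        return path

    def commit_job(self):
        # On success, the complete files are recorded in the manifest.
        self.manifest.extend(self.pending_files)
        self.pending_files = []

    def abort_job(self):
        # Job-level cleanup: these files are complete but will never be listed
        # in the manifest, so delete them on a best-effort basis.
        for path in self.pending_files:
            try:
                os.remove(path)
            except OSError:
                pass  # best effort: ignore files that cannot be removed
        self.pending_files = []


def demo_abort():
    # Two tasks commit, then the job aborts: no output files should remain.
    with tempfile.TemporaryDirectory() as out:
        protocol = ManifestCommitSketch(out)
        protocol.commit_task("part-00000", "a")
        protocol.commit_task("part-00001", "b")
        protocol.abort_job()
        return sorted(os.listdir(out)), protocol.manifest


def demo_commit():
    # Two tasks commit and the job succeeds: files stay and are listed.
    with tempfile.TemporaryDirectory() as out:
        protocol = ManifestCommitSketch(out)
        protocol.commit_task("part-00000", "a")
        protocol.commit_task("part-00001", "b")
        protocol.commit_job()
        listed = [os.path.basename(p) for p in protocol.manifest]
        return sorted(os.listdir(out)), listed
```

Without an `abort_job` step like the one sketched here, the files written by already-committed tasks would stay in the output directory even though no manifest entry ever refers to them.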

            People

              Assignee: Jungtaek Lim (kabhwan)
              Reporter: Jungtaek Lim (kabhwan)
              Votes: 0
              Watchers: 2

              Dates

                Created:
                Updated:
                Resolved: