Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Won't Fix
-
3.2.0
-
None
-
None
-
None
Description
the S3A committers rely on an atomic PUT to save a JSON summary of the job to the dest FS, containing files, statistics, etc. This is for internal testing, but it turns out to be useful for spark integration testing, Hive, etc.
IBM's stocator also generated a manifest.
Proposed: come up with (an extensible) design that we are happy with as a long lived format.
Attachments
Issue Links
- relates to
-
HIVE-16295 Add support for using Hadoop's S3A OutputCommitter
- Patch Available
-
SPARK-23977 Add commit protocol binding to Hadoop 3.1 PathOutputCommitter mechanism
- Resolved