Details
-
Improvement
-
Status: In Progress
-
Major
-
Resolution: Unresolved
-
1.14.0
-
None
-
None
-
Patch
Description
ParquetOutputFormat should support custom OutputCommitter.
There is a need to bypass current Hadoop functionality of writing output data under _temporary folder. Especially with AWS S3, there can be huge overhead of moving the files from _temporary folder to output folder.
Attachments
Issue Links
- is depended upon by
-
PARQUET-2486 Improve Parquet IO Performance within cloud datalakes
- In Progress
- links to