Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
Description
With large files, it is possible that multiple splits are created in the same file. With current codebase, "ParquetRecordReaderBase" ends up reading file footer for each split.
It can be optimized not to read footer information multiple times for the same file.
Attachments
Attachments
Issue Links
- relates to
-
HADOOP-18028 High performance S3A input stream with prefetching & caching
- Open
- links to