Details
Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Fix Version/s: 1.17.0
Description
The current Excel format plugin reads files with the standard Apache POI reader, which consumes excessive memory. As a result, reading large Excel files causes out-of-memory errors.
This PR converts the format plugin to use a streaming reader that is still based on the POI library. The streaming reader is documented at [1].
All unit tests pass, and I tested the plugin locally with several large Excel files.
[1]: https://github.com/pjfanning/excel-streaming-reader
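
For readers unfamiliar with the difference, here is a minimal sketch of the general idea behind streaming a sheet instead of loading it whole. It is not the plugin's code and does not use the excel-streaming-reader API; it uses only the JDK's StAX pull parser on a simplified, hand-written worksheet XML fragment. The class name StreamingSheetSketch and the method streamRows are hypothetical, and the sketch assumes every cell stores its value inline in a <v> element (real XLSX files also use a shared-strings table, which is ignored here).

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical illustration only: not Drill's reader and not the library in [1].
public class StreamingSheetSketch {

  // Pull-parses worksheet XML and hands each completed row to the callback.
  // Only the row currently being built is held by this method.
  public static int streamRows(InputStream sheetXml, Consumer<List<String>> onRow)
      throws XMLStreamException {
    XMLStreamReader reader = XMLInputFactory.newInstance().createXMLStreamReader(sheetXml);
    List<String> currentRow = null;
    StringBuilder value = new StringBuilder();
    boolean inValue = false;
    int rowCount = 0;
    try {
      while (reader.hasNext()) {
        int event = reader.next();
        if (event == XMLStreamConstants.START_ELEMENT) {
          String name = reader.getLocalName();
          if ("row".equals(name)) {
            currentRow = new ArrayList<>();
          } else if ("v".equals(name)) {
            inValue = true;
            value.setLength(0);
          }
        } else if (event == XMLStreamConstants.CHARACTERS && inValue) {
          // Text may arrive in several chunks, so append rather than assign.
          value.append(reader.getText());
        } else if (event == XMLStreamConstants.END_ELEMENT) {
          String name = reader.getLocalName();
          if ("v".equals(name)) {
            inValue = false;
            if (currentRow != null) {
              currentRow.add(value.toString());
            }
          } else if ("row".equals(name) && currentRow != null) {
            onRow.accept(currentRow); // the caller decides whether to keep the row
            currentRow = null;
            rowCount++;
          }
        }
      }
    } finally {
      reader.close();
    }
    return rowCount;
  }

  public static void main(String[] args) throws Exception {
    // Simplified stand-in for a worksheet part of an XLSX file.
    String xml = "<worksheet><sheetData>"
        + "<row><c><v>1</v></c><c><v>2.5</v></c></row>"
        + "<row><c><v>3</v></c></row>"
        + "</sheetData></worksheet>";
    InputStream in = new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8));
    int rows = streamRows(in, row -> System.out.println(row));
    System.out.println(rows + " rows read");
  }
}
```

Because each row is passed to the callback and then dropped, memory use in this sketch depends on the widest row rather than on the size of the file, which is the property that matters for large Excel files.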
Issue Links
- fixes DRILL-7423 Create More Efficient Way to Read Excel Cells (Resolved)
- links to