Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
1.8.2
-
None
-
None
-
Patch, Important
Description
When processing big number of small files (with the same schema header), reading/writing each file need to read/write the head and parsing the json script, which is very slow.
If adding a new constructor in DataFileReader and DataFileWriter that allows pass in already parsed Schema object/script, then it will greatly improve the reading/writing performance for such cases.