Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
None
Description
Based on profiling from ARROW-10308, UTF8 validation is a bottleneck of CSV string conversion.
This is because we must validate many small UTF8 strings individually.
Attachments
Issue Links
- relates to
-
ARROW-10308 [Python] read_csv from python is slow on some work loads
- Open
- links to