Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
7.0.0
Description
There is a flag enable_parallel_column_conversion which was passed down from python to C++ when reading parquet datasets which controlled whether we would read columns in parallel. This was allowed for single files but not for reading multiple files. This was an old check to help prevent nested deadlock.
Nested deadlock is no longer an issue and the flag was mostly inert once we removed the synchronous scanner.
Unfortunately, when we removed the synchronous scanner we forgot to remove this flag and the result was that a single-file read ended up disabling parallelism.
Attachments
Issue Links
- links to