Details
-
Improvement
-
Status: Closed
-
Minor
-
Resolution: Not A Bug
-
0.15.0
-
None
-
None
Description
As part of the recent reader work we introduced
// arrow::record_batch::RecordBatchReader
but in datafusion we have
// datafusion::physical_plan::BatchIterator
These two trait are almost identical (BatchIterator implements Send + Sync whereas RecordBatchReader does not). I propose we replace RecordBatchReader with BatchIterator (i.e. move it to arrow as it's generally useful outside of datafusion) and update parquet and data fusion accordingly.
andygrove liurenjie1024 do you see any issues with this?