Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
If the write destination is slow then the dataset writer should back off and apply backpressure to pause the reader. This will allow simple dataset API scans to operate on large out of core datasets.
This is dependent on ARROW-13611 which adds a backpressure feature for regular scanning (but not writing data) and on ARROW-13542 which moves the dataset write to a node in the exec plan.
Attachments
Issue Links
- depends upon
-
ARROW-13611 [C++] Scanning datasets does not enforce back pressure
- Resolved
-
ARROW-13542 [C++][Compute][Dataset] Add dataset::WriteNode for writing rows from an ExecPlan to disk
- Resolved
- links to