Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
3.0.0
Description
Per discussion on https://docs.google.com/document/d/1AyTdLU-RxA-Gsb9EsYnrQrmqPMOYMfPlWwxRi1Is1tQ
Add an ExecNode interface with which a streaming execution graph can be constructed. Initial concrete classes will include:
- ScanNode, which wraps a dataset and is a pure emitter of batches (initially, this will only wrap memory sized datasets such as tables. See
ARROW-11930) - FilterNode, which evaluates an expression on inputs and based on the result removes rows from batches (eventually, this may defer materialization of the selection to other kernels. See ARROW-5005 ARROW-10474)
- ProjectNode, which evaluates expressions on inputs producing new columns.
- GroupedAggregateNode, which computes aggregations grouped on one or more keys.
Attachments
Issue Links
- is a child of
-
ARROW-8894 [C++] C++ array kernels framework and execution buildout (umbrella issue)
- Open
-
ARROW-12633 [C++] Query engine umbrella issue
- Open
- supercedes
-
ARROW-7878 [C++] Implement LogicalPlan and LogicalPlanBuilder
- Closed
- links to