[ARROW-9707] [Rust] [DataFusion] Re-implement threading model - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Closed
Priority: Major
Resolution: Invalid
Affects Version/s: None
Fix Version/s: 3.0.0
Component/s: Rust, Rust - DataFusion
Labels:
- pull-request-available

External issue URL:
https://github.com/apache/arrow/issues/17305

Description

The current threading model is very simple and does not scale. We currently use 1-2 dedicated threads per partition and they all run simultaneously, which is a huge problem if you have more partitions than logical or physical cores.

This task is to re-implement the threading model so that query execution uses a fixed (configurable) number of threads. Work will be broken down into stages and tasks and each in-process executor (running on a dedicated thread) will process its queue of tasks.

This process will be driven by a scheduler.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

image-2020-09-24-22-46-46-959.png
24/Sep/20 20:46
252 kB
Adam Lippai

Issue Links

relates to

ARROW-10303 [Rust] Parallel type transformation in CSV reader

Closed

links to

GitHub Pull Request #8283

Activity

People

Assignee:: Andy Grove

Reporter:: Andy Grove

Votes:: 0 Vote for this issue

Watchers:: 7 Start watching this issue

Dates

Created:: 12/Aug/20 15:25

Updated:: 11/Jan/23 08:08

Resolved:: 14/Nov/20 16:00

Time Tracking

Estimated:

Not Specified

Remaining:

Logged: