[IMPALA-9637] Scan range load-balancing within backend - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Duplicate
Affects Version/s: Impala 4.0.0
Fix Version/s: None
Component/s: Distributed Exec
Labels:
- multithreading
- performance

Target Version: Product Backlog

Description

Currently the scheduler statically divides scan ranges between fragment instances. Since ~~IMPALA-9015~~ it statically load-balances scan ranges based on file size, using the LPT (longest processing time first) algorithm in the scheduler.
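For readers unfamiliar with LPT, here is a minimal sketch of the heuristic in Python. It is an illustration only, not the Impala scheduler code: the function name `lpt_assign` and the sample sizes are made up, and it assumes file size is the only cost signal.

```python
import heapq

def lpt_assign(range_sizes, num_instances):
    """Assign scan ranges to instances with the LPT heuristic.

    Hypothetical helper for illustration. Returns one list of range
    sizes per instance.
    """
    # Min-heap of (assigned_bytes, instance_index): least-loaded instance on top.
    heap = [(0, i) for i in range(num_instances)]
    heapq.heapify(heap)
    assignment = [[] for _ in range(num_instances)]
    # Longest processing time first: place the largest ranges before the small ones.
    for size in sorted(range_sizes, reverse=True):
        load, idx = heapq.heappop(heap)
        assignment[idx].append(size)
        heapq.heappush(heap, (load + size, idx))
    return assignment

# Made-up scan range sizes in MB, split across 3 fragment instances.
sizes_mb = [512, 256, 256, 128, 128, 64, 64, 32]
plan = lpt_assign(sizes_mb, 3)
loads = [sum(p) for p in plan]
print(loads)  # [512, 480, 448]
```

The assignment is fixed before execution starts, so it is only as good as the assumption that bytes scanned predict work done.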

This has various pitfalls:

- It interacts badly with dynamic partition pruning, which can filter out many scan ranges and unbalance the load.
- Different files that have the same byte size may involve different amounts of work to process, for any number of reasons.
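The first pitfall can be made concrete with a small sketch. The plan below is a hypothetical static assignment (file names, sizes, and the `loads` helper are all invented for illustration) that is well balanced by bytes until a runtime filter prunes one partition.

```python
# Hypothetical static plan: instance -> list of (file_name, size_mb).
static_plan = {
    0: [("p1/a", 512)],
    1: [("p2/b", 256), ("p2/c", 128), ("p3/d", 64), ("p3/e", 32)],
    2: [("p2/f", 256), ("p3/g", 128), ("p3/h", 64)],
}

def loads(plan, pruned_prefixes=()):
    """MB each instance actually scans once pruned partitions are skipped."""
    return {
        inst: sum(size for name, size in ranges
                  if not name.startswith(tuple(pruned_prefixes)))
        for inst, ranges in plan.items()
    }

before = loads(static_plan)            # {0: 512, 1: 480, 2: 448}
after = loads(static_plan, ["p1/"])    # {0: 0, 1: 480, 2: 448}
print(before, after)
```

After pruning, instance 0 has nothing to do while the other two still carry their full share, and a static plan has no way to hand work back to it.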

These pitfalls can cause both inter-node and intra-node load-balance problems. This Jira is about fixing the intra-node load-balance problem, so that the situation is no worse than before mt_dop.

The proposed solution is to have a queue of scan ranges per backend, sorted from largest to smallest, and have each instance pull scan ranges off that queue. The DiskIOMgr ReaderContext is probably already sufficient to solve this problem for file-based scans, but we'll need to add a different mechanism for Kudu, HBase, etc.
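The effect of the proposed queue can be sketched with a small deterministic simulation. This is not DiskIOMgr code: `simulate_shared_queue`, the `(size, actual_cost)` pairs, and the numbers are assumptions chosen to show files of equal size with unequal processing cost.

```python
import heapq
from collections import deque

def simulate_shared_queue(ranges, num_instances):
    """Simulate instances pulling from one per-backend queue.

    `ranges` is a list of (size, actual_cost) pairs. The queue is ordered by
    size, largest first, because the real cost is not known in advance.
    Returns the busy time of each instance.
    """
    pending = deque(sorted(ranges, key=lambda r: r[0], reverse=True))
    # (time_when_free, instance_index): the instance that frees up first pulls next.
    free_at = [(0, i) for i in range(num_instances)]
    heapq.heapify(free_at)
    busy = [0] * num_instances
    while pending:
        now, idx = heapq.heappop(free_at)
        _size, cost = pending.popleft()
        busy[idx] += cost
        heapq.heappush(free_at, (now + cost, idx))
    return busy

# Six ranges of identical size; two of them are ten times more expensive.
ranges = [(100, 10), (100, 1), (100, 10), (100, 1), (100, 1), (100, 1)]

# Static split by size alone: sizes are equal, so ranges simply alternate.
static_busy = [sum(cost for _, cost in ranges[i::2]) for i in range(2)]
dynamic_busy = simulate_shared_queue(ranges, 2)
print(static_busy, dynamic_busy)  # [21, 3] [12, 12]
```

In this made-up case the static split leaves one instance busy seven times longer than the other, while pulling from the shared queue evens them out because an instance that finishes early simply takes the next range.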

Issue Links

duplicates

IMPALA-9654 Intra-node execution skew increase with mt_dop (In Progress)

People

Assignee: Unassigned

Reporter: Tim Armstrong

Votes: 0

Watchers: 1

Dates

Created: 09/Apr/20 20:53

Updated: 15/Dec/20 19:54

Resolved: 15/Dec/20 19:54