[IMPALA-6115] Investigate scheduling options for block metadata loading threads - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Open
Priority: Minor
Resolution: Unresolved
Affects Version/s: Impala 2.11.0
Fix Version/s: None
Component/s: Catalog
Labels:
- metadata
- performance

Epic Color:
ghx-label-4

Description

In the current implementation of ~~IMPALA-5429~~, we spin up an executor pool of threads every time we load the table and close it once the load has finished/failed. We need to investigate the following options and see if they can give better throughput.

1. Single thread pool for the process lifetime to minimize the startup and teardown latency and all the block metadata loading tasks are submitted to it.
2. Clustered scheduling of threads rather than submitting them all at once.
3. Any other scheduling options that guarantee fairness to all the tables and are more performant.

Attachments

Issue Links

is related to

IMPALA-5429 Use a thread pool to load block metadata in parallel

Resolved

Activity

People

Assignee:: Unassigned

Reporter:: Bharath Vissapragada

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 26/Oct/17 23:04

Updated:: 07/Feb/23 11:09