Details
-
Improvement
-
Status: Open
-
Minor
-
Resolution: Unresolved
-
Impala 2.3.0
-
None
Description
In our BE execution, we currently call QueryMaintenance() and check for cancellation very generously, e.g., at the beginning of every GetNext() or in tight loops that could run for a long time (e.g. due to selective predicates).
Our current pattern of usage has several problems:
1. QueryMaintenance() is expensive, but the function name is rather opaque to this fact. It traverses the mem tracker hierarchy checking for the mem limit, and also checks the query status under a spin lock. It is not clear that these checks need to be coupled and called so frequently. We should place these checks judiciously/appropriately because QueryMaintenance() can become a performance bottleneck especially in subplan execution.
2. We always couple QueryMaintenance() with checking for cancellation. There is an existing TODO in the code to address this (see runtime-state.cc RuntimeState::CheckQueryState()).
See these code reviews for additional discussion:
http://gerrit.cloudera.org:8080/894
http://gerrit.cloudera.org:8080/901
Attachments
Issue Links
- depends upon
-
IMPALA-5844 Fix management of FunctionContext "local" allocations.
- Resolved
- is blocked by
-
IMPALA-2384 Check memory limits in RowBatch constructor and RowBatch::Reset().
- Open
- is related to
-
IMPALA-6997 Query execution should notice UDF MemLimitExceeded errors more quickly
- Resolved
-
IMPALA-2712 Investigate increasing batch size
- Resolved
We should be able to get rid of the spin lock on this path altogether – Status is really just a single pointer. First we need to improve our Atomic implementation though because it's conservative on its used of locked instructions.