Details
- Improvement
- Status: Resolved
- Minor
- Resolution: Fixed
- Impala 2.2, Impala 2.3.0
- None
Description
We received reports of excessive codegen compilation/optimization times for very large expressions generated by visualization tools.
We should:
- Expose codegen optimization levels as query options. Currently there is only an all-or-nothing codegen query option. Overly complex expressions, such as hundreds of cascading conditions, likely take very long to compile and benefit very little from the O2 optimization level, yet they could still run significantly faster at O0 or O1 than when interpreted.
- Consider dropping to O1 (or turning off riskier passes individually) automatically for very large expressions.
- Consider parameterizing the compilation time limit and setting a reasonable default, say 10 seconds. If compilation takes longer than the preset limit, either disable codegen or reduce the optimization level to, say, O0.
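
The policy proposed in the list above could be sketched roughly as follows. This is only an illustration in Python, not Impala code: `CodegenPolicy`, `initial_opt_level`, `plan_codegen`, the node-count threshold of 1000, and the `compile_time_s` callback are all hypothetical names and values chosen for the example. Only the 10-second limit and the O2/O1/O0 levels come from the proposal itself.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class CodegenPolicy:
    """Hypothetical knobs; none of these are real Impala query options."""
    default_opt_level: int = 2               # O2 for ordinary expressions
    large_expr_opt_level: int = 1            # drop to O1 for very large expressions
    large_expr_node_threshold: int = 1000    # assumed cutoff for "very large"
    compile_time_limit_s: float = 10.0       # the "say 10 seconds" default
    fallback_opt_level: Optional[int] = 0    # O0; None means "disable codegen"


def initial_opt_level(policy: CodegenPolicy, expr_node_count: int) -> int:
    """Pick the starting optimization level from the expression size."""
    if expr_node_count > policy.large_expr_node_threshold:
        return policy.large_expr_opt_level
    return policy.default_opt_level


def plan_codegen(
    policy: CodegenPolicy,
    expr_node_count: int,
    compile_time_s: Callable[[int], float],
) -> Optional[int]:
    """Return the level to use, or None if codegen should be disabled.

    compile_time_s(level) stands in for compiling at that level and
    measuring how long it took; a real system would abort at the limit.
    """
    level = initial_opt_level(policy, expr_node_count)
    if compile_time_s(level) <= policy.compile_time_limit_s:
        return level
    # Over the limit: fall back to the cheaper level, or to no codegen.
    return policy.fallback_opt_level
```

With the defaults above, a small expression stays at O2, a very large one starts at O1, and anything whose compilation exceeds the limit falls back to O0 (or to interpretation when `fallback_opt_level` is `None`). Whether the fallback itself should be time-limited is left open here, as it is in the proposal.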
Workaround
In some cases disabling codegen can help.
SET disable_codegen=true;
Issue Links
- is duplicated by
  - IMPALA-3262 Investigate Codegen Performance (Resolved)
- relates to
  - IMPALA-3259 Codegen is not cancellable and can use a lot of CPU and memory (Resolved)
  - IMPALA-5443 Consider automatically disabling codegen per ExecNode based on planner estimates (Open)
  - IMPALA-5081 Expose IR optimization level via query option (Resolved)
  - IMPALA-5444 Asynchronous code generation (Resolved)