[ARROW-16599] [C++] Implementation of ExecuteScalarExpressionOverhead benchmarks without arrow for comparison

Details

Type: Sub-task
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 9.0.0
Component/s: C++
Labels:
- pull-request-available

External issue URL:
https://github.com/apache/arrow/issues/20250

Description

The ExecuteScalarExpressionOverhead group of benchmarks currently gives us values that we can compare across batch sizes or across expressions. It does not show how well arrow does compared to what is possible in general.

The simple_expression (negate x) and complex_expression (x>0 and x<20) benchmarks, which perform an actual operation on data, can be implemented in pure C++ for comparison.

I implemented the complex_expression benchmark using technically unnecessary intermediate buffers for the > and < operator results, to match what happens in the arrow expression.
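A minimal sketch of what such pure C++ baselines could look like, not the code from the pull request. The function names, the int64_t element type, and the one-byte-per-value boolean buffers are assumptions made for illustration; the issue does not state which types the benchmark uses.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical baseline for simple_expression: out[i] = -in[i].
// Assumes no element equals INT64_MIN, whose negation would overflow.
void NegateBaseline(const std::vector<int64_t>& in, std::vector<int64_t>& out) {
  assert(out.size() == in.size());
  for (std::size_t i = 0; i < in.size(); ++i) {
    out[i] = -in[i];
  }
}

// Hypothetical baseline for complex_expression: out[i] = (in[i] > 0) and (in[i] < 20).
// The `greater` and `less` buffers are not needed for the result; they are
// materialized only to mirror the intermediate results described above.
// Booleans are stored one byte per value here (an assumption for simplicity).
void RangeCheckBaseline(const std::vector<int64_t>& in,
                        std::vector<uint8_t>& greater,
                        std::vector<uint8_t>& less,
                        std::vector<uint8_t>& out) {
  assert(greater.size() == in.size());
  assert(less.size() == in.size());
  assert(out.size() == in.size());
  for (std::size_t i = 0; i < in.size(); ++i) {
    greater[i] = in[i] > 0 ? 1 : 0;
  }
  for (std::size_t i = 0; i < in.size(); ++i) {
    less[i] = in[i] < 20 ? 1 : 0;
  }
  for (std::size_t i = 0; i < in.size(); ++i) {
    out[i] = greater[i] & less[i];
  }
}
```

Running the three passes as separate loops keeps the memory traffic comparable to evaluating the comparison and the "and" one after another; fusing them into a single loop would measure a different, cheaper computation.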

What may seem unfair is that I currently re-use the input/output/intermediate buffers over all iterations. I also tried using new and delete each time, but could not measure a difference in performance. Reusing the buffers allows using std::vector for slightly cleaner code. Re-creating a vector each time would result in a lot of overhead from initializing the vector values and is therefore not useful.
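The buffer reuse described above can be sketched as follows. This is an illustration only: it swaps in a plain std::chrono timing loop rather than the benchmark harness used in the pull request, and the names TimingResult and TimeNegateWithReusedBuffers, as well as the int64_t element type, are hypothetical.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <numeric>
#include <vector>

struct TimingResult {
  double seconds_per_iteration;
  int64_t checksum;  // read from the output so the work has an observable effect
};

// Times a negate kernel while reusing the same buffers for every iteration.
TimingResult TimeNegateWithReusedBuffers(std::size_t batch_size, int iterations) {
  assert(iterations > 0);
  // Buffers are allocated (and value-initialized by std::vector) exactly once,
  // outside the timed region.
  std::vector<int64_t> input(batch_size);
  std::iota(input.begin(), input.end(), int64_t{1});  // 1, 2, ..., batch_size
  std::vector<int64_t> output(batch_size);

  int64_t checksum = 0;
  const auto start = std::chrono::steady_clock::now();
  for (int it = 0; it < iterations; ++it) {
    for (std::size_t i = 0; i < batch_size; ++i) {
      output[i] = -input[i];
    }
    if (batch_size > 0) {
      checksum += output[batch_size - 1];
    }
  }
  const auto stop = std::chrono::steady_clock::now();

  const std::chrono::duration<double> elapsed = stop - start;
  return {elapsed.count() / iterations, checksum};
}
```

Constructing `output` inside the loop instead would zero-fill batch_size values on every iteration, which is the initialization overhead mentioned above. An optimizing compiler may still simplify a loop like this, so the timing is only indicative.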

Example output: example-output-baseline.txt

Attachments

example-output-baseline.txt
17/May/22 19:21
7 kB
Tobias Zagorni

Issue Links

links to

GitHub Pull Request #13179

People

Assignee: Tobias Zagorni

Reporter: Tobias Zagorni

Votes: 0

Watchers: 3

Dates

Created: 17/May/22 19:18

Updated: 11/Jan/23 11:45

Resolved: 06/Jul/22 01:01

Time Tracking

Estimated: Not Specified

Remaining:

Logged: 1.5h