[OAK-8011] Benchmark on QUERY_DURATION metrics implemented in OAK-7904 - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Task
Status: Closed
Priority: Major
Resolution: Not A Problem
Affects Version/s: None
Fix Version/s: 1.12.0
Component/s: indexing, query
Labels:
None

Description

As part of ~~OAK-7904~~, there are some possible performance concerns on adding additional metrics in code which is executed a lot.
See tmueller's comment:

Some comments on overhead of measuring:

We measure here very common, and very fast operations. I don't know how fast next() could be, but if everything is in memory, it could be faster than 600 ns. I measured the fastest measured operation was processed at 0.091904 milliseconds , that would be 91904 nanoseconds. Measures was this divided by 256, so just 359 nanoseconds.

System.nanoTime() can be slower than that, according to this older article it can be 650 nanoseconds. We need to call it twice to measure, so 1'300 nanoseconds. Meaning, measuring in the worst case seens so far slows down the operation by factor 4.6 (worst case seen so far).

What we could do is use org.apache.jackrabbit.oak.stats Clock.Fast, which has a much lower overhead than calling System.nanoTime(). The name "Fast" is somewhat of a misnomer: the clock isn't really faster than other clocks, it's just less overhead. So getting the current time is fast. Resolution is low, but that wouldn't be a problem in our case, it's just that most of the time, operations would be 0 ns, and rarely 100s of ns. On average, that would even out (same as with the sampling it is using right now). The problems with Clock.Fast are:

Hard to get a hand on this instance.
It uses a thread pool executor service, which is problematic. If the same service is used by other threads that take milliseconds, then the clock is extremely inaccurate. I would be better to use a simple, separate daemon thread.

Seeing that there is the possibility to enable/disable the metrics stats two separate benchmark tests can be run:

specifying the oak.query.timerDisabled system prop
without specifying the oak.query.timerDisabled system prop

Attachments

Issue Links

relates to

OAK-7904 Exporting query duration per index metrics with Sling Metrics / DropWizard

Closed

Activity

People

Assignee:: Unassigned

Reporter:: Paul Chibulcuteanu

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 29/Jan/19 15:06

Updated:: 08/Oct/19 15:21

Resolved:: 07/Feb/19 14:14