[LUCENE-2050] Improve contrib/benchmark for testing near-real-time search performance - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 3.0
Component/s: modules/benchmark
Labels:
None

Lucene Fields:

New

Description

It's not easy to test NRT performance right now w/ contrib/benchmark.
I've made some initial fixes to improve this:

Added new '&', that can follow any task within a serial sequence,
to "background" the task (just like a shell). The test runs in
the BG, and then at the end of all serial tasks, any still running
BG tasks are stopped & joined.

Added WaitTask that simply waits; useful for controlling how long
the BG'd tasks get to run.

Added RollbackIndex task, which is real handy for using a given
index for an NRT test, doing a bunch of updates, then reverting it
all so your next run uses the same starting index

Fixed the existing NearRealTimeReaderTask to simply periodically
open the new reader (previously it was also running a fixed
search), and removed its own threading (since & can do that
now). It periodically wakes up, opens the new reader, and swaps it
into the PerfRunData, at the schedule you specify. I switched all
usage of PerfRunData's get/setIndexReader APIs to use ref
counting.

With these changes you can now make some very simple but powerful
algs, eg:

OpenIndex
{
  NearRealtimeReader(0.5) &
  # Warm
  Search
  { "Index1" AddDoc > : * : 100/sec &
  [ { "Search" Search > : * ] : 4 &
  Wait(30.0)
}
CloseReader
RollbackIndex
RepSumByName

This alg first opens the IndexWriter, then spawns the BG thread to
reopen the NRT reader twice per second, does one warming Search (in
the FG), spans a new thread to index documents at the rate of 100 per
second, then spawns 4 search threads that do as many searches as they
can. We then wait for 30 seconds, then stop all the threads, revert
the index, and report.

The patch is a work in progress – it generally works, but there're a
few nocommits, and, we may want to improve reporting (though I think
that's a separate issue).

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LUCENE-2050.patch
13/Nov/09 13:29
42 kB
Michael McCandless
LUCENE-2050.patch
13/Nov/09 13:26
37 kB
Michael McCandless
LUCENE-2050.patch
10/Nov/09 19:56
31 kB
Michael McCandless

Activity

People

Assignee:: Michael McCandless

Reporter:: Michael McCandless

Votes:: 1 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 10/Nov/09 19:48

Updated:: 28/Aug/22 12:13

Resolved:: 14/Nov/09 11:14