[LUCENE-675] Lucene benchmark: objective performance test for Lucene - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels:
None

Description

We need an objective way to measure the performance of Lucene, both indexing and querying, on a known corpus. This issue is intended to collect comments and patches implementing a suite of such benchmarking tests.

Regarding the corpus: one of the widely used and freely available corpora is the original Reuters collection, available from http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-20/www/data/news20.tar.gz or http://people.csail.mit.edu/u/j/jrennie/public_html/20Newsgroups/20news-18828.tar.gz. I propose to use this corpus as a base for benchmarks. The benchmarking suite could automatically retrieve it from known locations, and cache it locally.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

benchmark.byTask.patch
16/Nov/06 20:17
334 kB
Doron Cohen
benchmark.patch
06/Nov/06 03:25
71 kB
Grant Ingersoll
byTask.2.patch.txt
05/Jan/07 04:42
218 kB
Doron Cohen
byTask.jre1.4.patch.txt
11/Jan/07 07:48
219 kB
Doron Cohen
extract_reuters.plx
24/Oct/06 00:33
4 kB
Marvin Humphrey
LuceneBenchmark.java
21/Sep/06 05:18
31 kB
Andrzej Bialecki
taskBenchmark.zip
15/Nov/06 09:15
65 kB
Doron Cohen
timedata.zip
07/Nov/06 10:50
7 kB
Doron Cohen
tiny.alg
12/Nov/06 10:12
3 kB
Doron Cohen
tiny.properties
12/Nov/06 10:12
1.0 kB
Doron Cohen

Activity

People

Assignee:: Grant Ingersoll

Reporter:: Andrzej Bialecki

Votes:: 3 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 21/Sep/06 05:16

Updated:: 28/Aug/22 11:30

Resolved:: 13/Jan/07 04:16