|
Michael McCandless made changes - 07/Jan/08 02:55 PM
HI Mike, I'm planning to branch the trunk today. Considering the file format changes, it might be a bit risky to apply this patch last minute. I think we should commit this for 2.4. What do you think? -Michael
OK, I agree, it is somewhat risky, so let's wait. (Though it is a sizable gain in performance!).
I wonder how accurate these profiling numbers are? Java profiling slows -Michael
Good point – I haven't measured outside of profiling. I plan to build a full Wiki index with and without this change to test.... Attached patch updated to current trunk. All tests pass. I plan to
commit after 2.3 is out... OK I ran a performance test with this patch, indexing the first 200K analyzer=org.apache.lucene.analysis.standard.StandardAnalyzer doc.stored = true docs.file=/Volumes/External/lucene/wiki.txt directory=FSDirectory merge.scheduler=org.apache.lucene.index.SerialMergeScheduler { "Rounds" NewRound RepSumByPrefRound BuildIndex I used SerialMergeScheduler so that I could measure time saved due to Without the patch, best of 3 was 509.0 sec; with patch, best of 3 was
Michael McCandless made changes - 21/Jan/08 02:51 AM
Michael McCandless made changes - 21/Jan/08 02:52 AM
Attaching the right patch this time...
Michael McCandless made changes - 21/Jan/08 02:53 AM
I just committed this. Note that this is a [small] change to the index format, so if you use trunk to build an index, 2.3 won't be able to read it!
Michael McCandless made changes - 25/Jan/08 11:33 AM
Michael McCandless made changes - 11/Oct/08 12:49 PM
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
(Note that the TestBackwardsCompatibility test will fail if you apply the patch because the new *.zip files I added aren't in the patch).
I think we should commit this for 2.3? It's a sizable gain in merging
performance.