Hi Mike, I'm planning to branch the trunk today. Considering the file format changes, it might be a bit risky to apply this patch at the last minute. I think we should commit this for 2.4. What do you think? -Michael
OK, I agree, it is somewhat risky, so let's wait. (Though it is a sizable gain in performance!)
I wonder how accurate these profiling numbers are? Java profiling slows -Michael
Good point – I haven't measured outside of profiling. I plan to build a full Wiki index with and without this change to test....

Attached patch updated to current trunk. All tests pass. I plan to commit after 2.3 is out...

OK I ran a performance test with this patch, indexing the first 200K

analyzer=org.apache.lucene.analysis.standard.StandardAnalyzer
doc.stored = true
docs.file=/Volumes/External/lucene/wiki.txt
directory=FSDirectory
merge.scheduler=org.apache.lucene.index.SerialMergeScheduler
{ "Rounds"
NewRound
RepSumByPrefRound BuildIndex

I used SerialMergeScheduler so that I could measure time saved due to

Without the patch, best of 3 was 509.0 sec; with patch, best of 3 was

Attaching the right patch this time...
I just committed this. Note that this is a [small] change to the index format, so if you use trunk to build an index, 2.3 won't be able to read it!
(Note that the TestBackwardsCompatibility test will fail if you apply the patch because the new *.zip files I added aren't in the patch).
I think we should commit this for 2.3? It's a sizable gain in merging
performance.