Thanks for the patch Michael,
The patch worked fine with CheckIndex. Checkindex worked with an index with 2.49 billion terms.
I added commas to the output below:
test: terms, freq, prox...OK [2,487,224,745 terms; 23,573,976,855 terms/docs pairs; 97,223,318,067 tokens]
We are working on determining how to test it with Solr. The ArrayIndexOutOfBounds exception appears in the logs about for about 1 in every 100 queries. We haven't been able to determine which queries trigger the problem.
We are using an older version of Solr with lucene 2.9-dev 779312 - 2009-05-27 17:19:55 . I'm not sure if we can just drop in a later version of lucene with the patch or if I need to patch the older 2.9 dev lucene version that came with our Solr. What do you suggest?
What I'm thinking of is to run 10,000 queries against our dev server pointing at one of the large segment indexes with and without the patch.