Lucene - Core
LUCENE-5843

IndexWriter should refuse to create an index with more than INT_MAX docs

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.9.1, 4.10, 6.0
    • Component/s: core/index
    • Labels: None
    • Lucene Fields: New

      Description

      It's more and more common for users these days to create very large indices, e.g. indexing lines from log files, or packets on a network, etc., and it's not hard to accidentally exceed the maximum number of documents in one index.

      I think the limit is actually Integer.MAX_VALUE-1 docs, because we use that value as a sentinel during searching.

      I'm not sure what IW does today if you create a too-big index but it's probably horrible; it may succeed and then at search time you hit nasty exceptions when we overflow int.

      I think it should throw an IndexFullException instead. It'd be nice if we could do this on the very doc that, when added, would go over the limit, but I would also settle for just throwing at flush as well ... i.e. I think what's really important is that the index does not become unusable.

      1. LUCENE-5843.patch
        30 kB
        Michael McCandless
      2. LUCENE-5843.patch
        31 kB
        Michael McCandless
      3. LUCENE-5843.patch
        29 kB
        Michael McCandless
      4. LUCENE-5843.patch
        23 kB
        Michael McCandless

          Activity

          Hoss Man added a comment -

          I'm not sure what IW does today if you create a too-big index but it's probably horrible; it may succeed and then at search time you hit nasty exceptions when we overflow int.

          that is exactly what happens – see SOLR-6065 for context

          Robert Muir added a comment -

          In my opinion the way to go is to have a package-private limit (for test purposes) that defaults to Integer.MAX_VALUE.

          this way we can actually test the thing with values like... 5

          It's more than just checking at addDocument (ideal) or at flush (not great but, as you say, better); we also have to handle cases like addIndexes(Dir) and addIndexes(Reader).

          Jack Krupansky added a comment -

          That Solr Jira has my comments as well, but I just want to reiterate that the actual limit should be more clearly documented. I filed a Jira for that quite a while ago - LUCENE-4104. And if this new issue will resolve the problem, please mark my old LUCENE-4105 issue as a duplicate.

          Uwe Schindler added a comment -

          I'm not sure what IW does today if you create a too-big index but it's probably horrible; it may succeed and then at search time you hit nasty exceptions when we overflow int.

          If a single segment exceeds the limit while merging, it's horrible. If you have an index that exceeds the limit, you get an exception when opening it: BaseCompositeReader throws in its constructor:

                maxDoc += r.maxDoc();      // compute maxDocs
                if (maxDoc < 0 /* overflow */) {
                  throw new IllegalArgumentException("Too many documents, composite IndexReaders cannot exceed " + Integer.MAX_VALUE);
                }
          

          The limit is MAX_VALUE; the -1 is just a stupid limitation of TopDocs, but the real limit is actually smaller, because arrays have a maximum size in Java. DocIdSetIterator's sentinel is not a problem, because it's simply the last document (MAX_VALUE), which is always the last possible one (the iterator is always exhausted once you reach the last doc).
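
          For reference, the search-time sentinel discussed here is DocIdSetIterator.NO_MORE_DOCS, which is defined as Integer.MAX_VALUE. A minimal check, assuming Lucene 4.x on the classpath:

            import org.apache.lucene.search.DocIdSetIterator;

            public class SentinelCheck {
              public static void main(String[] args) {
                // NO_MORE_DOCS is Integer.MAX_VALUE; per Uwe's point, that is always the
                // last possible doc id, so an iterator positioned there is exhausted anyway.
                System.out.println(DocIdSetIterator.NO_MORE_DOCS == Integer.MAX_VALUE); // true
              }
            }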

          Michael McCandless added a comment -

          The limit is MAX_VALUE; the -1 is just a stupid limitation of TopDocs, but the real limit is actually smaller, because arrays have a maximum size in Java.

          Yeah, I'll set the limit (public static int on IndexWriter) to ArrayUtil.MAX_ARRAY_LENGTH.

          Uwe Schindler added a comment -

          Should we then also fix the CompositeReader ctor check to be consistent?

          Michael McCandless added a comment -

          Should we then also fix the CompositeReader ctor check to be consistent?

          I think so ... I'll do that.

          Uwe Schindler added a comment -

          Yeah, I'll set the limit (public static int on IndexWriter) to ArrayUtil.MAX_ARRAY_LENGTH

          I am not sure if this is the best idea. This constant is dynamic, so it could happen that an IndexWriter on a 32-bit JVM creates an index that is not readable on another JVM (e.g., 64-bit and a different vendor), because the constant changes.

          I am fine with using the dynamic constant for stuff like overallocating arrays and so on, but we should hard-code the maximum document number in an index in a system-independent way.
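
          (For context: ArrayUtil.MAX_ARRAY_LENGTH is dynamic because it is derived from the JVM's array header size, which differs between 32-bit and 64-bit VMs. A minimal illustration, assuming Lucene 4.x, where the constant was defined in terms of RamUsageEstimator's per-JVM header size:)

            import org.apache.lucene.util.RamUsageEstimator;

            public class MaxArrayLengthCheck {
              public static void main(String[] args) {
                // The array header size is probed per JVM (32- vs. 64-bit, compressed oops),
                // so a limit derived from it would differ from VM to VM -- Uwe's objection.
                System.out.println(Integer.MAX_VALUE - RamUsageEstimator.NUM_BYTES_ARRAY_HEADER);
              }
            }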

          Michael McCandless added a comment -

          I am fine with using the dynamic constant for stuff like overallocating arrays and so on, but we should hard-code the maximum document number in an index in a system-independent way.

          Hmm, good point ... I'll make it fixed.

          Michael McCandless added a comment -

          Patch, just adding a counter to IndexWriter that tracks how many docs
          are in the index.

          I added public static final int IndexWriter.MAX_DOCS with the limit
          set to ArrayUtil.MAX_ARRAY_LENGTH, and added a new
          oal.index.IndexFullException, thrown by any method that adds docs to
          the index.

          Internally, the counter acts like a credit card: before any operation
          is allowed to actually modify the index / change the segmentInfos or
          DWPT state (via addIndexes, add/updateDocument/s), it first "charges"
          the credit card and if the charge fails it throws IndexFullException.

          When a merge commits, I deduct the deleted docs it had reclaimed, and
          I also deduct in places where we drop 100% deleted segments.

          I didn't add a setting to IWC for this ... I think that's sort of
          odd.
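
          A minimal sketch of the "credit card" pattern described above, using hypothetical names (DocCountGuard, reserve, release); the actual patch wires this logic into IndexWriter itself:

            import java.util.concurrent.atomic.AtomicLong;

            public class DocCountGuard {
              private final int maxDocs;                   // e.g. IndexWriter.MAX_DOCS
              private final AtomicLong pendingNumDocs = new AtomicLong();

              public DocCountGuard(int maxDocs) {
                this.maxDocs = maxDocs;
              }

              // "Charge the credit card" before any operation may modify the index;
              // if the charge would exceed the limit, roll it back and throw.
              public void reserve(int numDocs) {
                long count = pendingNumDocs.addAndGet(numDocs);
                if (count > maxDocs) {
                  pendingNumDocs.addAndGet(-numDocs);      // undo the charge
                  throw new IllegalStateException(
                      "number of documents in the index cannot exceed " + maxDocs);
                }
              }

              // "Refund" docs reclaimed by a committing merge, or when 100%-deleted
              // segments are dropped.
              public void release(int numDocs) {
                pendingNumDocs.addAndGet(-numDocs);
              }
            }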

          Shai Erera added a comment -

          I added public static final int IndexWriter.MAX_DOCS with the limit
          set to ArrayUtil.MAX_ARRAY_LENGTH.

          But MAX_ARRAY_LENGTH is dynamic, and depends on the JRE (32/64-bit). So that's not "fixed" across JVMs, right?

          Robert Muir added a comment -

          Can it be INT_MAX-8 for this reason?

          Shai Erera added a comment -

          Yes, I think we shouldn't try to be too smart here. It can even be MAX_INT-1024 for all practical purposes (and if we want to be on the safe side w/ int[] allocations), as I doubt anyone will complain he cannot put MAX_INT-1023 docs in an index...

          Michael McCandless added a comment -

          Sorry, my description was stale: in the patch I settled on MAX_INT - 128 as a "defensive" attempt to stay hopefully well below the minimum value of MAX_ARRAY_LENGTH across normal JVMs ...
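
          That is, the limit becomes a fixed, VM-independent constant (value as stated in this comment):

            // 128 below Integer.MAX_VALUE: comfortably under ArrayUtil.MAX_ARRAY_LENGTH
            // on normal JVMs, yet identical on every VM.
            public static final int MAX_DOCS = Integer.MAX_VALUE - 128;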

          Shai Erera added a comment -

          oh good, I didn't read the patch before, but I see you even explain why we don't use the constant from ArrayUtil! +1

          Michael McCandless added a comment -

          New patch, I think it's ready: I resolved the nocommits, and removed
          Test2BDocs (I think its tests are folded into
          TestIndexWriterMaxDocs).

          Hoss Man added a comment -

          Mike: couple quick suggestions...

          • private static int actualMaxDocs = MAX_DOCS;

            static void setMaxDocs(int docs) {
              if (MAX_DOCS < docs) {
                throw new IllegalArgumentException("max docs must not exceed MAX_DOCS");
              }
              actualMaxDocs = docs;
            }

            ...that way some poor bastard who sees it in the code and tries to be crafty and add a stub class to that package to set it to Integer.MAX_VALUE will get an immediate error instead of a timebomb.

          • add a public method to the test-framework that wraps this package-private setter, so that tests in other packages besides org.apache.lucene.index can mutate this (a sketch follows this list).
            • then we can add tests for clean behavior in Solr as well (not to mention anybody else who writes a Lucene app and wants to test how their app behaves when the index gets too big, w/o adding an org/apache/lucene/index dir to their test source)
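
          A sketch of such a wrapper, with hypothetical names (it has to live in the org.apache.lucene.index package to reach the package-private setter, but would ship in the test-framework jar):

            package org.apache.lucene.index;

            // Hypothetical helper: public, so tests in any package can lower the limit.
            public final class TestIndexWriterMaxDocsHook {
              private TestIndexWriterMaxDocsHook() {}

              public static void setMaxDocs(int limit) {
                IndexWriter.setMaxDocs(limit);           // rejects limit > IndexWriter.MAX_DOCS
              }

              public static void restoreMaxDocs() {
                IndexWriter.setMaxDocs(IndexWriter.MAX_DOCS);
              }
            }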
          Michael McCandless added a comment -

          Thanks Hoss, those are good ideas ... I'll fold them in.

          Erick Erickson added a comment -

          Heck, as far as I'm concerned, tripping an exception when we're 1M below the limit is fine.

          At those numbers, a million between friends is nothing...

          Michael McCandless added a comment -

          New patch w/ Hoss's feedback ... I think it's ready.

          Michael McCandless added a comment -

          New patch: I just removed the custom exception and now throw IllegalStateException. I'll commit soon...
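
          From an application's point of view the enforced limit then looks like this sketch (assuming Lucene 4.10; in practice you would only reach it after ~2.1 billion adds, or in a test that lowers the limit via the hook discussed above):

            import org.apache.lucene.analysis.core.WhitespaceAnalyzer;
            import org.apache.lucene.document.Document;
            import org.apache.lucene.index.IndexWriter;
            import org.apache.lucene.index.IndexWriterConfig;
            import org.apache.lucene.store.RAMDirectory;
            import org.apache.lucene.util.Version;

            public class MaxDocsDemo {
              public static void main(String[] args) throws Exception {
                IndexWriterConfig iwc = new IndexWriterConfig(
                    Version.LUCENE_4_10_0, new WhitespaceAnalyzer(Version.LUCENE_4_10_0));
                try (IndexWriter w = new IndexWriter(new RAMDirectory(), iwc)) {
                  while (true) {
                    w.addDocument(new Document());       // eventually trips the limit
                  }
                } catch (IllegalStateException e) {
                  // message wording approximate: "number of documents in the index cannot exceed ..."
                  System.out.println("index is full: " + e.getMessage());
                }
              }
            }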

          ASF subversion and git services added a comment -

          Commit 1614402 from Michael McCandless in branch 'dev/branches/branch_4x'
          [ https://svn.apache.org/r1614402 ]

          LUCENE-5843: IndexWriter now enforces max docs in one index

          ASF subversion and git services added a comment -

          Commit 1614422 from Michael McCandless in branch 'dev/trunk'
          [ https://svn.apache.org/r1614422 ]

          LUCENE-5843: this is a bug not a feature!

          ASF subversion and git services added a comment -

          Commit 1614423 from Michael McCandless in branch 'dev/trunk'
          [ https://svn.apache.org/r1614423 ]

          LUCENE-5843: woops, I committed too much

          ASF subversion and git services added a comment -

          Commit 1614424 from Michael McCandless in branch 'dev/trunk'
          [ https://svn.apache.org/r1614424 ]

          LUCENE-5843: this is a bug not a feature!

          ASF subversion and git services added a comment -

          Commit 1614425 from Michael McCandless in branch 'dev/branches/branch_4x'
          [ https://svn.apache.org/r1614425 ]

          LUCENE-5843: this is a bug not a feature!

          Show
          ASF subversion and git services added a comment - Commit 1614425 from Michael McCandless in branch 'dev/branches/branch_4x' [ https://svn.apache.org/r1614425 ] LUCENE-5843 : this is a bug not a feature!
          Hide
          Michael McCandless added a comment -

          Reopen for backport to 4.9.1...

          ASF subversion and git services added a comment -

          Commit 1625415 from Michael McCandless in branch 'dev/branches/lucene_solr_4_9'
          [ https://svn.apache.org/r1625415 ]

          LUCENE-5843: backport to 4.9.x

          Michael McCandless added a comment -

          Bulk close for Lucene/Solr 4.9.1 release


            People

            • Assignee: Michael McCandless
            • Reporter: Michael McCandless
            • Votes: 0
            • Watchers: 4
