[LUCENE-3848] basetokenstreamtestcase should fail if tokenstream starts with posinc=0 - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 3.6, 4.0-ALPHA
Component/s: None
Labels:
None

Lucene Fields:

New

Description

This is meaningless for a tokenstream to start with posinc=0,

Its also caused problems and hairiness in the indexer (~~LUCENE-1255~~, ~~LUCENE-1542~~),
and it makes senseless tokenstreams. We should add a check and fix any that do this.

Furthermore the same bug can exist in removing-filters if they have enablePositionIncrements=false.
I think this option is useful: but it shouldnt mean 'allow broken tokenstream', it just means we
don't add gaps.

If you remove tokens with enablePositionIncrements=false it should not cause the TS to start with
positionincrement=0, and it shouldnt 'restructure' the tokenstream (e.g. moving synonyms on top of a different word).
It should just not add any 'holes'.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LUCENE-3848-MockGraphTokenFilter.patch
04/Mar/12 21:28
21 kB
Michael McCandless
LUCENE-3848.patch
04/Mar/12 19:41
2 kB
Robert Muir
LUCENE-3848.patch
15/Mar/12 17:12
9 kB
Robert Muir

Activity

People

Assignee:: Unassigned

Reporter:: Robert Muir

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 04/Mar/12 19:39

Updated:: 28/Aug/22 13:10

Resolved:: 16/Mar/12 14:29