Lucene - Core
LUCENE-5278

MockTokenizer throws away the character right after a token even if it is a valid start to a new token

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.6, 5.0
    • Component/s: None
    • Labels: None
    • Lucene Fields: New

      Description

      MockTokenizer throws away the character right after a token even if it is a valid start to a new token. You won't see this unless you build a tokenizer that can recognize every character, for example with new RegExp(".") or RegExp("...").

      Changing this behaviour seems to break a number of tests.
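The effect of the bug can be sketched with a simplified standalone model (this is not the actual MockTokenizer code, which drives a CharacterRunAutomaton; here a fixed token length n stands in for a regex of n dots, and a boolean flag models the old lookahead-discarding behavior):

```java
import java.util.ArrayList;
import java.util.List;

public class LookaheadBugSketch {

    // discardFollowingChar == true models the pre-fix behavior: the character
    // read to detect the end of a token is thrown away instead of being
    // reconsidered as the start of the next token.
    static List<String> tokenize(String input, int n, boolean discardFollowingChar) {
        List<String> tokens = new ArrayList<>();
        int i = 0;
        while (i + n <= input.length()) {
            tokens.add(input.substring(i, i + n)); // full match: automaton in an accept state
            i += n;
            if (discardFollowingChar) {
                i++; // bug: the lookahead character is lost
            }
        }
        return tokens; // any trailing partial match is dropped
    }

    public static void main(String[] args) {
        System.out.println(tokenize("abcde", 1, true));  // [a, c, e]  -- every other char lost
        System.out.println(tokenize("abcde", 1, false)); // [a, b, c, d, e]
    }
}
```

With a single-character pattern, every character should become its own token, which is why the bug only shows up with a tokenizer that can recognize every character.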

      1. LUCENE-5278.patch
        9 kB
        Robert Muir
      2. LUCENE-5278.patch
        6 kB
        Robert Muir
      3. LUCENE-5278.patch
        5 kB
        Nik Everett

        Activity

Robert Muir added a comment -

Thanks again Nik!
ASF subversion and git services added a comment -

Commit 1531498 from Robert Muir in branch 'dev/branches/branch_4x'
[ https://svn.apache.org/r1531498 ]

LUCENE-5278: remove CharTokenizer brain-damage from MockTokenizer so it works better with custom regular expressions
Robert Muir added a comment -

I committed this to trunk: I did a lot of testing locally but I want to let Jenkins have its way with it for a few hours before backporting to branch_4x.
ASF subversion and git services added a comment -

Commit 1531479 from Robert Muir in branch 'dev/trunk'
[ https://svn.apache.org/r1531479 ]

LUCENE-5278: remove CharTokenizer brain-damage from MockTokenizer so it works better with custom regular expressions
Robert Muir added a comment -

Added a few more tests to TestMockAnalyzer so all these crazy corner cases are caught there rather than while debugging other tests.
Robert Muir added a comment -

Nice patch Nik!

I think this is ready: I tweaked variable names and rearranged stuff (e.g. I use -1 instead of Integer so we aren't boxing, and a few other things).

I also added some unit tests.

The main reasons tests were failing with your original patch:

• reset() needed to clear the buffer variables.
• the state machine needed an extra check when emitting a token: e.g. if you make a regex of ".." and send it "abcde", the tokens should be "ab" and "cd", but not "e". So when we end on a partial match, we have to check that we are in an accept state.
• term-limit-exceeded is a special case (versus the last character being in a reject state).
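The accept-state point above can be sketched with java.util.regex standing in as an analogy for Lucene's automaton (MockTokenizer actually runs a CharacterRunAutomaton, so this is only an illustration of the semantics):

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class AcceptStateSketch {

    // Emit the longest match starting at each position. A trailing partial
    // match (lookingAt fails) never becomes a token, because the automaton
    // would not be in an accept state. Assumes the pattern cannot match "".
    static List<String> tokenize(Pattern pattern, String input) {
        List<String> tokens = new ArrayList<>();
        Matcher m = pattern.matcher(input);
        int pos = 0;
        while (pos < input.length()) {
            m.region(pos, input.length());
            if (m.lookingAt()) {
                tokens.add(m.group());
                pos = m.end();
            } else {
                pos++; // no token starts here; skip one character
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Regex ".." on "abcde": tokens "ab" and "cd"; the trailing "e" is
        // only a partial match, so no token is emitted for it.
        System.out.println(tokenize(Pattern.compile(".."), "abcde")); // [ab, cd]
    }
}
```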
Robert Muir added a comment -

I think I understand what you want: it makes sense. The only reason it's the way it is today is that this thing historically came from CharTokenizer (see the isTokenChar?).

But it would be better if you could, e.g., make a pattern like [A-Z][a-z]+ and have it actually break FooBar into Foo and Bar rather than throwing out "bar" altogether.

I'll dig into this!
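The desired behavior can be demonstrated with java.util.regex (again only an analogy; MockTokenizer uses Lucene's RegExp/automaton classes): the character that ends one token should be able to start the next one.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class CamelCaseSketch {

    static List<String> tokenize(String regex, String input) {
        List<String> tokens = new ArrayList<>();
        Matcher m = Pattern.compile(regex).matcher(input);
        while (m.find()) { // resumes exactly where the previous match ended
            tokens.add(m.group());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // "Foo" ends right before 'B', and 'B' starts the next token:
        System.out.println(tokenize("[A-Z][a-z]+", "FooBar")); // [Foo, Bar]
    }
}
```

With the pre-fix lookahead-discarding behavior, the 'B' would have been thrown away after "Foo" was emitted, and only [Foo] would come out.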
Nik Everett added a comment -

This patch "fixes" the behaviour from my perspective but breaks a bunch of other tests.

          People

          • Assignee: Robert Muir
          • Reporter: Nik Everett
          • Votes: 0
          • Watchers: 2
