[LUCENE-133] [PATCH] QueryParser assumes getPositionIncrement() == 1 - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: None
Component/s: core/queryparser
Labels:
None
Environment:

Operating System: All
Platform: PC

Bugzilla Id:
23307

Description

I've written an analyzer that can output several tokens when just one is input.
Say : "language" is analyzed as "C", "C++", "Java".

As stated by the docs, the first token (i.e. "C") is given a PositionIncrement
of 1 while the other ones have a PositionIncrement of 0. All share the same
positions as well.

When parsed by the QueryParser, the query :
language

...is interpreted as the PhraseQuery :
C C++ Java

...which is obviously not what I want.

I think the condition that triggers a PhraseQuery (vector's size > 1) is
over-simplistic. My tokens should feed a BooleanQuery with 3 clauses :
C || C++ || Java

However, if I input a 2 tokens query, I surely want (at least) a PhraseQuery.

Say now that "OS" is analyzed as "Windows", "Unix", "MacOS" (with
PositionIncrements set to 1-0-0 and same positions).

The query "language OS" should be parsed as :
"C Windows" || "C++ Windows" || "Java Windows" || C Unix" || "C++ Unix"

"Java Unix"	"C MacOS"	"C++ MacOS"	"Java MacOS".

Well... there may be a better optimization for that but in any case, I think
that QueryParser.getFieldQuery(String field, Analyzer analyzer, String
queryText) can not afford to lose the Tokens.getPositionIncrement as it
acutally does.

p.b.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

ASF.LICENSE.NOT.GRANTED--ZeroPositionIncrementTokenQueryParser.jj
27/Apr/04 23:35
27 kB
Pierrick Brihaye
ASF.LICENSE.NOT.GRANTED--queryparser.diff
11/Nov/04 22:33
3 kB
Daniel Naber
ASF.LICENSE.NOT.GRANTED--TestMultiAnalyzer.java
11/Nov/04 22:34
4 kB
Daniel Naber

Activity

People

Assignee:: Lucene Developers

Reporter:: Pierrick Brihaye

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 22/Sep/03 01:21

Updated:: 28/Aug/22 11:13

Resolved:: 27/May/06 01:36