[LUCENE-1489] highlighter problem with n-gram tokens - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Minor
Resolution: Won't Fix
Affects Version/s: None
Fix Version/s: None
Component/s: modules/highlighter
Labels:
None

Lucene Fields:

New

Description

I have a problem when using n-gram and highlighter. I thought it had been solved in ~~LUCENE-627~~...

Actually, I found this problem when I was using CJKTokenizer on Solr, though, here is lucene program to reproduce it using NGramTokenizer(min=2,max=2) instead of CJKTokenizer:

public class TestNGramHighlighter {

  public static void main(String[] args) throws Exception {
    Analyzer analyzer = new NGramAnalyzer();
    final String TEXT = "Lucene can make index. Then Lucene can search.";
    final String QUERY = "can";
    QueryParser parser = new QueryParser("f",analyzer);
    Query query = parser.parse(QUERY);
    QueryScorer scorer = new QueryScorer(query,"f");
    Highlighter h = new Highlighter( scorer );
    System.out.println( h.getBestFragment(analyzer, "f", TEXT) );
  }

  static class NGramAnalyzer extends Analyzer {
    public TokenStream tokenStream(String field, Reader input) {
      return new NGramTokenizer(input,2,2);
    }
  }
}

expected output is:
Lucene can make index. Then Lucene can search.

but the actual output is:
Lucene can make index. Then Lucene can search.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

lucene1489.patch
02/Oct/09 00:53
3 kB
David Bowen
LUCENE-1489.patch
10/Dec/09 22:20
3 kB
David Bowen

Issue Links

is duplicated by

LUCENE-6200 Highlighter sometime went wrong

Resolved

Activity

People

Assignee:: Unassigned

Reporter:: Koji Sekiguchi

Votes:: 3 Vote for this issue

Watchers:: 7 Start watching this issue

Dates

Created:: 12/Dec/08 01:35

Updated:: 28/Aug/22 11:56

Resolved:: 24/May/12 01:12

Agile

View on Board

highlighter problem with n-gram tokens

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates

Agile

Slack

Issue deployment