  1. Lucene - Core
  2. LUCENE-1889

FastVectorHighlighter: support for additional queries



    • Type: Wish
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.5, 4.0-ALPHA
    • Component/s: modules/highlighter
    • Labels: None
    • Lucene Fields: New


      I am using FastVectorHighlighter for some strange languages and it is working well!

      One thing I noticed immediately is that many query types are not highlighted (MultiTermQuery, MultiPhraseQuery, etc.).
      Here is one thing Michael M posted in the original ticket:

      I think a nice [eventual] model would be if we could simply re-run the
      scorer on the single document (using InstantiatedIndex maybe, or
      simply some sort of wrapper on the term vectors which are already a
      mini-inverted-index for a single doc), but extend the scorer API to
      tell us the exact term occurrences that participated in a match (which
      I don't think is exposed today).

      Due to strange requirements, I am using something similar to this (but specialized to our case).
      I am doing strange things like forcing MultiTermQuery instances to rewrite into BooleanQuery so they will be highlighted,
      and flattening MultiPhraseQuery instances into boolean OR'ed PhraseQuery clauses.
      I do not think these things would be 'fast', but I had a few ideas that might help:

      • Looking at contrib/highlighter, you can support FilteredQuery in flatten() by calling getQuery(), right?
      • Maybe as a last resort, try Query.extractTerms()?
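
      To make the flattening idea concrete, here is a minimal, self-contained sketch. It does not use the Lucene classes: the Query, PhraseQuery, MultiPhraseQuery and FilteredQuery types below are hypothetical stand-ins that only model the shape of the problem, and this flatten() is an illustration, not the highlighter's actual method. It unwraps a filtered query via getQuery() and expands a multi-phrase query into one plain phrase per combination of alternatives.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class FlattenSketch {
    // Hypothetical stand-ins for the query types; NOT the Lucene classes.
    interface Query {}

    static final class PhraseQuery implements Query {
        final List<String> terms;
        PhraseQuery(List<String> terms) { this.terms = terms; }
        @Override public String toString() { return "\"" + String.join(" ", terms) + "\""; }
    }

    static final class MultiPhraseQuery implements Query {
        final List<List<String>> positions; // alternative terms allowed at each position
        MultiPhraseQuery(List<List<String>> positions) { this.positions = positions; }
    }

    static final class FilteredQuery implements Query {
        private final Query inner;
        FilteredQuery(Query inner) { this.inner = inner; }
        Query getQuery() { return inner; }
    }

    /** Flattens a query into the plain phrases a highlighter could match. */
    static List<PhraseQuery> flatten(Query q) {
        List<PhraseQuery> out = new ArrayList<>();
        flatten(q, out);
        return out;
    }

    private static void flatten(Query q, List<PhraseQuery> out) {
        if (q instanceof FilteredQuery) {
            // The filter does not affect which terms match, so only the wrapped query matters.
            flatten(((FilteredQuery) q).getQuery(), out);
        } else if (q instanceof MultiPhraseQuery) {
            // Cartesian product over the alternatives at each position.
            List<List<String>> phrases = new ArrayList<>();
            phrases.add(new ArrayList<>());
            for (List<String> alternatives : ((MultiPhraseQuery) q).positions) {
                List<List<String>> next = new ArrayList<>();
                for (List<String> prefix : phrases) {
                    for (String term : alternatives) {
                        List<String> extended = new ArrayList<>(prefix);
                        extended.add(term);
                        next.add(extended);
                    }
                }
                phrases = next;
            }
            for (List<String> p : phrases) {
                out.add(new PhraseQuery(p));
            }
        } else if (q instanceof PhraseQuery) {
            out.add((PhraseQuery) q);
        }
        // Any other query type is silently ignored in this sketch.
    }

    public static void main(String[] args) {
        Query q = new FilteredQuery(new MultiPhraseQuery(Arrays.asList(
                Arrays.asList("quick", "fast"),
                Arrays.asList("fox"))));
        System.out.println(flatten(q)); // prints ["quick fox", "fast fox"]
    }
}
```

      The number of flattened phrases is the product of the alternatives at each position, which is one reason this approach is unlikely to be 'fast' for queries with many expansions.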


        1. LUCENE-1889_reader.patch
          2 kB
          Robert Muir
        2. LUCENE-1889.patch
          15 kB
          Michael Sokolov
        3. LUCENE-1889.patch
          14 kB
          Michael Sokolov
        4. LUCENE-1889.patch
          18 kB
          Michael Sokolov
        5. LUCENE-1889-solr.patch
          0.8 kB
          Michael Sokolov

              Assignee: Koji Sekiguchi (koji)
              Reporter: Robert Muir (rcmuir)