Lucene - Core / LUCENE-5503

Trivial fixes to WeightedSpanTermExtractor

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 4.7
    • Fix Version/s: 5.4
    • Component/s: modules/highlighter
    • Labels:
      None
    • Lucene Fields:
      New, Patch Available

      Description

      When a PhraseQuery is converted to a SpanNearQuery, the slop is miscalculated in some cases where the phrase contains stop words. The issue only really appears when there is more than one intervening run of stop words, for example: ab the cd the the ef.

      I also noticed that the inOrder determination is based on the newly calculated slop; it should probably be based on the original phraseQuery.getSlop() instead.
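
      To make the two points above concrete, here is a minimal, Lucene-free sketch. It is not the attached patch and not the actual WeightedSpanTermExtractor code. It assumes that removed stop words leave gaps in the term positions (so "ab the cd the the ef" yields positions 0, 2, 5), and the class and method names (SlopSketch, spanSlop, inOrder) are hypothetical. One plausible way to account for every run of stop words is to sum the gaps between consecutive positions, while deciding inOrder from the original slop only:

```java
public final class SlopSketch {

    /**
     * Hypothetical helper: the slop a span query would need once every
     * stop-word gap between consecutive term positions is counted.
     */
    static int spanSlop(int[] positions, int originalSlop) {
        int gaps = 0;
        for (int i = 1; i < positions.length; i++) {
            // A step of 1 means adjacent terms; anything beyond that is a gap.
            // Math.max guards against terms that share a position.
            gaps += Math.max(0, positions[i] - positions[i - 1] - 1);
        }
        return originalSlop + gaps;
    }

    /** Ordering is decided from the slop the user asked for, not the adjusted one. */
    static boolean inOrder(int originalSlop) {
        return originalSlop == 0;
    }

    public static void main(String[] args) {
        // Assumed positions for "ab the cd the the ef" with stop words removed.
        int[] positions = {0, 2, 5};
        int originalSlop = 0;
        System.out.println("span slop = " + spanSlop(positions, originalSlop));
        System.out.println("inOrder   = " + inOrder(originalSlop));
    }
}
```

      For the example phrase, the sketch adds 1 + 2 = 3 to the slop and still reports inOrder as true, because the original slop was 0. Deciding inOrder from the adjusted slop (3) would instead make the converted query unordered.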

      Patch and unit tests are on the way.

        Attachments

        1. LUCENE-5503.patch
          6 kB
          Tim Allison
        2. LUCENE-5503v2.patch
          7 kB
          Tim Allison
        3. LUCENE-5503.patch
          8 kB
          David Smiley

            People

            • Assignee:
              David Smiley (dsmiley)
            • Reporter:
              Tim Allison (tallison@apache.org)
            • Votes:
              0
            • Watchers:
              5
