Lucene - Core
  1. Lucene - Core
  2. LUCENE-1824

FastVectorHighlighter truncates words at beginning and end of fragments

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Minor Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.5, 4.0-ALPHA
    • Component/s: modules/highlighter
    • Labels:
      None
    • Environment:

      any

    • Lucene Fields:
      New, Patch Available

      Description

      FastVectorHighlighter does not take word boundaries into consideration when building fragments, so that in most cases the first and last word of a fragment are truncated. This makes the highlights less legible than they should be. I will attach a patch to BaseFragmentBuilder that resolves this by expanding the start and end boundaries of the fragment to the first whitespace character on either side of the fragment, or the beginning or end of the source text, whichever comes first. This significantly improves legibility, at the cost of returning a slightly larger number of characters than specified for the fragment size.

      1. LUCENE-1824.patch
        22 kB
        Koji Sekiguchi
      2. LUCENE-1824.patch
        21 kB
        Koji Sekiguchi
      3. LUCENE-1824.patch
        22 kB
        Koji Sekiguchi
      4. LUCENE-1824.patch
        17 kB
        Koji Sekiguchi
      5. LUCENE-1824.patch
        12 kB
        Koji Sekiguchi
      6. LUCENE-1824.patch
        9 kB
        Alex Vigdor

        Issue Links

          Activity

          Alex Vigdor created issue -
          Alex Vigdor made changes -
          Field Original Value New Value
          Attachment LUCENE-1824.patch [ 12416938 ]
          Alex Vigdor made changes -
          Attachment LUCENE-1824-test.patch [ 12416951 ]
          Attachment LUCENE-1824.patch [ 12416952 ]
          Alex Vigdor made changes -
          Attachment LUCENE-1824.patch [ 12416938 ]
          Michael Busch made changes -
          Fix Version/s 3.1 [ 12314025 ]
          Fix Version/s 2.9 [ 12312682 ]
          Alex Vigdor made changes -
          Attachment LUCENE-1824-test.patch [ 12417015 ]
          Alex Vigdor made changes -
          Attachment LUCENE-1824-test.patch [ 12416951 ]
          Alex Vigdor made changes -
          Attachment LUCENE-1824.patch [ 12417017 ]
          Alex Vigdor made changes -
          Attachment LUCENE-1824-test.patch [ 12417015 ]
          Alex Vigdor made changes -
          Attachment LUCENE-1824.patch [ 12416952 ]
          Robert Muir made changes -
          Component/s contrib/highlighter [ 12312096 ]
          Component/s contrib/* [ 12312028 ]
          Koji Sekiguchi made changes -
          Assignee Koji Sekiguchi [ koji ]
          Mark Thomas made changes -
          Workflow jira [ 12473536 ] Default workflow, editable Closed status [ 12563666 ]
          Mark Thomas made changes -
          Workflow Default workflow, editable Closed status [ 12563666 ] jira [ 12585237 ]
          Koji Sekiguchi made changes -
          Attachment LUCENE-1824.patch [ 12493106 ]
          Koji Sekiguchi made changes -
          Attachment LUCENE-1824.patch [ 12493260 ]
          Koji Sekiguchi made changes -
          Attachment LUCENE-1824.patch [ 12493304 ]
          Koji Sekiguchi made changes -
          Attachment LUCENE-1824.patch [ 12493518 ]
          Koji Sekiguchi made changes -
          Attachment LUCENE-1824.patch [ 12493535 ]
          Koji Sekiguchi made changes -
          Fix Version/s 3.5 [ 12317877 ]
          Koji Sekiguchi made changes -
          Link This issue blocks SOLR-2749 [ SOLR-2749 ]
          Koji Sekiguchi made changes -
          Status Open [ 1 ] Resolved [ 5 ]
          Resolution Fixed [ 1 ]
          Uwe Schindler made changes -
          Status Resolved [ 5 ] Closed [ 6 ]
          Gavin made changes -
          Link This issue blocks SOLR-2749 [ SOLR-2749 ]
          Gavin made changes -
          Link This issue is depended upon by SOLR-2749 [ SOLR-2749 ]

            People

            • Assignee:
              Koji Sekiguchi
              Reporter:
              Alex Vigdor
            • Votes:
              6 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development