Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-6485

Add a custom separator break iterator

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 5.3, 6.0
    • None
    • None
    • New

    Description

      Lucene currently includes a WholeBreakIterator used to highlight entire fields using the postings highlighter, without breaking their content into sentences.

      I would like to contribute a CustomSeparatorBreakIterator that breaks when a custom char separator is found in the text. This can be used for instance when wanting to highlight entire fields, value per value. One can subclass PostingsHighlighter and have getMultiValueSeparator return a control character, like U+0000 , then use the custom break iterator to break on U+0000 so that one snippet per value will be generated.

      Attachments

        1. LUCENE-6485.patch
          13 kB
          Luca Cavanna

        Activity

          People

            Unassigned Unassigned
            lucacavanna Luca Cavanna
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: