  1. Lucene - Core
  2. LUCENE-2603

FastVectorHighlighter: add a method to set an arbitrary char that is used when concatenating multiValued data

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: 2.9.3, 3.0.2
    • Fix Version/s: 3.1, 4.0-ALPHA
    • Component/s: modules/highlighter
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      Suppose the authors field contains the following multiValued names:

      • Michael McCandless
      • Erik Hatcher
      • Otis Gospodnetić

      Since FragmentsBuilder concatenates multiValued data with a space in BaseFragmentsBuilder.getFragmentSource():

      while( buffer.length() < endOffset && index[0] < values.length ){
        if( index[0] > 0 && values[index[0]].isTokenized() && values[index[0]].stringValue().length() > 0 )
          buffer.append( ' ' );
        buffer.append( values[index[0]++].stringValue() );
      }
      

      an entire field snippet (using LUCENE-2464) will be "Michael McCandless Erik Hatcher Otis Gospodnetić". It should be possible to set an arbitrary char (e.g. '/') as the separator so that a client can split the snippet easily, e.g. "Michael McCandless/Erik Hatcher/Otis Gospodnetić".
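
      The requested behaviour can be sketched outside Lucene as follows. This is only an illustration under stated assumptions: SketchFragmentsBuilder, setMultiValuedSeparator and concatenate are hypothetical names chosen for this example, not taken from the issue, and the sketch leaves out the isTokenized() and endOffset checks of the loop quoted above.

```java
import java.util.Arrays;
import java.util.List;

public class MultiValuedSeparatorSketch {

    /** Hypothetical stand-in for a fragments builder; not the Lucene class. */
    static class SketchFragmentsBuilder {
        // Defaults to a space, which mirrors the behaviour described in this issue.
        private char multiValuedSeparator = ' ';

        void setMultiValuedSeparator(char separator) {
            this.multiValuedSeparator = separator;
        }

        /** Joins the values, putting the separator before each non-empty value except the first. */
        String concatenate(List<String> values) {
            StringBuilder buffer = new StringBuilder();
            for (int i = 0; i < values.size(); i++) {
                String value = values.get(i);
                if (i > 0 && value.length() > 0) {
                    buffer.append(multiValuedSeparator);
                }
                buffer.append(value);
            }
            return buffer.toString();
        }
    }

    public static void main(String[] args) {
        List<String> authors =
            Arrays.asList("Michael McCandless", "Erik Hatcher", "Otis Gospodnetić");

        SketchFragmentsBuilder builder = new SketchFragmentsBuilder();
        System.out.println(builder.concatenate(authors)); // space-separated (default)

        builder.setMultiValuedSeparator('/');
        String snippet = builder.concatenate(authors);
        System.out.println(snippet); // slash-separated

        // A client can now recover the individual values.
        for (String author : snippet.split("/")) {
            System.out.println(author);
        }
    }
}
```

      The separator only helps the client if it is a char that cannot occur inside a field value, which is why the sketch makes it configurable instead of hard-coding '/'.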

            People

            • Assignee: Koji Sekiguchi
            • Reporter: Koji Sekiguchi
