Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-3942

SynonymFilter should set pos length att

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.0-ALPHA, 3.6.1
    • Component/s: None
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      Tokenizers/Filters can now produce graphs instead of a single linear
      chain of tokens, by setting the PositionLengthAttribute, expressing
      where (how many positions ahead) this token "ends".

      The default is 1, meaning it ends at the next position, to be
      backwards compatible.

      SynonymFilter produces graph output tokens, as long as the output is a
      single token, but currently never sets the pos length to express this.
      EG for the rule "wifi network -> hotspot", the hotspot token should
      have pos length = 2. With LUCENE-3940 this will allow us to verify
      that the offsets for such tokens are correct...

        Attachments

          Activity

            People

            • Assignee:
              mikemccand Michael McCandless
              Reporter:
              mikemccand Michael McCandless

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment