Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-8730

Ensure WordDelimiterGraphFilter always emits its original token first

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 8.1
    • None
    • None
    • New

    Description

      WordDelimiterFilter and WordDelimiterGraphFilter behave almost identically outside setting position length; the only difference being that WDGF can sometimes emit its original token as the second output token rather than the first. We should change this to conform to the behaviour of the older filter - this will make it much easier to remove WDF entirely and cut over tests that use it incidentally.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            romseygeek Alan Woodward
            romseygeek Alan Woodward
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment