  1. Lucene - Core
  2. LUCENE-3748

EnglishPossessiveFilter should work with Unicode right single quotation mark

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 3.1, 3.2, 3.4, 3.5
    • Fix Version/s: 3.6, 4.0-ALPHA
    • Component/s: modules/analysis
    • Labels:
      None
    • Lucene Fields:
      New, Patch Available

      Description

      The current EnglishPossessiveFilter (used in EnglishAnalyzer) removes a possessive only when it is written with the ASCII apostrophe '\'' followed by 's' or 'S'. Some common systems (German?) insert the Unicode '\u2019' (RIGHT SINGLE QUOTATION MARK) instead, and possessives written that way are not removed when processing UTF-8 text. I propose changing EnglishPossessiveFilter to accept '\u2019' as an alternative to '\''.
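
      To illustrate the proposed behaviour, here is a minimal standalone sketch in plain Java. It is not the Lucene source and not any of the attached patches; the class name PossessiveSketch and the method stripPossessive are hypothetical, and the sketch works on a String rather than on Lucene's token attributes. It only shows the rule described above: drop a trailing 's' or 'S' when it is preceded by either '\'' or '\u2019'.

```java
// Standalone sketch of the proposed rule; names are hypothetical and
// this is not the code from Lucene or from the attached patches.
final class PossessiveSketch {

    private static final char APOSTROPHE = '\'';
    private static final char RIGHT_SINGLE_QUOTE = '\u2019';

    // Returns the term without a trailing possessive suffix, or the
    // term unchanged when no such suffix is present.
    static String stripPossessive(String term) {
        int len = term.length();
        if (len >= 2) {
            char last = term.charAt(len - 1);
            char quote = term.charAt(len - 2);
            boolean endsWithS = last == 's' || last == 'S';
            boolean hasQuote = quote == APOSTROPHE || quote == RIGHT_SINGLE_QUOTE;
            if (endsWithS && hasQuote) {
                // Drop both the quote character and the 's'.
                return term.substring(0, len - 2);
            }
        }
        return term;
    }

    public static void main(String[] args) {
        System.out.println(stripPossessive("dog's"));       // ASCII apostrophe
        System.out.println(stripPossessive("dog\u2019s"));  // right single quotation mark
        System.out.println(stripPossessive("dogs"));        // no possessive, left as-is
    }
}
```

      With this rule both "dog's" and "dog\u2019s" reduce to "dog", while "dogs" is left untouched. A real filter would presumably apply the same check to the term buffer in place rather than allocating a new String.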

        Attachments

        1. LUCENE-3748.patch
          5 kB
          Robert Muir
        2. Patch-Lucene-3748
          3 kB
          David Croley
        3. LucenePatch
          2 kB
          David Croley

          Activity

            People

            • Assignee:
              rcmuir Robert Muir
              Reporter:
              dcroley David Croley
