Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-2015

ASCIIFoldingFilter: expose folding logic + small improvements to ISOLatin1AccentFilter

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 4.0-ALPHA
    • modules/analysis
    • None
    • New, Patch Available

    Description

      This patch adds a couple of non-ascii chars to ISOLatin1AccentFilter (namely: left & right single quotation marks, en dash, em dash) which we very frequently encounter in our projects. I know that this class is now deprecated; this improvement is for legacy code that hasn't migrated yet.

      It also enables easy access to the ascii folding technique use in ASCIIFoldingFilter for potential re-use in non-Lucene-related code.

      Attachments

        1. ASCIIFoldingFilter-no_formatting.patch
          4 kB
          Cédrik LIME
        2. ASCIIFoldingFilter-no_formatting.patch
          2 kB
          Cédrik LIME
        3. Filters.patch
          220 kB
          Cédrik LIME
        4. ISOLatin1AccentFilter.patch
          6 kB
          Cédrik LIME
        5. LUCENE-2015.patch
          1 kB
          Cédrik LIME
        6. LUCENE-2015.patch
          1 kB
          Robert Muir

        Activity

          People

            rcmuir Robert Muir
            cedrik_lime Cédrik LIME
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment