  1. Apache Jena
  2. JENA-1058

add ASCIIFoldingLowerCaseKeywordAnalyzer to jena-text

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Text
    • Labels: None

    Description

      I'd like to have an Analyzer for jena-text that behaves like the LowerCaseKeywordAnalyzer I implemented earlier, but also applies Lucene's ASCIIFoldingFilter. Matching would then ignore accents, so that, for example, "deja vu" matches "déjà vu".
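
      To illustrate the intended matching behaviour, here is a minimal sketch using only the Java standard library. It is not the proposed analyzer and does not use Lucene: the class FoldingSketch and the method foldKey are hypothetical names, and stripping combining marks after NFD normalization is a stand-in that only approximates ASCIIFoldingFilter (it does not handle characters such as "ø" or "ß" that have no canonical decomposition).

```java
import java.text.Normalizer;
import java.util.Locale;

// Hypothetical illustration only; not part of jena-text or Lucene.
class FoldingSketch {

    // Treats the whole input as one keyword (no tokenization), lowercases it,
    // then removes the combining marks produced by NFD decomposition.
    // Assumption: this approximates accent folding for Latin text only.
    static String foldKey(String value) {
        String lower = value.toLowerCase(Locale.ROOT);
        String decomposed = Normalizer.normalize(lower, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // "D\u00e9j\u00e0 Vu" is "Déjà Vu" written with Unicode escapes.
        String indexed = foldKey("D\u00e9j\u00e0 Vu");
        String query = foldKey("deja vu");
        System.out.println(indexed.equals(query));
    }
}
```

      Applying the same folding to both the indexed value and the query is what makes the accented and unaccented forms compare equal, while the value is still treated as a single keyword rather than split into words.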

      For some background on why I need this, see https://github.com/NatLibFi/Skosmos/issues/313

      I already have an implementation ready and will make a PR shortly.

          People

            Assignee: Osma Suominen (osma)
            Reporter: Osma Suominen (osma)
            Votes: 0
            Watchers: 3
