[JENA-1058] add ASCIIFoldingLowerCaseKeywordAnalyzer to jena-text - ASF JIRA

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Won't Fix
Affects Version/s: None
Fix Version/s: None
Component/s: Text
Labels: None

Description

I'd like to have an Analyzer for jena-text that works like the LowerCaseKeywordAnalyzer I implemented earlier, but also applies the ASCIIFoldingFilter from Lucene. Comparisons would then ignore accents, so that, for example, "deja vu" will match "déjà vu".
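
To illustrate the intended matching behaviour, here is a minimal sketch in plain Java. It is not the proposed analyzer and does not use Lucene: it approximates accent folding with `java.text.Normalizer` (NFD decomposition, then removal of combining marks), treats the whole string as a single keyword-style token, and lower-cases it. The class name `FoldingSketch` and the methods `foldKey` and `matches` are hypothetical names chosen for this example. Lucene's ASCIIFoldingFilter also maps characters that have no decomposition (such as "ø"), which this stand-in does not handle.

```java
import java.text.Normalizer;
import java.util.Locale;

// Hypothetical illustration only; not part of jena-text or Lucene.
public final class FoldingSketch {
    private FoldingSketch() {}

    // Treat the whole input as one token, strip accents, and lower-case it.
    static String foldKey(String input) {
        // NFD splits "é" into "e" followed by a combining acute accent.
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        // Remove the combining marks left behind by the decomposition.
        String withoutMarks = decomposed.replaceAll("\\p{M}+", "");
        return withoutMarks.toLowerCase(Locale.ROOT);
    }

    // Two values match when their folded keys are identical.
    static boolean matches(String a, String b) {
        return foldKey(a).equals(foldKey(b));
    }
}
```

With this sketch, `FoldingSketch.matches("deja vu", "déjà vu")` evaluates to true, while a partial string such as "deja" still does not match "déjà vu", because the whole value is compared as one token.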

For some background on why I need this, see https://github.com/NatLibFi/Skosmos/issues/313

I already have an implementation ready and will make a PR shortly.

People

Assignee: Osma Suominen

Reporter: Osma Suominen

Votes: 0

Watchers: 3

Dates

Created: 29/Oct/15 15:14

Updated: 05/Nov/15 07:34

Resolved: 05/Nov/15 07:34