Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: modules/analysis
    • Labels:
      None
    • Lucene Fields:
      Patch Available

      Description

      A token filter to decompose compound words found in many Germanic languages (like German, Swedish, ...) into single tokens.

      An example: Donaudampfschiff would be decomposed to Donau, dampf, schiff so that you can find the word even when you only enter "Schiff".

      I use the hyphenation code from the Apache XML project FOP (http://xmlgraphics.apache.org/fop/) to do the first step of decomposition. Currently I use the FOP jars directly. I only use a handful of classes from the FOP project.
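      A rough sketch of how those two steps could fit together (an illustration only, not FOP or patch code; the hyphenationPoints array stands in for whatever the hyphenator returns, and the dictionary check is an assumption of this sketch): candidate subwords between hyphenation points are looked up in a lower-cased word list.

          import java.util.ArrayList;
          import java.util.List;
          import java.util.Set;

          // Keep every candidate subword (bounded by two hyphenation points)
          // that appears in the dictionary.
          public class HyphenationDecomposeSketch {

            static List<String> decompose(String word, int[] hyphenationPoints,
                                          Set<String> dictionary) {
              List<String> subwords = new ArrayList<String>();
              for (int i = 0; i < hyphenationPoints.length - 1; i++) {
                for (int j = i + 1; j < hyphenationPoints.length; j++) {
                  String candidate = word.substring(hyphenationPoints[i], hyphenationPoints[j]);
                  if (dictionary.contains(candidate.toLowerCase())) {
                    subwords.add(candidate);
                  }
                }
              }
              return subwords;
            }
          }

      For "Donaudampfschiff", assuming the hyphenator proposes break points {0, 5, 10, 16} and the word list contains "donau", "dampf" and "schiff", this would return Donau, dampf and schiff.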

      My question now:
      Would it be OK to copy these classes over to the Lucene project (renaming the packages, of course), or should I stick with the dependency on the FOP jars? The FOP code uses the ASF V2 license as well.

      What do you think?

      1. CompoundTokenFilter.patch
        106 kB
        Thomas Peuss
      2. CompoundTokenFilter.patch
        106 kB
        Thomas Peuss
      3. CompoundTokenFilter.patch
        105 kB
        Thomas Peuss
      4. CompoundTokenFilter.patch
        99 kB
        Thomas Peuss
      5. CompoundTokenFilter.patch
        90 kB
        Thomas Peuss
      6. CompoundTokenFilter.patch
        90 kB
        Thomas Peuss
      7. CompoundTokenFilter.patch
        91 kB
        Thomas Peuss
      8. CompoundTokenFilter.patch
        90 kB
        Thomas Peuss
      9. CompoundTokenFilter.patch
        85 kB
        Thomas Peuss
      10. CompoundTokenFilter.patch
        76 kB
        Thomas Peuss
      11. CompoundTokenFilter.patch
        71 kB
        Thomas Peuss
      12. de.xml
        48 kB
        Thomas Peuss
      13. hyphenation.dtd
        3 kB
        Thomas Peuss

          Activity

          Thomas Peuss added a comment -

          A preliminary version of the token filter.

          Thomas Peuss added a comment -

          A hyphenation grammar. You can download them from: http://downloads.sourceforge.net/offo/offo-hyphenation.zip?modtime=1168687306&big_mirror=0
          Thomas Peuss added a comment -

          The DTD describing the hyphenation grammar XML files.

          Steve Rowe added a comment -

          Hi Thomas,

          Looking at http://offo.sourceforge.net/hyphenation/licenses.html, which seems to be the same information as in the offo-hyphenation.zip file you attached to this issue, the license issue may be a problem - the hyphenation data is covered by different licenses on a per-language basis. For example, there are two German data files, and both are licensed under a LaTeX license, as is the Danish file, and these two languages are the most likely targets for your TokenFilter. IANAL, but unless Apache licenses can be secured for this data, I don't think the files can be incorporated directly into an Apache project.

          Also, I don't see Swedish among the hyphenation data licenses - is it covered in some other way?

          Thomas Peuss added a comment -

          Looking at http://offo.sourceforge.net/hyphenation/licenses.html, which seems to be the same information as in the off-hyphenation.zip file you attached to this issue, the license issue may be a problem - the hyphenation data is covered by different licenses on a per-language basis. For example, there are two German data files, and both are licensed under a LaTeX license, as is the Danish file, and these two languages are the most likely targets for your TokenFilter. IANAL, but unless Apache licenses can be secured for this data, I don't think the files can be incorporated directly into an Apache project.

          This is true. And that is why I uploaded the two files without the ASF license grant. The FOP project does not ship these files in its code base either, because of the licensing problem.

          Also, I don't see Swedish among the hyphenation data licenses - is it covered in some other way?

          OFFO has no Swedish grammar file. We can generate a Swedish grammar file from the LaTeX grammar files. I will have a look into this tonight.

          All other hyphenation implementations I have found so far use them either directly or in a converted variant, like the FOP code. What we can do, of course, is ask the authors of the LaTeX files whether they want to license their work under the ASF license as well. It is worth a try. But I suppose that many email addresses in the LaTeX files are no longer in use. I will try to contact the authors of the German grammar files tomorrow.

          BTW: an example for those that don't want to try the patch:
          Input token stream:
          Rindfleischüberwachungsgesetz Drahtschere abba

          Output token stream:
          (Rindfleischüberwachungsgesetz,0,29)
          (Rind,0,4,posIncr=0)
          (fleisch,4,11,posIncr=0)
          (überwachung,11,22,posIncr=0)
          (gesetz,23,29,posIncr=0)
          (Drahtschere,30,41)
          (Draht,30,35,posIncr=0)
          (schere,35,41,posIncr=0)
          (abba,42,46)

          Thomas Peuss added a comment -

          A Swedish hyphenation grammar is available at http://www.peuss.de/node/64

          Thomas Peuss added a comment -

          Changes:

          • added a unit test
          • minor tweaks for getting the encoding of the XML files right
          Thomas Peuss added a comment -

          Updated version:

          • new "dumb" decomposition filter
            • uses a brute-force approach, generating substrings and checking them against the dictionary (see the sketch after this list)
            • seems to work better for languages that have no patterns file with a lot of special cases
            • is roughly 3 times slower than the decomposition filter using hyphenation patterns
            • no licensing problems, because it does not need the hyphenation pattern files
          • refactoring to have all methods used by both decomposition filters in one place
          • minor performance improvements
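          A minimal sketch of that brute-force idea, assuming a plain lower-cased word set as the dictionary (illustration only, not the patch code):

              import java.util.ArrayList;
              import java.util.List;
              import java.util.Set;

              // Try every substring between minSubwordSize and maxSubwordSize
              // characters long and keep the ones found in the dictionary.
              public class BruteForceDecomposeSketch {

                static List<String> decompose(String token, Set<String> dictionary,
                                              int minSubwordSize, int maxSubwordSize) {
                  String lower = token.toLowerCase(); // lower-case the token only once
                  List<String> subwords = new ArrayList<String>();
                  for (int start = 0; start + minSubwordSize <= lower.length(); start++) {
                    int maxEnd = Math.min(lower.length(), start + maxSubwordSize);
                    for (int end = start + minSubwordSize; end <= maxEnd; end++) {
                      if (dictionary.contains(lower.substring(start, end))) {
                        subwords.add(token.substring(start, end));
                      }
                    }
                  }
                  return subwords;
                }
              }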
          Otis Gospodnetic added a comment -

          I haven't looked at the patch.
          But I'm wondering if a similar approach could be used for, say, word segmentation in Chinese?
          That is, iterate through a string of Chinese characters, buffering them and looking up the buffered string in a Chinese dictionary. Once there is a dictionary match, and the addition of the following character results in a string that has no entry in the dictionary, that previous buffered string can be considered a word/token.

          I'm not sure if your patch does something like this, but if it does, I am wondering if it is general enough that what you did can be used (as the basis of) word segmentation for Chinese, and thus for a Chinese Analyzer that's not just a dumb n-gram Analyzer (which is what we have today).
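          A sketch of the dictionary-driven segmentation described above, slightly generalized so that each position keeps the longest dictionary entry starting there rather than stopping at the first miss (names and structure are made up for this illustration):

              import java.util.ArrayList;
              import java.util.List;
              import java.util.Set;

              public class LongestMatchSegmentSketch {

                static List<String> segment(String text, Set<String> dictionary) {
                  List<String> words = new ArrayList<String>();
                  int pos = 0;
                  while (pos < text.length()) {
                    int matchEnd = -1;
                    // Buffer one more character at a time and remember the longest
                    // buffered string that is a dictionary entry.
                    for (int end = pos + 1; end <= text.length(); end++) {
                      if (dictionary.contains(text.substring(pos, end))) {
                        matchEnd = end;
                      }
                    }
                    if (matchEnd == -1) {
                      words.add(text.substring(pos, pos + 1)); // unknown character: emit it alone
                      pos++;
                    } else {
                      words.add(text.substring(pos, matchEnd));
                      pos = matchEnd;
                    }
                  }
                  return words;
                }
              }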

          Thomas Peuss added a comment -

          But I'm wondering if a similar approach could be used for, say, word segmentation in Chinese? That is, iterate through a string of Chinese characters, buffering them and looking up the buffered string in a Chinese dictionary. Once there is a dictionary match, and the addition of the following character results in a string that has no entry in the dictionary, that previous buffered string can be considered a word/token.

          I'm not sure if your patch does something like this, but if it does, I am wondering if it is general enough that what you did can be used (as the basis of) word segmentation for Chinese, and thus for a Chinese Analyzer that's not just a dumb n-gram Analyzer (which is what we have today).

          Currently the code adds a token to the stream when an n-gram of the current token in the token stream matches a word in the dictionary (I am only speaking about the DumbCompoundWordTokenFilter, because I doubt that hyphenation patterns exist for Chinese languages). I don't know much about the structure of Chinese characters to answer this question in detail. You can have a look at the test case in the patch to see how the filters work.

          Otis Gospodnetic added a comment -

          Thomas, I think that might work for Chinese - going through the "string" of Chinese characters, one at a time, and looking up a dictionary after each additional character. Once you find a dictionary match, you look at one more character. If that matches a dictionary entry, keep doing that as long as you keep matching dictionary entries (in order to grab the longest dictionary-matching string of characters). If the next character does not match, then the previous/last character was the end of the dictionary entry.
          That would work, no?

          As for the license info, I think you could take the approach where the required libraries are not included in the contribution in the ASF repo, but are downloaded on the fly, at build time, much like some other contributions. Could you do that?

          Thomas Peuss added a comment -

          Thomas, I think that might work for Chinese - going through the "string" of Chinese characters, one at a time, and looking up a dictionary after each additional character. Once you find a dictionary match, you look at one more character. If that matches a dictionary entry, keep doing that as long as you keep matching dictionary entries (in order to grab the longest dictionary-matching string of characters). If the next character does not match, then the previous/last character was the end of the dictionary entry. That would work, no?

          I have started to look into this. I will add the constructor parameter "onlyLongestMatch" (default is false).

          As for the license info, I think you could take the approach where the required libraries are not included in the contribution in the ASF repo, but are downloaded on the fly, at build time, much like some other contributions. Could you do that?

          I pull the grammar files for the tests already. But I don't know if it makes sense to pull them at build time because the end user can easily download them. I need the XML versions now - so the jar file from Sourceforge does not help anymore (I have included the needed classes from the FOP project - they use the ASF license as well).

          Thomas Peuss added a comment -

          Updated patch according to Otis's suggestion for longest match.

          Next step: move to contrib

          Thomas Peuss added a comment -

          Moved the compound word tokenfilter stuff to contrib.

          Thomas Peuss added a comment -

          Moved compound word token filter to contrib.

          Thomas Peuss added a comment -

          Dropped Java5 dependencies.

          Thomas Peuss added a comment -

          Fixed a compilation bug in the testcase.

          Grant Ingersoll added a comment -

          I pull the grammar files for the tests already. But I don't know if it makes sense to pull them at build time because the end user can easily download them. I need the XML versions now - so the jar file from Sourceforge does not help anymore (I have included the needed classes from the FOP project - they use the ASF license as well).

          I think they have to be downloaded automatically, otherwise the automated tests, etc. will not run. I applied the patch and ran "ant test" and it fails because I didn't download the files.

          Also, much of the code has author tags that are not you. I am assuming you got it from FOP per your comments above, but can you explicitly mark all the files as to their origin?

          Thomas Peuss added a comment -

          All files in the package org.apache.lucene.analysis.compound.hyphenation are from the FOP project (ASF licensed as well). Should I add a comment to them stating where they come from? All other files are from me. I will check why it fails when you run "ant test" by downloading a fresh copy of Lucene trunk.

          Thomas Peuss added a comment -

          The error is

              [junit] Testsuite: org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter
              [junit] Tests run: 4, Failures: 0, Errors: 2, Time elapsed: 2,139 sec
              [junit]
              [junit] Testcase: testHyphenationCompoundWordsDE(org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter):  Caused an ERROR
              [junit] File not found: /home/thomas/projects/lucene-trunk-compound/hyphenation.dtd (No such file or directory)
              [junit] org.apache.lucene.analysis.compound.hyphenation.HyphenationException: File not found: /home/thomas/projects/lucene-trunk-compound/hyphenation.dtd (No such file or directory)
              [junit]     at org.apache.lucene.analysis.compound.hyphenation.PatternParser.parse(PatternParser.java:123)
              [junit]     at org.apache.lucene.analysis.compound.hyphenation.HyphenationTree.loadPatterns(HyphenationTree.java:138)
              [junit]     at org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter.getHyphenationTree(HyphenationCompoundWordTokenFilter.java:142)
              [junit]     at org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter.testHyphenationCompoundWordsDE(TestCompoundWordTokenFilter.java:70)
              [junit]
              [junit]
              [junit] Testcase: testHyphenationCompoundWordsDELongestMatch(org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter):      Caused an ERROR
              [junit] File not found: /home/thomas/projects/lucene-trunk-compound/hyphenation.dtd (No such file or directory)
              [junit] org.apache.lucene.analysis.compound.hyphenation.HyphenationException: File not found: /home/thomas/projects/lucene-trunk-compound/hyphenation.dtd (No such file or directory)
              [junit]     at org.apache.lucene.analysis.compound.hyphenation.PatternParser.parse(PatternParser.java:123)
              [junit]     at org.apache.lucene.analysis.compound.hyphenation.HyphenationTree.loadPatterns(HyphenationTree.java:138)
              [junit]     at org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter.getHyphenationTree(HyphenationCompoundWordTokenFilter.java:142)
              [junit]     at org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter.testHyphenationCompoundWordsDELongestMatch(TestCompoundWordTokenFilter.java:96)
              [junit]
              [junit]
              [junit] Test org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter FAILED
          

          So it does not find the hyphenation.dtd. I have to investigate how I can make that DTD known to the parser without copying hyphenation.dtd to Lucene's base directory.
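          One common way to do that (a sketch, assuming the pattern files are read through a SAX parser; this is not the code that ended up in the patch) is to install an org.xml.sax.EntityResolver that serves the DTD from the classpath instead of letting the parser resolve it relative to the working directory:

              import java.io.InputStream;
              import org.xml.sax.EntityResolver;
              import org.xml.sax.InputSource;

              // Serves the hyphenation.dtd that ships next to this class on the
              // classpath whenever the parser asks for it; otherwise returns null
              // so the parser falls back to its default resolution.
              public class ClasspathDtdResolver implements EntityResolver {

                public InputSource resolveEntity(String publicId, String systemId) {
                  if (systemId != null && systemId.endsWith("hyphenation.dtd")) {
                    InputStream in =
                        ClasspathDtdResolver.class.getResourceAsStream("hyphenation.dtd");
                    if (in != null) {
                      return new InputSource(in);
                    }
                  }
                  return null;
                }
              }

          The resolver would then be registered on the XMLReader with setEntityResolver() before the grammar file is parsed.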

          Thomas Peuss added a comment -
          • Fixed the problem with the hyphenation.dtd file that was not found
          • Removed all @author tags
          • Added a note to all files I copied from the FOP project
          • Added package.html files (not very much in there - but credits for the FOP project)
          Grant Ingersoll added a comment -

          So, why would I ever want to use a "Dumb" compound filter? Any suggestions for a better name? No need for a patch, I can just make the change.

          Grant Ingersoll added a comment -

          This looks pretty good, Thomas. I think the last bit that would be good is to add to the package docs a start-to-finish example of using it, kind of like in the test case. You might want to explain a little bit about where to get the hyphenation files, etc. (if I am understanding them correctly).

          I think if we can finish that up, we can look to commit.

          The other interesting thing here, as an aside, is the Ternary Tree might be worth pulling up to a "util" package (no need to do so now, just thinking out loud), as it could be used for other interesting things. For instance, see http://www.javaworld.com/javaworld/jw-02-2001/jw-0216-ternary.html The version we have needs a little work, but I have been thinking about how it might be used to improve spelling, etc.
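          For readers unfamiliar with the structure, a minimal ternary search tree sketch (not the FOP TernaryTree class itself): each node branches three ways on a single character, which keeps dictionary and spelling-style lookups compact.

              public class TernarySearchTreeSketch {

                private static class Node {
                  char c;
                  boolean wordEnd;   // true if a word ends at this node
                  Node lo, eq, hi;   // less-than, equal, greater-than branches
                  Node(char c) { this.c = c; }
                }

                private Node root;

                public void insert(String word) {
                  if (word.length() > 0) {
                    root = insert(root, word, 0);
                  }
                }

                private Node insert(Node node, String word, int i) {
                  char c = word.charAt(i);
                  if (node == null) {
                    node = new Node(c);
                  }
                  if (c < node.c) {
                    node.lo = insert(node.lo, word, i);
                  } else if (c > node.c) {
                    node.hi = insert(node.hi, word, i);
                  } else if (i + 1 < word.length()) {
                    node.eq = insert(node.eq, word, i + 1);
                  } else {
                    node.wordEnd = true;
                  }
                  return node;
                }

                public boolean contains(String word) {
                  if (word.length() == 0) {
                    return false;
                  }
                  Node node = root;
                  int i = 0;
                  while (node != null) {
                    char c = word.charAt(i);
                    if (c < node.c) {
                      node = node.lo;
                    } else if (c > node.c) {
                      node = node.hi;
                    } else if (i == word.length() - 1) {
                      return node.wordEnd;
                    } else {
                      node = node.eq;
                      i++;
                    }
                  }
                  return false;
                }
              }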

          Otis Gospodnetic added a comment -

          Ah, TST was in there, lovely! +1 to what Grant said about getting it into util later.
          Noticed a misspelling in javadoc while glancing at TST: hibrid -> hybrid

          Grant Ingersoll added a comment -

          Cool. It needs some work, IMO, to add more features per that article I sent, but no biggie.

          I saw some of those, but purposely left in the FOP typos... There were more than just that one.

          Thomas Peuss added a comment -

          So, why would I ever want to use a "Dumb" compound filter? Any suggestions for a better name? No need for a patch, I can just make the change.

          A better name would be DictionaryCompoundWordTokenFilter. I called it "Dumb" because it uses a brute-force approach. But DictionaryCompoundWordTokenFilter characterizes it better.

          Thomas Peuss added a comment -
          • Renamed DumbCompoundWordTokenFilter to DictionaryCompoundWordTokenFilter
          • Added more text to the package description file (package.html)
          • Removed some code that was necessary because of LUCENE-1163 (in HyphenationCompoundWordTokenFilter and DictionaryCompoundWordTokenFilter)
          Thomas Peuss added a comment -
          • Minor bugfix in DictionaryCompoundWordTokenFilter: it was not using the maxSubwordSize parameter
          • Major performance improvement for the DictionaryCompoundWordTokenFilter: we now convert all dictionary strings to lower case before adding them to the CharArraySet and set the ignoreCase parameter of CharArraySet to false. The filter makes a lower-case copy of the token before it starts working on it. This avoids many toLowerCase() calls in CharArraySet (see the sketch after this list).
          • Minor performance improvement for the HyphenationCompoundWordTokenFilter: see above
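          A small sketch of that "lower-case once" idea, with a plain HashSet standing in for CharArraySet (illustration only, not the patch code):

              import java.util.HashSet;
              import java.util.Set;

              public class LowerCaseDictionarySketch {

                private final Set<String> dictionary = new HashSet<String>();

                public LowerCaseDictionarySketch(String[] words) {
                  // Lower-case every dictionary entry exactly once, at construction time.
                  for (int i = 0; i < words.length; i++) {
                    dictionary.add(words[i].toLowerCase());
                  }
                }

                public boolean matchesSubword(String lowerCasedToken, int start, int end) {
                  // Callers lower-case the token once per token and reuse that copy,
                  // so no case conversion happens per candidate substring.
                  return dictionary.contains(lowerCasedToken.substring(start, end));
                }
              }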
          François Terrier added a comment -

          Is there any plan to integrate this patch into the official Lucene libraries in the short term?

          Grant Ingersoll added a comment -

          Yes.

          Grant Ingersoll added a comment - edited

          I'm now getting:
          ..../lucene/java/lucene-clean/contrib/analyzers/src/test/org/apache/lucene/analysis/compound/TestCompoundWordTokenFilter.java:60: warning: unmappable character for encoding utf-8
          [javac] "Aufgabe", "Überwachung" };

          Can you convert the classes in question to UTF-8 for the source?

          Thomas Peuss added a comment -

          UTF-8 problem fixed...

          Grant Ingersoll added a comment -

          Committed revision 657027.


            People

            • Assignee:
              Grant Ingersoll
            • Reporter:
              Thomas Peuss
            • Votes:
              0
            • Watchers:
              1
