Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
Description
Need to find out what needs to be done / if this is possible / how to do it / what are our options.
The current default OakAnalyzer uses the following configuration:
TokenStreamComponents( WordDelimiterFilter( LowerCaseFilter( StandardTokenizer( Reader ) ), GENERATE_WORD_PARTS | STEM_ENGLISH_POSSESSIVE | (indexOriginalTerm ? PRESERVE_ORIGINAL : 0) | GENERATE_NUMBER_PARTS ) )