Details
-
Task
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
-
None
Description
Over on TIKA-2790, we found that opennlp's language detector is far, far slower than Optimaize and yalder.
Let's use this ticket to see what we can do to improve lang detect's speed.
Attachments
Issue Links
- relates to
-
OPENNLP-1267 Allow the LanguageDetector to stop before processing the full string
- Closed
-
OPENNLP-1266 Limit normalization regexes in UrlCharSequenceNormalizer
- Closed
-
OPENNLP-1269 Add alternate to NGramModel that uses straight Strings rather than StringList
- Closed