Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
tools-1.5.3
-
None
Description
We need to improve how contractions are handled: some are expanded to more than 2 tokens. Also should force tokenization of named entities that has punctuations.