Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
tools-1.5.3
-
None
Description
Hyphenated tokens are not handled, for example "aprova-se" should be separated in two tokens. ADTokenSampleStream relies on a DictionaryDetokenizer, that don't know that "aprova-" "se" should be merged.