Details
-
Improvement
-
Status: Closed
-
Trivial
-
Resolution: Won't Fix
-
2.1
-
None
-
None
-
None
-
New, Patch Available
Description
This patch adds punctuation (comma, period, question mark and exclamation point) tokens as output from the StandardTokenizer, and filters them out in the StandardFilter.
(I needed them for text classification reasons.)