Details
Type: Improvement
Status: Closed
Priority: Minor
Resolution: Delivered
Description
It turned out that Doccat's bag-of-words feature generator can be very sensitive to numbers when used for language identification. Numbers should therefore be excluded from the bag of words.
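A minimal sketch of the idea, not the actual patch: a bag-of-words generator that emits a feature only for tokens made up entirely of letters, so numeric tokens never reach the model. The class name `LetterOnlyBagOfWords`, the helper `isAllLetters`, and the `bow=` feature prefix are assumptions made for illustration and are not taken from this issue.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical, self-contained illustration; it does not use the OpenNLP API.
final class LetterOnlyBagOfWords {

    // True only if the token is non-empty and every code point is a letter.
    // Note: this also rejects mixed tokens such as "24h" and pure punctuation.
    static boolean isAllLetters(String token) {
        return !token.isEmpty() && token.codePoints().allMatch(Character::isLetter);
    }

    // Emits one "bow=<token>" feature per all-letter token, skipping the rest.
    static List<String> extractFeatures(String[] tokens) {
        List<String> features = new ArrayList<>();
        for (String token : tokens) {
            if (isAllLetters(token)) {
                features.add("bow=" + token); // assumed feature prefix
            }
        }
        return features;
    }

    public static void main(String[] args) {
        String[] tokens = {"Room", "101", "is", "open", "24h"};
        // prints [bow=Room, bow=is, bow=open]
        System.out.println(extractFeatures(tokens));
    }
}
```

Filtering on "all letters" rather than "contains no digits" is a design choice in this sketch. It is stricter, since it drops punctuation tokens as well, which may or may not be desirable for language identification.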