Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Cannot Reproduce
-
tools-1.5.3
-
None
-
None
-
Patch, Important
Description
According to the documentation, the following should work
bin/opennlp TokenNameFinderConverter conll02 -data esp.train -lang es -types per > es_corpus_train_persons.txt
However currently it delivers error message since it expects 3 columns instead of 2 that are in the dataset.
This is a bug, introduced at line 130 of opennlp.tools.formats.Conll02NameSampleStream.java where a length of 3 is imposed.
Attachments
Issue Links
- relates to
-
OPENNLP-1191 achieve compatibility with stanford 2 column input
- Closed