Details
-
New Feature
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
Description
Basic language support includes:
- An Chinese tokenizer that creates Tokens for the AnalyzedText content part
- LabelTokenizer for the EntityLinking engine that supports Chinese text
- add support for Chinese specific Solr fieldType to the Stanbol Entityhub so that Chinese text is correctly indexed
This will bring basic EntityLinking support for Chinese texts to Apache Stanbol. This means that parsed Text is correctly tokenized and tokens can be matched with controlled vocabularies.
Attachments
Issue Links
- relates to
-
STANBOL-875 Add support for Paoding (Chinese)
- Closed