  1. Stanbol (Retired)
  2. STANBOL-855

Add basic language support for Chinese

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: commons-0.11.0
    • Component/s: Commons
    • Labels: None

    Description

      Basic language support includes:

      • A Chinese tokenizer that creates Tokens for the AnalyzedText content part
      • A LabelTokenizer for the EntityLinking engine that supports Chinese text
      • Support for a Chinese-specific Solr fieldType in the Stanbol Entityhub, so that Chinese text is indexed correctly

      This will bring basic EntityLinking support for Chinese text to Apache Stanbol: parsed text is tokenized correctly, and the resulting tokens can be matched against controlled vocabularies.
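
      Chinese text has no whitespace between words, so a whitespace tokenizer would return a whole sentence as one token. As a rough illustration of the tokenization step only, the sketch below splits runs of Han characters into overlapping two-character tokens (the bigram approach used by CJK analyzers) and keeps other letter/digit runs as single tokens. This is an assumption made for illustration: the issue does not say which tokenizer Stanbol uses, and the class and method names (HanBigramTokenizer, Token, tokenize) are hypothetical, not Stanbol or Solr API.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch: bigram tokenization of Han text, not the Stanbol implementation. */
public class HanBigramTokenizer {

    /** A token plus its character offsets in the source text (end is exclusive). */
    public static final class Token {
        public final String text;
        public final int start;
        public final int end;

        Token(String text, int start, int end) {
            this.text = text;
            this.start = start;
            this.end = end;
        }

        @Override
        public String toString() {
            return text + "[" + start + "," + end + ")";
        }
    }

    private static boolean isHan(int codePoint) {
        return Character.UnicodeScript.of(codePoint) == Character.UnicodeScript.HAN;
    }

    public static List<Token> tokenize(String text) {
        List<Token> tokens = new ArrayList<>();
        int i = 0;
        while (i < text.length()) {
            int cp = text.codePointAt(i);
            if (isHan(cp)) {
                // Collect the start offset of every Han character in this run,
                // followed by the offset just past the end of the run.
                List<Integer> offsets = new ArrayList<>();
                int j = i;
                while (j < text.length() && isHan(text.codePointAt(j))) {
                    offsets.add(j);
                    j += Character.charCount(text.codePointAt(j));
                }
                offsets.add(j);
                int hanCount = offsets.size() - 1;
                if (hanCount == 1) {
                    // A lone Han character becomes a single-character token.
                    tokens.add(new Token(text.substring(i, j), i, j));
                } else {
                    // Emit overlapping bigrams: chars (0,1), (1,2), (2,3), ...
                    for (int k = 0; k + 2 <= hanCount; k++) {
                        int s = offsets.get(k);
                        int e = offsets.get(k + 2);
                        tokens.add(new Token(text.substring(s, e), s, e));
                    }
                }
                i = j;
            } else if (Character.isLetterOrDigit(cp)) {
                // Non-Han letters and digits: one token per contiguous run.
                int j = i;
                while (j < text.length()) {
                    int c = text.codePointAt(j);
                    if (!Character.isLetterOrDigit(c) || isHan(c)) {
                        break;
                    }
                    j += Character.charCount(c);
                }
                tokens.add(new Token(text.substring(i, j), i, j));
                i = j;
            } else {
                // Whitespace and punctuation separate tokens and are dropped.
                i += Character.charCount(cp);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        for (Token t : tokenize("北京大学 Stanbol")) {
            System.out.println(t);
        }
    }
}
```

      The offsets matter because tokens in the AnalyzedText content part refer back to spans of the original text. Overlapping bigrams trade precision for recall; a dictionary-based or statistical word segmenter would produce fewer, more meaningful tokens for matching against vocabulary labels.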

            People

              Assignee: Rupert Westenthaler (rwesten)
              Reporter: Rupert Westenthaler (rwesten)
              Votes: 0
              Watchers: 1
