Uploaded image for project: 'OpenNLP'
  1. OpenNLP
  2. OPENNLP-854

Character changes during Sentence Detector Training

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Won't Fix
    • tools-1.5.3
    • None
    • Sentence Detector
    • Windows 7 using opennlp-tools-1.5.3.jar directly from Command Prompt and Powershell

    Description

      When I try to learn a model for detecting Sentences for Persian language, the Unicode character U+0641 changes to something else and every occurrences of this char in the input text is changing to an unknown character in output text.

      Attachments

        Activity

          People

            Unassigned Unassigned
            lingwanderer dave hey
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: