Uploaded image for project: 'UIMA'
  1. UIMA
  2. UIMA-2947

Improve format of multi-word entries in dictionary files

    XMLWordPrintableJSON

Details

    Description

      Using a single character to separate tokens in a Dictionary Annotator's dictionary file is not XML like. It looks like a remnant from old comma-separated-value days. So remove multiWordSeparator from dictionaryMetaData and let an entry look like <entry><key><token>AOL</token><token>Mail</token></key></entry> or <entry><key><token>azbuz</token><token>.</token><token>com</token></key></entry>. By the way, what is <key> good for? Do we need it?

      Attachments

        Activity

          People

            Unassigned Unassigned
            ithohd9u Armin Wegner
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: