Uploaded image for project: 'UIMA'
  1. UIMA
  2. UIMA-4453

MARKTABLE action works improperly

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.3.0ruta
    • 2.3.1ruta
    • Ruta
    • None
    • OS X 10.9.1, Java v8u45, Eclipse Luna
      Windows 7, Java v8u45, Eclipse Luna

    Description

      New available UIMA Ruta Runtime 2.7.0 & Workbench 2.3.0 for Eclipse has lost proper functionality of MARKTABLE action. This action stopped annotating of all words from a csv file. I had noticed that the problem happened only for words written in Cyrillic witch contains spaces, i.e. for Latin it works fine. Please use sample outlined below in order to reproduce the problem i'm talking about.

      1. script/main.ruta
        WORDTABLE Dict = 'dict.csv';
        DECLARE Annotation Test (STRING meaning);
        Document {-> MARKTABLE(Test,1,Dict, "meaning" = 2)};
      1. resources/dict.csv
        від;from
        с какой стати;why
        с которой;fromWhich
        сюда;here
        по какому;which
        сюди;here
        как нибудь;somehow
        сколько;howMuch
      1. input/test.txt
        від с какой стати с которой сюда по какому сюди как нибудь сколько

      After main.ruta script execution we wont get annotated everything from test.txt Worth mentioning that Cyrillic letter like 'с' at the beginning of string, somehow affecting on processing behavior. Moreover, by removing lines with spaces, will get rid us from the issue described above.

      Attachments

        1. ruta-2.3.1-SNAPSHOT.tar.gz
          9.42 MB
          Oleg Fedoriaka

        Activity

          People

            pkluegl Peter Klügl
            submedia Oleg Fedoriaka
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 96h
                96h
                Remaining:
                Remaining Estimate - 96h
                96h
                Logged:
                Time Spent - Not Specified
                Not Specified