OpenNLP / OPENNLP-1013

[OpenNLP][R Language][1.5.3-2] Bug when using French models

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: tools-1.5.3
    • Fix Version/s: None
    • Component/s: POS Tagger
    • Labels: None
    • Environment: R Language, RStudio

      Description

      When using French models in R, I receive a "subscript out of bounds" error. Details follow:

      -------------------------

      I'm using French models for NLP in the R environment. To get the French models, I'm using binaries compiled and developed by Nicolas Hernandez:

      https://sites.google.com/site/nicolashernandez/resources/opennlp
      http://enicolashernandez.blogspot.fr/2012/12/apache-opennlp-fr-models.html
      https://drive.google.com/drive/folders/0B4AyWQriFkxgWHR6QzlvcmxmdE0

      -------------------------

      The problem occurs only with the POS tagging function. This is how I call the function, followed by the resulting error:

      Maxent_POS_Tag_Annotator(language = "fr", probs = TRUE, model = paste0(<path_folder_with_bins>, "fr-pos.bin"))

      Issue:
      Error in environment(f)$meta[[tag]] : subscript out of bounds

      -------------------------

      However, if I remove the language parameter, the error no longer occurs:

      Maxent_POS_Tag_Annotator(probs = TRUE, model = paste0(<path_folder_with_bins>, "fr-pos.bin"))

            People

            • Assignee: Unassigned
            • Reporter: iurinog Iuri Deolindo Nogueira
            • Votes: 0
            • Watchers: 3