  1. OpenNLP
  2. OPENNLP-764

UIMA NameFinder Annotator should call clearAdaptiveData

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Invalid
    • Affects Version/s: 1.7.0
    • Fix Version/s: None
    • Component/s: UIMA Integration
    • Labels:
      None

      Description

      According to the documentation (http://opennlp.apache.org/documentation/1.5.3/manual/opennlp.html#tools.namefind.recognition.api) and the discussion in OPENNLP-627, the adaptive data has to be cleared by calling clearAdaptiveData() after each document has been processed.

      This call is missing in the UIMA integration, which causes severe degradation when processing more than a few thousand documents.

      The following patch fixes it:

      Index: opennlp-uima/src/main/java/opennlp/uima/namefind/NameFinder.java
      ===================================================================
      --- opennlp-uima/src/main/java/opennlp/uima/namefind/NameFinder.java (revision 1670422)
      +++ opennlp-uima/src/main/java/opennlp/uima/namefind/NameFinder.java (working copy)
      @@ -169,6 +169,8 @@
      documentConfidence.add(prob);
      }

      + mNameFinder.clearAdaptiveData();
      +
      return names;
      }
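
      For readers unfamiliar with the pattern the patch is after, here is a minimal, self-contained sketch. It does not use OpenNLP itself: StubNameFinder is a hypothetical stand-in that only imitates the two calls relevant here, find(String[]) and clearAdaptiveData(), both of which exist on opennlp.tools.namefind.NameFinderME. The retainedAfter helper and the way the stub stores its state are assumptions made for illustration only.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class AdaptiveDataSketch {

    /** Hypothetical stand-in for a name finder with adaptive state; NOT the OpenNLP class. */
    static class StubNameFinder {
        // State carried over between find() calls, analogous to adaptive data.
        private final Set<String> previouslySeen = new HashSet<>();

        /** Imitates find(): remembers every token and returns the capitalized ones. */
        List<String> find(String[] tokens) {
            List<String> names = new ArrayList<>();
            for (String token : tokens) {
                if (!token.isEmpty() && Character.isUpperCase(token.charAt(0))) {
                    names.add(token);
                }
                previouslySeen.add(token);
            }
            return names;
        }

        /** Imitates clearAdaptiveData(): forgets everything seen so far. */
        void clearAdaptiveData() {
            previouslySeen.clear();
        }

        int adaptiveDataSize() {
            return previouslySeen.size();
        }
    }

    /**
     * Processes documents (each an array of tokenized sentences) and returns
     * how many tokens the finder still remembers after the last document.
     */
    static int retainedAfter(String[][][] documents, boolean clearPerDocument) {
        StubNameFinder finder = new StubNameFinder();
        for (String[][] document : documents) {
            for (String[] sentence : document) {
                finder.find(sentence);
            }
            if (clearPerDocument) {
                // The call the report asks for: reset state at each document boundary.
                finder.clearAdaptiveData();
            }
        }
        return finder.adaptiveDataSize();
    }

    public static void main(String[] args) {
        String[][][] docs = {
            {{"Pierre", "Vinken", "is", "61"}},
            {{"Mr", "Vinken", "is", "chairman"}}
        };
        System.out.println("cleared per document: " + retainedAfter(docs, true));
        System.out.println("never cleared:        " + retainedAfter(docs, false));
    }
}
```

      Without the per-document reset, the stub's state keeps growing across documents, which is the kind of accumulation the report attributes the degradation to. Whether the reset belongs inside the annotator or elsewhere in the pipeline is a separate question that this sketch does not settle.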

            People

            • Assignee:
              teofili Tommaso Teofili
              Reporter:
              pdubouekeatext Pablo Duboue (KeaText)
            • Votes:
              0
              Watchers:
              2