Uploaded image for project: 'OpenNLP'
  1. OpenNLP
  2. OPENNLP-673

Doccat NGramFeatureGenerator not working

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • tools-1.5.3
    • 1.6.0
    • Doccat
    • None

    Description

      Training a document categorizer module using NGramFeatureGenerator always fails. All events are dropped.

      (...)
      Dropped event talk.religion.misc:[From:, (...)
      Dropped event talk.religion.misc:[From:, (...)
      done.
      Sorting and merging events...

      Exception in thread "main" java.lang.NullPointerException
      at opennlp.tools.ml.maxent.GISTrainer.trainModel(GISTrainer.java:264)
      at opennlp.tools.ml.maxent.GIS.trainModel(GIS.java:298)
      at opennlp.tools.ml.maxent.GIS.doTrain(GIS.java:83)
      at opennlp.tools.ml.maxent.GIS.doTrain(GIS.java:1)
      at opennlp.tools.ml.AbstractEventTrainer.train(AbstractEventTrainer.java:93)
      at opennlp.tools.ml.model.TrainUtil.train(TrainUtil.java:53)
      at opennlp.tools.doccat.DocumentCategorizerME.train(DocumentCategorizerME.java:116)
      at opennlp.tools.doccat.DoccatCrossValidator.evaluate(DoccatCrossValidator.java:72)
      at opennlp.tools.cmdline.doccat.DoccatCrossValidatorTool.run(DoccatCrossValidatorTool.java:99)
      at opennlp.tools.cmdline.CLI.main(CLI.java:224)
      Done indexing.
      Incorporating indexed data for training...

      Attachments

        Activity

          People

            colen William Colen
            colen William Colen
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: