OpenNLP / OPENNLP-81

Add a CLI tool for doccat evaluation support

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.6.0
    • Labels:
      None

      Description

      There should be a command-line tool for evaluating a document categorizer model
      on a test file.

        Activity

        Joern Kottmann created issue

        William Colen added a comment (edited)

        Created the evaluation CLI:

        $ bin/opennlp DoccatEvaluator
        Usage: opennlp DoccatEvaluator[.leipzig] [-reportOutputFile outputFile] [-misclassified true|false] -model model -data sampleData [-encoding charsetName]

        Arguments description:
        -reportOutputFile outputFile
        the path of the fine-grained report file.
        -misclassified true|false
        if true will print false negatives and false positives.
        -model model
        the model file to be evaluated.
        -data sampleData
        data to be used, usually a file name.
        -encoding charsetName
        encoding for reading and writing text, if absent the system default is used.

        The report written to the -reportOutputFile path includes the F-measure for each category and a confusion matrix.
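
        To make the two report quantities concrete, here is a minimal sketch of how a confusion matrix and a per-category F-measure can be derived from (reference, predicted) category pairs. It is an illustration under stated assumptions, not the code OpenNLP uses to build its report: the class name DoccatReportSketch, its methods, and the sample categories are all hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration; this class is not part of OpenNLP.
class DoccatReportSketch {

    // Builds matrix.get(reference).get(predicted) = number of documents.
    // Each pair is {referenceCategory, predictedCategory}.
    static Map<String, Map<String, Integer>> confusionMatrix(List<String[]> pairs) {
        Map<String, Map<String, Integer>> matrix = new TreeMap<>();
        for (String[] pair : pairs) {
            matrix.computeIfAbsent(pair[0], k -> new TreeMap<>())
                  .merge(pair[1], 1, Integer::sum);
        }
        return matrix;
    }

    // F-measure (F1) for one category: 2*TP / (2*TP + FP + FN).
    // Returns 0.0 when the category never occurs as reference or prediction.
    static double fMeasure(Map<String, Map<String, Integer>> matrix, String category) {
        int tp = 0;
        int fp = 0;
        int fn = 0;
        for (Map.Entry<String, Map<String, Integer>> row : matrix.entrySet()) {
            boolean referenceMatches = row.getKey().equals(category);
            for (Map.Entry<String, Integer> cell : row.getValue().entrySet()) {
                boolean predictedMatches = cell.getKey().equals(category);
                if (referenceMatches && predictedMatches) {
                    tp += cell.getValue();   // correctly labelled
                } else if (predictedMatches) {
                    fp += cell.getValue();   // false positive for this category
                } else if (referenceMatches) {
                    fn += cell.getValue();   // false negative for this category
                }
            }
        }
        int denominator = 2 * tp + fp + fn;
        return denominator == 0 ? 0.0 : (2.0 * tp) / denominator;
    }

    public static void main(String[] args) {
        // Invented sample data: {reference, predicted}.
        List<String[]> pairs = new ArrayList<>();
        pairs.add(new String[] {"sports", "sports"});
        pairs.add(new String[] {"sports", "sports"});
        pairs.add(new String[] {"sports", "politics"});
        pairs.add(new String[] {"politics", "politics"});
        pairs.add(new String[] {"politics", "sports"});

        Map<String, Map<String, Integer>> matrix = confusionMatrix(pairs);
        System.out.println("Confusion matrix (reference -> predicted): " + matrix);
        // Only categories that occur as a reference label are listed here.
        for (String category : matrix.keySet()) {
            System.out.printf("%s F-measure: %.3f%n", category, fMeasure(matrix, category));
        }
    }
}
```

        The off-diagonal cells of the matrix are the misclassified documents, which is the same information the -misclassified option describes as false negatives and false positives.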

        William Colen made changes
        Field            Original Value    New Value
        Assignee                           William Colen [ colen ]

        William Colen made changes
        Field            Original Value    New Value
        Status           Open [ 1 ]        Resolved [ 5 ]
        Fix Version/s                      1.6.0 [ 12316450 ]
        Resolution                         Fixed [ 1 ]

          People

          • Assignee:
            William Colen
            Reporter:
            Joern Kottmann
          • Votes:
            0
            Watchers:
            1
