Uploaded image for project: 'OpenNLP'
  1. OpenNLP
  2. OPENNLP-233

Parser produces "log probabilities" that are positive

    XMLWordPrintableJSON

    Details

      Description

      Using the distributed version and the binary model from the sourceforge site, I see the following bad behaviour. This is bad because probabilities always should be <= 1, so log probabilities should be <= 0, which they clearly are not.

      Script started on Mon Jul 18 19:34:36 2011
      bash-3.2$ bin/opennlp Parser -k 2 models/en-parser-chunking.bin
      Loading Parser model ... done (14.573s)
      The old are wise .
      0 0.06948959676790605 (TOP (S (NP (DT The) (JJ old)) (VP (VBP are) (ADJP (JJ wise))) (. .)))
      1 -1.3788870933108204 (TOP (S (NP (DT The) (JJ old)) (VP (VBP are) (ADVP (RB wise))) (. .)))
      The young are foolish .
      0 0.2094212498812974 (TOP (S (NP (DT The) (JJ young)) (VP (VBP are) (ADJP (JJ foolish))) (. .)))
      1 -2.2380713063683784 (TOP (S (NP (DT The) (NNP young)) (VP (VBP are) (ADJP (JJ foolish))) (. .)))
      ^D

      Average: 0.1 sent/s
      Total: 4 sent
      Runtime: 57.565s
      bash-3.2$ exit

      Script done on Mon Jul 18 19:35:56 2011

        Attachments

          Activity

            People

            • Assignee:
              joern Jörn Kottmann
              Reporter:
              cbrew Chris Brew
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - 48h
                48h
                Remaining:
                Remaining Estimate - 48h
                48h
                Logged:
                Time Spent - Not Specified
                Not Specified