Uploaded image for project: 'OpenNLP'
  1. OpenNLP
  2. OPENNLP-1446

Investigate why LeskEvaluatorTest and MFSEvaluatorTest fail while parsing 'EnglishLS.train'

VotersStop watchingWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      The LeskEvaluatorTest & MFSEvaluatorTest in the opennlp-wsd sandbox component both fail parsing the 'EnglishLS.train' file. The data is kept original, downloaded from https://web.eecs.umich.edu/~mihalcea/senseval/senseval3/data.html

      Aims:

      • Investigate what causes the xml parsing to fail
      • Fix it and make both existing tests pass
      • Optional: Improve the existing test code to be more strict.

      Note:

      The test setup to reproduce this is on a branch and to be merged into the main branch.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            mawiesne Martin Wiesner
            mawiesne Martin Wiesner
            Votes:
            0 Vote for this issue
            Watchers:
            2 Stop watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 5h
                5h

                Slack

                  Issue deployment