Uploaded image for project: 'OpenNLP'
  1. OpenNLP
  2. OPENNLP-1226

Training an NER model for dates with 'dd.mm.yyyy' as Date format

    XMLWordPrintableJSON

Details

    • Question
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 2.5.0
    • Name Finder
    • Important

    Description

      My txt file for model training has date tags in <START:date> dd.mm.yyyy <END> format. But when I try to use the trained .bin file, the dates are not extracted as they should. My txt tagged file is written one sentence in line. I was wondering maybe the format, and the fullstops in this date format make a difficulty for the model to learn. In the official OpenNLP documentation I can see there is a bin file with date extraction, but I can't see the txt file containing the tags.

      I tried to open this bin as a txt format but I read in Stack Overflow that I can't do that.

      https://stackoverflow.com/questions/26140492/how-can-i-view-the-content-of-a-bin-file-in-opennlp

      Attachments

        Activity

          People

            mawiesne Martin Wiesner
            seasoul92 Olga
            Votes:
            3 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 6h
                6h