Uploaded image for project: 'cTAKES'
  1. cTAKES
  2. CTAKES-373

MaxentParserWrapper can't handle section dividers: "=========="

    XMLWordPrintableJSON

Details

    • Important

    Description

      Notes often contain section "dividers" of a single text character, such as:
      "============================================"
      When the Constituency Parser hits these [sentences], it can churn for 30 seconds (in my runs). For 60 notes containing two of these lines, that is a solid hour of useless processing.

      There shouldn't be any downstream dependencies on such lines, so they shouldn't be parsed.

      Attachments

        Activity

          People

            seanfinan Sean Finan
            seanfinan Sean Finan
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: