Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-1739

cTAKESParser doesn't work in 1.11

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: None
    • Fix Version/s: 1.11
    • Component/s: parser, server
    • Labels:
      None

      Description

      Tika cTAKESParser integration doesn't work in 1.11. The parser is called, but blank metadata comes back:

      curl -T test.txt -H "Content-Type: text/plain" http://localhost:9999/rmeta/text
      [{"Content-Type":"text/plain","X-Parsed-By":["org.apache.tika.parser.CompositeParser","org.apache.tika.parser.ctakes.CTAKESParser","org.apache.tika.parser.EmptyParser"],"X-TIKA:parse_time_millis":"20371","ctakes:schema":"coveredText:start:end:ontologyConceptArr"}
      

      [~gagravarr] I wonder if something that happened in TIKA-1653 broke it?
      http://svn.apache.org/viewvc?view=revision&revision=1684199

      Giuseppe Totaro can you help me look here?

      I'm working on https://github.com/chrismattmann/shangridocs/tree/convert-wicket which is where I first saw this.

        Attachments

        1. TIKA-1739.patch
          0.6 kB
          Chris A. Mattmann

          Activity

            People

            • Assignee:
              chrismattmann Chris A. Mattmann
              Reporter:
              chrismattmann Chris A. Mattmann
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: