Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-1590

A particular PDF seems to trigger an infinite loop when being converted to HTML

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Duplicate
    • 1.6, 1.7
    • 1.8
    • None
    • None

    Description

      The PDF at http://www.comcare.gov.au/__data/assets/pdf_file/0019/117244/National_Audit_tool_CTH_Audit_Report_PDF,_292_KB.pdf (which I'll also attach) appears to trigger an infinite loop (or at least is exceedingly slow) when being filtered by Tika.

      java -jar tika-app-1.7.jar National_Audit_tool_CTH_Audit_Report_PDF,_292_KB.pdf
      <?xml version="1.0" encoding="UTF-8"?><html xmlns="http://www.w3.org/1999/xhtml">
      <head>
      <meta name="date" content="2015-02-05T04:48:30Z"/>
      <meta name="pdf:PDFVersion" content="1.6"/>
      <meta name="xmp:CreatorTool" content="Adobe InDesign CC 2014 (Macintosh)"/>
      <meta name="dc:description" content="Licensee Improvement"/>
      <meta name="Keywords" content="Licensee, Improvement, Program, NAT, CTH, Report&#13;&#10;"/>
      <meta name="subject" content="Licensee Improvement"/>
      <meta name="dc:creator" content="Comcare"/>
      <meta name="description" content="Licensee Improvement"/>
      <meta name="dcterms:created" content="2014-10-07T02:46:10Z"/>
      <meta name="Last-Modified" content="2015-02-05T04:48:30Z"/>
      <meta name="dcterms:modified" content="2015-02-05T04:48:30Z"/>
      <meta name="dc:format" content="application/pdf; version=1.6"/>
      <meta name="Last-Save-Date" content="2015-02-05T04:48:30Z"/>
      <meta name="meta:save-date" content="2015-02-05T04:48:30Z"/>
      <meta name="pdf:encrypted" content="false"/>
      <meta name="dc:title" content="Licensee Improvement Program NAT (CTH) audit report"/>
      <meta name="modified" content="2015-02-05T04:48:30Z"/>
      <meta name="cp:subject" content="Licensee Improvement"/>
      <meta name="Content-Length" content="299338"/>
      <meta name="Content-Type" content="application/pdf"/>
      <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"/>
      <meta name="X-Parsed-By" content="org.apache.tika.parser.pdf.PDFParser"/>
      <meta name="creator" content="Comcare"/>
      <meta name="meta:author" content="Comcare"/>
      <meta name="dc:subject" content="Licensee, Improvement, Program, NAT, CTH, Report&#13;&#10;"/>
      <meta name="trapped" content="False"/>
      <meta name="meta:creation-date" content="2014-10-07T02:46:10Z"/>
      <meta name="created" content="Tue Oct 07 13:46:10 AEDT 2014"/>
      <meta name="xmpTPg:NPages" content="72"/>
      <meta name="Creation-Date" content="2014-10-07T02:46:10Z"/>
      <meta name="resourceName" content="National_Audit_tool_CTH_Audit_Report_PDF,_292_KB.pdf"/>
      <meta name="meta:keyword" content="Licensee, Improvement, Program, NAT, CTH, Report&#13;&#10;"/>
      <meta name="Author" content="Comcare"/>
      <meta name="producer" content="Adobe PDF Library 11.0"/>
      <title>Licensee Improvement Program NAT (CTH) audit report</title>
      </head>
      <body><div class="page"><p/>
      <p>LICENSEE
      IMPROVEMENT
      PROGRAM
      
      [snip]
      
      </p>
      <p>Finding:
      </p>
      <p>Evidence:
      </p>
      <p>Comment:
      </p>
      <p>Observation:
      </p>
      <p>Non-conformance:
      </p>
      <p>
      
      [just appears to hand forever at this point]
      

      The relevant thread's stack is something like...

      "main" #1 prio=5 os_prio=31 tid=0x00007fbd6900b000 nid=0xf07 runnable [0x000000010fc18000]
         java.lang.Thread.State: RUNNABLE
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:184)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:190)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:190)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:190)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:190)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:190)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:190)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:190)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:190)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:190)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:190)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:190)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:190)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.findFieldType(PDField.java:179)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:157)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:178)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:178)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:178)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:178)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:178)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:178)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:178)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:178)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:178)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:178)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDField.getKids(PDField.java:550)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.isButton(PDFieldFactory.java:159)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDFieldFactory.createField(PDFieldFactory.java:68)
      	at org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.getFields(PDAcroForm.java:203)
      	at org.apache.tika.parser.pdf.PDF2XHTML.extractAcroForm(PDF2XHTML.java:586)
      	at org.apache.tika.parser.pdf.PDF2XHTML.endDocument(PDF2XHTML.java:212)
      	at org.apache.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:345)
      	at org.apache.tika.parser.pdf.PDF2XHTML.process(PDF2XHTML.java:134)
      	at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:146)
      	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256)
      	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256)
      	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
      	at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:146)
      	at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:440)
      	at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:116)
      

      Attachments

        1. National_Audit_tool_CTH_Audit_Report_PDF,_292_KB.pdf
          292 kB
          Matt Sheppard
        2. jstack.txt
          24 kB
          Matt Sheppard

        Issue Links

          Activity

            People

              Unassigned Unassigned
              mattsheppard Matt Sheppard
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: