Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-1561

PDFBox throws exception with PDFTextStripper.getText

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.7.0
    • 1.8.2
    • None
    • None

    Description

      I am using the .NET port of PDFBox 1.7.0. Calling PDFTextStripper::getText throws exception
      java.io.IOException: Not a number: +
      with callstack
      bei org.apache.pdfbox.pdfparser.PDFStreamParser$1.tryNext()
      bei org.apache.pdfbox.pdfparser.PDFStreamParser$1.hasNext()
      bei org.apache.pdfbox.util.PDFStreamEngine.processSubStream(COSStream )
      bei org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDPage pdp, PDResources pdr, COSStream coss)
      bei org.apache.pdfbox.util.PDFStreamEngine.processStream(PDPage pdp, PDResources pdr, COSStream coss)
      bei org.apache.pdfbox.util.PDFTextStripper.processPage(PDPage pdp, COSStream coss)
      bei org.apache.pdfbox.util.PDFTextStripper.processPages(List l)
      bei org.apache.pdfbox.util.PDFTextStripper.writeText(PDDocument pdd, Writer w)
      bei org.apache.pdfbox.util.PDFTextStripper.getText(PDDocument pdd)

      Attachments

        1. energieausweis.zip
          899 kB
          Markus Griesser

        Activity

          People

            lehmi Andreas Lehmkühler
            source2702 Markus Griesser
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: