Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-1792

Different metadata with NonSequentialPDFParser

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Duplicate
    • 1.8.8, 2.0.0
    • None
    • Parsing, XmpBox
    • None

    Description

      The traditional parser is able to extract metadata from a test document from TIKA-738. The NonSequentialPDFParser is not able to extract metadata from that file. Another file from the Tika test suite has metadata that can be extracted by the NonSequentialPDFParser but not by classic.

      Attachments

        1. PDFBOX-1792.tar.gz
          19 kB
          Tim Allison
        2. testPDF_acroForm2.pdf
          478 kB
          Tim Allison

        Issue Links

          Activity

            People

              lehmi Andreas Lehmkühler
              tallison Tim Allison
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: