Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-1792

Different metadata with NonSequentialPDFParser

    Details

    • Type: Bug
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: 1.8.8, 2.0.0
    • Fix Version/s: 3.0.0 PDFBox
    • Component/s: Parsing, XmpBox
    • Labels:
      None

      Description

      The traditional parser is able to extract metadata from a test document from TIKA-738. The NonSequentialPDFParser is not able to extract metadata from that file. Another file from the Tika test suite has metadata that can be extracted by the NonSequentialPDFParser but not by classic.

        Attachments

        1. testPDF_acroForm2.pdf
          478 kB
          Tim Allison
        2. PDFBOX-1792.tar.gz
          19 kB
          Tim Allison

          Issue Links

            Activity

              People

              • Assignee:
                lehmi Andreas Lehmkühler
                Reporter:
                tallison@apache.org Tim Allison
              • Votes:
                0 Vote for this issue
                Watchers:
                6 Start watching this issue

                Dates

                • Created:
                  Updated: