Description
The traditional parser is able to extract metadata from a test document from TIKA-738. The NonSequentialPDFParser is not able to extract metadata from that file. Another file from the Tika test suite has metadata that can be extracted by the NonSequentialPDFParser but not by classic.
Attachments
Attachments
Issue Links
- duplicates
-
PDFBOX-1806 Metadata not completely extracted by traditional parser, but is extracted by NonSequentialParser
- Closed
-
PDFBOX-5128 Support parsing non standardized XMP
- Closed
- is depended upon by
-
TIKA-1203 Some metadata not extracted from PDF files when NonSequentialPDFParser is used
- Closed