[TIKA-1203] Some metadata not extracted from PDF files when NonSequentialPDFParser is used - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Minor
Resolution: Not A Problem
Affects Version/s: None
Fix Version/s: None
Component/s: parser
Labels: None

Description

While working on TIKA-1201, I noticed that metadata was not being extracted from the testAnnotations.pdf file when the NonSequentialPDFParser was being used. I opened PDFBOX-1792. This TIKA issue is a placeholder. When PDFBOX-1792 is fixed, we can stop skipping "testAnnotations.pdf" in PDFParserTest.
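
For illustration only, here is a minimal sketch of how a test might skip a known-problematic file when comparing the output of two parsers. It is not the actual PDFParserTest code: the class name `SkipListSketch`, the set `KNOWN_METADATA_FAILS`, the method `filesToCompare`, and the other file name are all hypothetical, and the sketch deliberately avoids any Tika or PDFBox calls.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class SkipListSketch {
    // Hypothetical skip list: files known to yield different metadata
    // under the non-sequential parser (tracked in PDFBOX-1792).
    static final Set<String> KNOWN_METADATA_FAILS =
            Collections.unmodifiableSet(
                    new HashSet<>(Arrays.asList("testAnnotations.pdf")));

    // Returns the files that should still be compared across both parsers,
    // preserving the input order and dropping anything on the skip list.
    static List<String> filesToCompare(List<String> allTestFiles) {
        List<String> result = new ArrayList<>();
        for (String name : allTestFiles) {
            if (!KNOWN_METADATA_FAILS.contains(name)) {
                result.add(name);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Illustrative file names only.
        List<String> files = Arrays.asList("testPDF.pdf", "testAnnotations.pdf");
        System.out.println(filesToCompare(files));
    }
}
```

Keeping the skipped names in a single set means that, once the upstream issue is resolved, re-enabling the file is a one-line change.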

Issue Links

depends upon

PDFBOX-1792 Different metadata with NonSequentialPDFParser (Closed)

is related to

TIKA-1201 Add possibility for switching to pdfbox NonSequentialPDFParser (Closed)

People

Assignee: Unassigned

Reporter: Tim Allison

Votes: 0

Watchers: 3

Dates

Created: 03/Dec/13 16:22

Updated: 30/Aug/23 18:01

Resolved: 30/Aug/23 18:01