[TIKA-1285] Upgrade to PDFBox 2.0.0 when available - ASF JIRA

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 1.6
Fix Version/s: 1.13
Component/s: parser
Labels:
None

Description

This issue tracks the fixes required to upgrade the PDFBox dependency to 2.0.0 Final once it is available, using PDFBox's daily builds until then.

See the comment on TIKA-1268.

Relates to PDFBOX-1893.

Attachments

- pdfbox_reports_2_0_0_20150709.zip, 30 kB, Tim Allison, 10/Jul/15 11:52
- testPDF_childAttachments.pdf, 2.21 MB, Tim Allison, 07/Jul/15 16:53
- TIKA-1285_rev1641423.patch, 39 kB, Jeremy Anderson, 25/Nov/14 19:17
- TIKA-1285.patch, 33 kB, Jeremy Anderson, 04/Sep/14 23:13
- TIKA-1285v3.patch, 45 kB, Tim Allison, 07/Jul/15 17:05

Issue Links

blocks

TIKA-1753 Improper word concatenation when extracting pdf

Closed

depends upon

PDFBOX-2862 GlyphList doesn't appear to be thread safe in trunk...or user error?

Closed

PDFBOX-2855 Allow some flexibility for divergences from the standard on Seq vs Bag in DomXMPParser

Closed

is depended upon by

PDFBOX-3128 Latest Apache Tika can't be used together with PDFBox 2.0

Closed

is related to

PDFBOX-2856 Markedly slower processing for particular file in 2.0.0-trunk vs 1.8.9

Closed

PDFBOX-2865 Downgrade logging "Using last-resort fallback for x font" to warn in 2.0.0?

Closed

relates to

PDFBOX-2868 NPE in Acroform getValueAsString

Closed

TIKA-1912 Figure out how to parse truncated PDFs that were handled by PDFBox 1.8.x but not by 2.0.0

Open

TIKA-1300 Switch default PDFBox parser to NonSequentialParser

Resolved

Sub-Tasks

Try to migrate current Tika code around PDFBox 1.8.x from JempBox to XMPBox

Resolved

Tim Allison

People

Assignee: Unassigned

Reporter: Jeremy Anderson

Votes: 5

Watchers: 12

Dates

Created: 30/Apr/14 00:46

Updated: 16/Dec/16 16:05

Resolved: 22/Mar/16 17:39