[PDFBOX-601] PDFBox performance issue: PDSimpleFont, PDFont performance tweaks - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.8.0-incubator
Fix Version/s: 1.0.0
Component/s: PDModel
Labels:
None
Environment:
All

Description

During text extraction, font size / descriptor / encoding attributes are accessed repeatedly in order to do positional calculations and byte-character conversions.

The current code has several accessors for these things that redo rather slow calculations each time - even thought the font object state is not changed.

The results of these calculations should be persisted in instance fields once calculated. This greatly improves performance.

I'll attach new versions of PDFont, PDFontDescriptorDictionary and PDSimpleFont that have these tweaks.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

PDSimpleFont.java
14/Jan/10 21:33
12 kB
Mel Martinez
PDFontDescriptorDictionary.java
14/Jan/10 21:33
15 kB
Mel Martinez
PDFont.java
14/Jan/10 21:33
30 kB
Mel Martinez

Activity

People

Assignee:: Jukka Zitting

Reporter:: Mel Martinez

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 14/Jan/10 21:31

Updated:: 22/Feb/10 18:28

Resolved:: 14/Jan/10 23:33