TIKA-3844: Improve extraction of PDF subset info

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.5.0
    • Component/s: None
    • Labels: None

    Description

      We currently extract the PDF/A part and conformance level. We should add the equivalent extraction for PDF/VT, PDF/UA, and PDF/X.

      We should also finally get rid of the bad hack from 1.x that appended the PDF/A conformance to the file type.

      I'd like to thank Peter Wyatt via offline chat for everything that was right about this improvement. The other stuff is all mine.
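
      As a rough illustration of the kind of extraction described above (not the change that was actually committed), the sketch below reads subset identification properties from a PDF's XMP packet using only the JDK's DOM parser. The class name, the output keys (pdfa.part, pdfua.part, and so on), and the choice to work on a raw XMP string are hypothetical. The namespace URIs and property names are the ones commonly used for PDF/A, PDF/UA, PDF/X, and PDF/VT identification and should be checked against the relevant specifications before relying on them.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical sketch: not Tika's implementation.
class PdfSubsetSketch {

    static final String RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#";

    // {output key (hypothetical), XMP namespace (assumed), local property name (assumed)}
    static final String[][] PROPS = {
        {"pdfa.part",        "http://www.aiim.org/pdfa/ns/id/",  "part"},
        {"pdfa.conformance", "http://www.aiim.org/pdfa/ns/id/",  "conformance"},
        {"pdfua.part",       "http://www.aiim.org/pdfua/ns/id/", "part"},
        {"pdfx.version",     "http://www.npes.org/pdfx/ns/id/",  "GTS_PDFXVersion"},
        {"pdfvt.version",    "http://www.npes.org/pdfvt/ns/id/", "GTS_PDFVTVersion"},
    };

    // Parses an XMP packet and returns whichever subset properties are present.
    static Map<String, String> extract(String xmp) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        // Refuse DOCTYPE declarations so untrusted metadata cannot trigger XXE.
        factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
        Document doc = factory.newDocumentBuilder()
                .parse(new ByteArrayInputStream(xmp.getBytes(StandardCharsets.UTF_8)));

        Map<String, String> found = new LinkedHashMap<>();
        NodeList descriptions = doc.getElementsByTagNameNS(RDF_NS, "Description");
        for (int i = 0; i < descriptions.getLength(); i++) {
            Element description = (Element) descriptions.item(i);
            for (String[] prop : PROPS) {
                // XMP allows a simple property as an attribute of rdf:Description ...
                String value = description.getAttributeNS(prop[1], prop[2]);
                if (value.isEmpty()) {
                    // ... or as a child element, so check both forms.
                    NodeList children = description.getElementsByTagNameNS(prop[1], prop[2]);
                    if (children.getLength() > 0) {
                        value = children.item(0).getTextContent().trim();
                    }
                }
                if (!value.isEmpty()) {
                    found.putIfAbsent(prop[0], value);
                }
            }
        }
        return found;
    }

    public static void main(String[] args) throws Exception {
        // Made-up XMP packet: PDF/A in attribute form, PDF/UA in element form.
        String xmp =
            "<x:xmpmeta xmlns:x='adobe:ns:meta/'>"
          + "<rdf:RDF xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#'>"
          + "<rdf:Description rdf:about='' xmlns:pdfaid='http://www.aiim.org/pdfa/ns/id/'"
          + " pdfaid:part='1' pdfaid:conformance='B'/>"
          + "<rdf:Description rdf:about='' xmlns:pdfuaid='http://www.aiim.org/pdfua/ns/id/'>"
          + "<pdfuaid:part>1</pdfuaid:part>"
          + "</rdf:Description>"
          + "</rdf:RDF></x:xmpmeta>";
        System.out.println(extract(xmp));
    }
}
```

      Reporting each subset under its own key, rather than folding it into the file type string, keeps the media type stable while still exposing the part, conformance, and version values to callers.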

          People

            Assignee: Unassigned
            Reporter: Tim Allison (tallison)
            Votes: 0
            Watchers: 2