Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-3971

Distinguish eps-based Adobe Illustrator files from pdf-based Illustrator files

    XMLWordPrintableJSON

Details

    • Task
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • None
    • 2.8.0
    • None
    • None

    Description

      On TIKA-2689, we plan to add detection for Illustrator files that are based on/wrapped in PDF files at parse time. Illustrator files used to be eps or just ps. We should figure out how we want to distinguish between these two or three formats.

      TIKA-2689 has some great resource links to help with this.

      Pronom has a bunch of ids for "Illustrator", summarized: http://justsolve.archiveteam.org/wiki/Adobe_Illustrator_Artwork

      One example: https://www.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=detailReport&id=1350

      See also: https://bugs.ghostscript.com/show_bug.cgi?id=689926

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              tallison Tim Allison
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: