Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-1955 MIME types updates and additions for Scientific Data based on TREC-DD-Polar
  3. TIKA-1881

Updates to MIME types for Postscript, WordPerfect, image and RSS based on Polar analysis

    Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.11
    • Fix Version/s: 1.13
    • Component/s: mime
    • Labels:

      Description

      Updated Mime-Magic for 6 mime types:
      1. application/postscript : files begin with pattern "%!PS-Adobe-3.0 EPSF-3.0".
      2. application/wordperfect: files begin with pattern "├┐WPC" .
      3. image/tiff : updated pattern for "MM.+" for Big endian format.(occur at the beginning of files of tiff mime type)
      4. application/rdf+xml : updated pattern "rdf" ( from byte offset 5 to 400)
      5. application/atom+xml : updated pattern "feed" ( from byte offset 5 to 50)
      6. application/rss+xml : updated pattern "rss" ( from byte offset 5 to 50)

      https://github.com/NamithaGS/tika/commit/780100767e24505a24595ea6db43978d0700e220

        Attachments

          Activity

            People

            • Assignee:
              chrismattmann Chris A. Mattmann
              Reporter:
              ganiga@usc.edu Namitha Sanjeeva Ganiga
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: