Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-1955 MIME types updates and additions for Scientific Data based on TREC-DD-Polar
  3. TIKA-1881

Updates to MIME types for Postscript, WordPerfect, image and RSS based on Polar analysis

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 1.11
    • 1.13
    • mime

    Description

      Updated Mime-Magic for 6 mime types:
      1. application/postscript : files begin with pattern "%!PS-Adobe-3.0 EPSF-3.0".
      2. application/wordperfect: files begin with pattern "├┐WPC" .
      3. image/tiff : updated pattern for "MM.+" for Big endian format.(occur at the beginning of files of tiff mime type)
      4. application/rdf+xml : updated pattern "rdf" ( from byte offset 5 to 400)
      5. application/atom+xml : updated pattern "feed" ( from byte offset 5 to 50)
      6. application/rss+xml : updated pattern "rss" ( from byte offset 5 to 50)

      https://github.com/NamithaGS/tika/commit/780100767e24505a24595ea6db43978d0700e220

      Attachments

        Activity

          People

            chrismattmann Chris A. Mattmann
            ganiga@usc.edu Namitha Sanjeeva Ganiga
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: