Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-1180

Better Matroska MKV and WEBM Detection

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.5
    • None
    • detector

    Description

      Following the work on TIKA-1177, we now have mimetype entries for the various formats which are based on the Matroska container (mkv, mka, webm etc). However, we are unable to properly identify the specific type just from some mime magic

      Instead, for fully accurate detection, we'll need a new Detector for the Matroska family, which does some very simple container/stream processing to work out what the container contains

      Attachments

        1. sample-webm.noext
          16 kB
          Wladimir Leite
        2. sample-mkv.noext
          6 kB
          Wladimir Leite

        Issue Links

          Activity

            People

              Unassigned Unassigned
              nick Nick Burch
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated: