TIKA-852: Quicktime / MP4 Metadata Parser

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.0
    • Fix Version/s: None
    • Component/s: parser
    • Labels:
      None

      Description

      From the investigations done for TIKA-851, it looks like a parser for the QuickTime format, and for MP4 (which is an extension of it), shouldn't be too hard to write. It should be able to output some of the media metadata, such as duration, dimensions, and MP4 audio tags.

      Information resources on the format are linked from TIKA-851
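
      To illustrate the kind of metadata extraction meant above, here is a minimal sketch of reading the duration from the movie header. It assumes the standard atom layout (a 4-byte big-endian size followed by a 4-byte type, with the `mvhd` atom nested inside `moov`). The class and method names are hypothetical and are not part of Tika or MP4Parser, and the sample file built in `sampleFile()` is a cut-down buffer that contains only the leading `mvhd` fields, not a real media file.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.charset.StandardCharsets;

// Hypothetical sketch, not Tika code: walks atoms to find moov/mvhd and reads the duration.
class Mp4DurationSketch {

    /** Returns the movie duration in seconds, or -1 if no usable mvhd atom is found. */
    static double readDurationSeconds(ByteBuffer file) {
        ByteBuffer moov = findAtom(file, "moov");
        if (moov == null) return -1;
        ByteBuffer mvhd = findAtom(moov, "mvhd");
        if (mvhd == null || mvhd.remaining() < 4) return -1;

        int version = mvhd.get() & 0xFF;
        mvhd.position(mvhd.position() + 3);            // skip the 24-bit flags
        // Version 1 uses 64-bit times and duration, version 0 uses 32-bit ones.
        int needed = (version == 1) ? 16 + 4 + 8 : 8 + 4 + 4;
        if (mvhd.remaining() < needed) return -1;

        long timescale;
        long duration;
        if (version == 1) {
            mvhd.position(mvhd.position() + 16);       // creation + modification time
            timescale = mvhd.getInt() & 0xFFFFFFFFL;
            duration = mvhd.getLong();
        } else {
            mvhd.position(mvhd.position() + 8);        // creation + modification time
            timescale = mvhd.getInt() & 0xFFFFFFFFL;
            duration = mvhd.getInt() & 0xFFFFFFFFL;
        }
        return timescale == 0 ? -1 : (double) duration / timescale;
    }

    /** Scans sibling atoms from the buffer's position; returns the payload of the first match. */
    static ByteBuffer findAtom(ByteBuffer buf, String wanted) {
        ByteBuffer b = buf.duplicate().order(ByteOrder.BIG_ENDIAN);
        while (b.remaining() >= 8) {
            int start = b.position();
            long size = b.getInt() & 0xFFFFFFFFL;
            byte[] type = new byte[4];
            b.get(type);
            int headerLen = 8;
            if (size == 1) {                           // 64-bit extended size follows the type
                if (b.remaining() < 8) return null;
                size = b.getLong();
                headerLen = 16;
            } else if (size == 0) {                    // atom runs to the end of the data
                size = b.limit() - start;
            }
            if (size < headerLen || size > b.limit() - start) return null; // malformed
            if (wanted.equals(new String(type, StandardCharsets.ISO_8859_1))) {
                ByteBuffer payload = b.duplicate().order(ByteOrder.BIG_ENDIAN);
                payload.limit(start + (int) size);     // position is already at the payload
                return payload;
            }
            b.position(start + (int) size);            // jump to the next sibling atom
        }
        return null;
    }

    /** Builds a tiny synthetic file: ftyp, then moov containing a truncated version-0 mvhd. */
    static ByteBuffer sampleFile() {
        ByteBuffer b = ByteBuffer.allocate(52);
        b.putInt(16).put(ascii("ftyp")).put(ascii("M4A ")).putInt(0);
        b.putInt(36).put(ascii("moov"));
        b.putInt(28).put(ascii("mvhd"));
        b.putInt(0)                                    // version 0 and flags
         .putInt(0)                                    // creation time
         .putInt(0)                                    // modification time
         .putInt(1000)                                 // timescale: units per second
         .putInt(2500);                                // duration in timescale units
        b.flip();
        return b;
    }

    static byte[] ascii(String s) {
        return s.getBytes(StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        System.out.println(readDurationSeconds(sampleFile()));
    }
}
```

      The duration is stored in timescale units, so it has to be divided by the timescale to get seconds. Dimensions and the audio tags live in other atoms and would need the same kind of walk, which is part of why reusing an existing reader may be less work than this sketch suggests.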

      1. TIKA-852.patch
        2 kB
        Sebastian Annies

        Activity

        Nick Burch added a comment -

        It looks like the Apache Licensed MP4Parser <http://code.google.com/p/mp4parser/> already provides much of this, so it looks simplest to use that rather than writing our own MP4 reader

        Nick Burch added a comment -

        Initial parser added in r1237162. Currently, it largely just handles Apple M4A audio files. There are probably a couple more XMPDM keys we'd want to fully support these, and quite a few more we'd need before we could do video. (We'd also need some test files, and to decide how best to handle video files, see an earlier dev@tika discussion for details of the problems)

        For now, the unit test is disabled, as it fails when asserts are turned on. I think this is an mp4parser bug though, so I've raised <http://code.google.com/p/mp4parser/issues/detail?id=30> for it.

        Sebastian Annies added a comment (edited) -

        Hi,
        sorry for not responding to/fixing Issue 30 in mp4parser. You should have nagged me more.
        Please find attached a patch for this issue. It updates to isoparser-1.0-RC-1 and makes the test work.

        Thank you for using my parser!
        Regards,
        Sebastian

        PS.: with RC-1 the parser should be substantially faster

        Nick Burch added a comment -

        Thanks Sebastian, I've committed your patch in r1331788.

        I'll leave the bug open for now, as we're not doing all the XMPDM keys we probably should do, and we're not doing video yet


          People

          • Assignee:
            Nick Burch
            Reporter:
            Nick Burch
          • Votes:
            2
            Watchers:
            3
