Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-991

Mp3Parser cannot extract the duration of an audio file

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.2
    • 1.5
    • parser

    Description

      The duration of an MP3 file is an important information. Currently, Mp3Parser is not able to extract this data from a file.

      Unfortunately, it is not easy to implement this functionality with the current design of the Mp3Parser class. In order to obtain the duration, all audio frames have to be read, and their durations have to be summed up. Currently, only the beginning (the ID3v2 tags) and the end (lyrics and ID3v1) are evaluated. Processing the whole file would have additional benefits, e.g. MP3s with variable bit rates could be detected.

      Attachments

        1. tika-991_3.patch
          1 kB
          Oliver Heger
        2. tika-991_2.patch
          55 kB
          Oliver Heger

        Issue Links

          Activity

            People

              chrismattmann Chris A. Mattmann
              oheger Oliver Heger
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: