Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-793

Invalid ASCII character (65533) when retriving MP3 metadata

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 1.0
    • 1.1
    • metadata, parser
    • None
    • Ubuntu 10.04 (x64), Android (2.2 +)

    Description

      When extracting metadata from certain mp3's (the id3 version appears to be 2.4) I'm seeing invalid characters at the end of the parsed fields. For example:

      American M�

      which should be:

      American Me

      Attachments

        1. TikaTest.java
          2 kB
          William Seemann

        Activity

          People

            Unassigned Unassigned
            wseemann William Seemann
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: