Tika
  1. Tika
  2. TIKA-793

Invalid ASCII character (65533) when retriving MP3 metadata

    Details

    • Type: Bug Bug
    • Status: Resolved
    • Priority: Minor Minor
    • Resolution: Fixed
    • Affects Version/s: 1.0
    • Fix Version/s: 1.1
    • Component/s: metadata, parser
    • Labels:
      None
    • Environment:

      Ubuntu 10.04 (x64), Android (2.2 +)

      Description

      When extracting metadata from certain mp3's (the id3 version appears to be 2.4) I'm seeing invalid characters at the end of the parsed fields. For example:

      American M�

      which should be:

      American Me

      1. TikaTest.java
        2 kB
        William Seemann

        Activity

          People

          • Assignee:
            Unassigned
            Reporter:
            William Seemann
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development