[TIKA-793] Invalid ASCII character (65533) when retriving MP3 metadata - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: 1.0
Fix Version/s: 1.1
Component/s: metadata, parser
Labels:
None
Environment:

Ubuntu 10.04 (x64), Android (2.2 +)

Description

When extracting metadata from certain mp3's (the id3 version appears to be 2.4) I'm seeing invalid characters at the end of the parsed fields. For example:

American M�

which should be:

American Me

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

TikaTest.java
27/Nov/11 08:48
2 kB
William Seemann

Activity

People

Assignee:: Unassigned

Reporter:: William Seemann

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 27/Nov/11 08:45

Updated:: 29/Dec/11 09:12

Resolved:: 29/Dec/11 09:12