Description
The SVG file attached to the issue is interpreted as text/plain by
tika.detect(filePath)
If I add
<?xml version="1.0" standalone="no"?>
at the beginning of the file, then Tika detects it as "image/svg+xml".
According to the XML specification, the XML declaration is optional, so a document can be well-formed without it:
https://www.w3.org/TR/REC-xml/#sec-prolog-dtd
It would be great if Tika could detect a file as SVG even without the XML declaration.
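
To illustrate the kind of check being requested, here is a minimal sketch that decides whether a stream is SVG by looking at the root element and its namespace rather than at the XML declaration. It uses only the JDK's StAX parser. The class name SvgRootSniffer and the method looksLikeSvg are hypothetical names made up for this example; this is not Tika's API and not necessarily how Tika's detector works internally.

```java
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical helper for illustration only; not part of Tika.
final class SvgRootSniffer {

    // Namespace URI defined by the SVG specification.
    static final String SVG_NS = "http://www.w3.org/2000/svg";

    // Returns true if the first element in the stream is <svg> in the SVG namespace.
    // The presence or absence of an XML declaration does not affect the result.
    static boolean looksLikeSvg(InputStream in) {
        XMLInputFactory factory = XMLInputFactory.newInstance();
        // Do not process DTDs or resolve external entities while sniffing.
        factory.setProperty(XMLInputFactory.SUPPORT_DTD, false);
        factory.setProperty(XMLInputFactory.IS_SUPPORTING_EXTERNAL_ENTITIES, false);
        try {
            XMLStreamReader reader = factory.createXMLStreamReader(in);
            while (reader.hasNext()) {
                if (reader.next() == XMLStreamConstants.START_ELEMENT) {
                    // Only the root element matters; stop at the first one.
                    return "svg".equals(reader.getLocalName())
                            && SVG_NS.equals(reader.getNamespaceURI());
                }
            }
        } catch (XMLStreamException e) {
            // Not parseable as XML (for example plain text), so not SVG.
            return false;
        }
        return false;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java SvgRootSniffer <file>");
            return;
        }
        try (InputStream in = new FileInputStream(args[0])) {
            System.out.println(looksLikeSvg(in));
        }
    }
}
```

With a check along these lines, a file that starts directly with <svg xmlns="http://www.w3.org/2000/svg" ...> would be recognised as SVG whether or not the declaration line is present. An <svg> root without the SVG namespace is deliberately rejected in this sketch; whether a real detector should be that strict is a design choice.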
Attachments