Details
-
Bug
-
Status: Open
-
Minor
-
Resolution: Unresolved
-
1.2
-
None
-
None
Description
The Mimetypes detector will return text/html as the mimetype for any javascript file that contains the string "<html" in it. I believe this is due to the rule <match value="<html" type="string" offset="0:8192"/> in the tika-mimetypes.xml file.
Attachments
Attachments
Issue Links
- causes
-
TIKA-3686 CSS file detected as JavaScript (application/javascript)
- Open