Details
-
Task
-
Status: Open
-
Minor
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
We've had two issues now where a charset detector is detecting a charset that is not supported by the jvm. In both cases, the charset detector was wrong. It is beyond our scope to fix the underlying charset detectors, but we can allow users to have the charset detectors skip unsupported charsets.
Attachments
Issue Links
- is depended upon by
-
TIKA-3516 Unexpected charset IBM424_rtl detected for utf_8 file by CharsetDetector
- Resolved