Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-3525

Allow users to configure skipping of unsupported charsets in charset detection

    XMLWordPrintableJSON

Details

    • Task
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      We've had two issues now where a charset detector is detecting a charset that is not supported by the jvm. In both cases, the charset detector was wrong. It is beyond our scope to fix the underlying charset detectors, but we can allow users to have the charset detectors skip unsupported charsets.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              tallison Tim Allison
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: