Apache Jena / JENA-2118

Change IO.asUTF8 to map bad characters to U+FFFD not throw an exception.

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: Jena 4.1.0
    • Fix Version/s: Jena 4.2.0
    • Component/s: Base, RIOT
    • Labels: None

    Description

      IO.asUTF8 creates a CharsetDecoder from StandardCharsets.UTF_8, leaves it unmodified, and passes it to the JDK InputStreamReader. An unmodified decoder has its error actions (malformed input and "unmappable") set to REPORT, so bad input causes an exception.

      But if InputStreamReader is passed the charset itself, it creates a CharsetDecoder and then modifies it so that these actions are set to REPLACE (i.e. substitute U+FFFD, the Unicode replacement character).

      Change the behaviour to be "REPLACE".
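
      The difference between the two decoder configurations can be sketched in plain JDK code. This is a minimal illustration, not the Jena implementation: the class Utf8DecodeDemo and its methods strictRejects and decodeLenient are hypothetical names introduced here, and only java.io and java.nio.charset calls are used.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; not part of Apache Jena.
final class Utf8DecodeDemo {

    // Drain a Reader into a String, one char at a time.
    private static String readAll(Reader reader) throws IOException {
        StringBuilder sb = new StringBuilder();
        int c;
        while ((c = reader.read()) != -1) {
            sb.append((char) c);
        }
        return sb.toString();
    }

    // Unmodified decoder: both error actions default to REPORT,
    // so invalid UTF-8 surfaces as a CharacterCodingException.
    static boolean strictRejects(byte[] bytes) {
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder();
        try (Reader r = new InputStreamReader(new ByteArrayInputStream(bytes), decoder)) {
            readAll(r);
            return false;
        } catch (CharacterCodingException e) {
            return true;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Decoder explicitly set to REPLACE: invalid input becomes U+FFFD.
    static String decodeLenient(byte[] bytes) {
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPLACE)
                .onUnmappableCharacter(CodingErrorAction.REPLACE);
        try (Reader r = new InputStreamReader(new ByteArrayInputStream(bytes), decoder)) {
            return readAll(r);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // 0xFF can never appear in well-formed UTF-8.
        byte[] bad = { 'a', (byte) 0xFF, 'b' };
        System.out.println("strict decoder rejects input: " + strictRejects(bad));
        System.out.println("lenient decoder output: " + decodeLenient(bad));
    }
}
```

      Setting both actions on the decoder keeps the explicit-decoder code path while matching what InputStreamReader does when it is given the charset directly.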


            People

              Assignee: Andy Seaborne
              Reporter: Andy Seaborne
              Votes: 0
              Watchers: 2