Apache Avro / AVRO-1593

C++ json encoder assumes "C" locale and generates invalid UTF-8 sequence

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version: 1.7.7
    • Fix Version: 1.9.0
    • Component: c++
    • Labels: None
    • Environment: windows-1252 encoding

    Description

      Encoding a multibyte UTF-8 code point such as:

      "\xEF\xBD\x81"

      incorrectly becomes:

      "\xEF\xBD\U0081"

      when the string is encoded in a service running in the windows-1252 locale. This isn't a valid UTF-8 sequence, so we end up with mojibake when reading back the JSON-encoded string.
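
      One plausible reading of the symptom: the first two bytes (0xEF, 0xBD) map to printable characters in windows-1252, while 0x81 is unassigned there, so a locale-dependent character test would single out only the last byte for escaping. The sketch below shows a locale-independent way to escape a JSON string that leaves every byte >= 0x80 untouched, so an existing UTF-8 sequence survives intact. The function name escapeJsonString is hypothetical and is not Avro's API, and this is not necessarily the fix that was committed; it also assumes the input is already valid UTF-8.

      ```cpp
      #include <cstdio>
      #include <string>

      // Hypothetical helper (not part of Avro): escapes a UTF-8 string for JSON
      // without consulting the current locale.
      std::string escapeJsonString(const std::string& in) {
          std::string out;
          out.reserve(in.size() + 2);
          out.push_back('"');
          for (unsigned char c : in) {
              switch (c) {
              case '"':  out += "\\\""; break;
              case '\\': out += "\\\\"; break;
              case '\b': out += "\\b";  break;
              case '\f': out += "\\f";  break;
              case '\n': out += "\\n";  break;
              case '\r': out += "\\r";  break;
              case '\t': out += "\\t";  break;
              default:
                  if (c < 0x20) {
                      // Remaining ASCII control characters must be escaped in JSON.
                      char buf[7];
                      std::snprintf(buf, sizeof buf, "\\u%04X",
                                    static_cast<unsigned>(c));
                      out += buf;
                  } else {
                      // Printable ASCII and all bytes >= 0x80 pass through
                      // unchanged; no isprint()-style locale lookup is involved.
                      out.push_back(static_cast<char>(c));
                  }
              }
          }
          out.push_back('"');
          return out;
      }
      ```

      With this approach the three bytes "\xEF\xBD\x81" are written out as-is between the quotes, regardless of which locale the process runs in.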

          People

            Assignee: Hatem Helal (hatem)
            Reporter: Hatem Helal (hatem)