Apache Avro / AVRO-1072

The JSON encoder doesn't handle non-ASCII characters properly

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.6.3
    • Fix Version/s: None
    • Component/s: java
    • Labels: None

    Description

      The JSON encoder uses the platform's default character encoding. It should always use UTF-8.

      This causes several problems for us:

      1. The text is mangled if the sending and receiving machines use different default encodings.
      2. Some encodings (like Latin-1 or MacRoman) can't represent all characters (like Chinese), so we get ? in the text.
      3. The binary encoder (ByteBuffer) doesn't work because of this problem.
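
      The first two problems can be reproduced with the JDK alone. The sketch below does not call Avro's encoder; the class EncodingDemo and its helper methods are hypothetical names used only for illustration. It assumes Java 8 or later and shows what happens when text is encoded with one charset and decoded with another, and how pinning the writer to UTF-8 avoids the issue.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.UncheckedIOException;
import java.io.Writer;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; it only uses JDK APIs, not Avro.
public class EncodingDemo {

    // Encode with one charset and decode with another, as two machines
    // with different platform defaults would.
    static String roundTrip(String text, Charset writer, Charset reader) {
        return new String(text.getBytes(writer), reader);
    }

    // Write text through a Writer pinned to UTF-8 instead of relying on
    // the platform default charset.
    static byte[] writeUtf8(String text) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (Writer w = new OutputStreamWriter(out, StandardCharsets.UTF_8)) {
            w.write(text);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        // Two Chinese characters, escaped so the source file stays ASCII.
        String text = "\u4e2d\u6587";

        System.out.println("platform default: " + Charset.defaultCharset());

        // Problem 1: sender and receiver disagree on the encoding.
        System.out.println(
            roundTrip(text, StandardCharsets.UTF_8, StandardCharsets.ISO_8859_1).equals(text)); // false

        // Problem 2: Latin-1 cannot represent the characters at all.
        System.out.println(
            roundTrip(text, StandardCharsets.ISO_8859_1, StandardCharsets.ISO_8859_1)); // ??

        // Explicit UTF-8 on both sides preserves the text.
        System.out.println(
            new String(writeUtf8(text), StandardCharsets.UTF_8).equals(text)); // true
    }
}
```

      With an explicit charset the result no longer depends on the machine's locale settings, which is the behavior requested for the JSON encoder.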

          People

            Assignee: Unassigned
            Reporter: Zhihong Zhang (zhihong@gmail.com)