Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-34228

Add long UTF serializer/deserializer

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.20.0
    • None

    Description

      DataOutputSerializer.writeUTF has a hard limit on the length of the string (64k). This is inherited from the DataOutput.writeUTF method, where the JDK specifically defines this limit [1].

      For our use-case we need to enable the possibility to serialize longer UTF strings, so we will need to define a writeLongUTF method with a similar specification than the writeUTF, but without the length limit.

      Based on the discussion on the mailing list, this is a good additional serialization utility to Flink [2]

      [1] - https://docs.oracle.com/javase/8/docs/api/java/io/DataOutput.html#writeUTF-java.lang.String-
      [2] - https://lists.apache.org/thread/ocm6cj0h8o3wbwo7fz2l1b4odss750rk

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              pvary Peter Vary
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: