Flink / FLINK-6022

Don't serialise Schema when serialising Avro GenericRecord


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed

    Description

      Currently, Flink serializes the schema together with every Avro GenericRecord in the stream.
      This adds considerable overhead on the wire and on disk, and makes serialization itself expensive.

      I therefore propose to improve Flink's GenericRecord support by shipping the schema to each serializer once, through the AvroTypeInformation, instead of with every record.
      The trade-off is that a stream can then only carry GenericRecords that share a single schema, but performance will be much better.
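
      The idea above can be illustrated with a small, self-contained sketch. It uses neither Flink's nor Avro's API: a "schema" is reduced to an ordered list of field names, a "record" to a list of string values, and the names SchemaShippingSketch, serializeWithSchema and FixedSchemaSerializer are hypothetical. It only contrasts writing the schema with every record against handing the schema to the serializer once.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative only: stand-ins for Avro schemas and records, not Flink or Avro classes.
class SchemaShippingSketch {

    // Per-record schema: the field names are written in front of every record's values.
    static byte[] serializeWithSchema(List<String> schema, List<String> values) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            DataOutputStream out = new DataOutputStream(bytes);
            out.writeInt(schema.size());
            for (String field : schema) {
                out.writeUTF(field);
            }
            for (String value : values) {
                out.writeUTF(value);
            }
            out.flush();
            return bytes.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Schema shipped once: the serializer is constructed with the schema
    // and writes only the values of each record.
    static final class FixedSchemaSerializer {
        private final List<String> schema;

        FixedSchemaSerializer(List<String> schema) {
            this.schema = new ArrayList<>(schema);
        }

        byte[] serialize(List<String> values) {
            // One schema per stream: records of a different shape are rejected.
            if (values.size() != schema.size()) {
                throw new IllegalArgumentException("record does not match the stream's schema");
            }
            try {
                ByteArrayOutputStream bytes = new ByteArrayOutputStream();
                DataOutputStream out = new DataOutputStream(bytes);
                for (String value : values) {
                    out.writeUTF(value);
                }
                out.flush();
                return bytes.toByteArray();
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }

        List<String> deserialize(byte[] data) {
            try {
                DataInputStream in = new DataInputStream(new ByteArrayInputStream(data));
                List<String> values = new ArrayList<>();
                // The number of fields comes from the serializer's schema, not from the bytes.
                for (int i = 0; i < schema.size(); i++) {
                    values.add(in.readUTF());
                }
                return values;
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }
    }

    public static void main(String[] args) {
        List<String> schema = Arrays.asList("id", "name");
        List<String> values = Arrays.asList("42", "alice");
        FixedSchemaSerializer serializer = new FixedSchemaSerializer(schema);
        System.out.println("with schema:    " + serializeWithSchema(schema, values).length + " bytes");
        System.out.println("without schema: " + serializer.serialize(values).length + " bytes");
    }
}
```

      In this sketch the per-record cost of the field names disappears once the serializer owns the schema, and the size check in serialize mirrors the restriction to one record type per stream.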

            People

              sewen Stephan Ewen
              rmetzger Robert Metzger
              Votes: 1
              Watchers: 9
