Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
Currently, Flink is serializing the schema for each Avro GenericRecord in the stream.
This leads to a lot of overhead over the wire/disk + high serialization costs.
Therefore, I'm proposing to improve the support for GenericRecord in Flink by shipping the schema to each serializer through the AvroTypeInformation.
Then, we can only support GenericRecords with the same type per stream, but the performance will be much better.
Attachments
Issue Links
- links to