  1. Hive
  2. HIVE-21218

KafkaSerDe doesn't support topics created via Confluent Avro serializer


    Details

    • Target Version/s:
    • Release Note:
      KafkaSerDe now supports the Confluent message format.

      Description

      According to Google Groups, the Confluent Avro serializer uses a proprietary format for the Kafka value: <magic_byte 0x00><4 bytes of schema ID><regular Avro bytes for an object that conforms to the schema>.
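
The layout described above can be illustrated with a minimal sketch that splits a Kafka value into its schema ID and Avro payload. This is not the code from the attached patches. The names `split_confluent_frame` and `ConfluentFrame` are hypothetical, and the sketch assumes the 4-byte schema ID is a big-endian unsigned integer, which the description does not state.

```python
import struct
from typing import NamedTuple

MAGIC_BYTE = 0x00   # first byte of the value, per the description
HEADER_SIZE = 5     # 1 magic byte + 4 bytes of schema ID


class ConfluentFrame(NamedTuple):
    """Hypothetical container for the two parts that follow the magic byte."""
    schema_id: int
    avro_payload: bytes


def split_confluent_frame(value: bytes) -> ConfluentFrame:
    """Split a Kafka value into (schema ID, plain Avro bytes).

    Assumption: the schema ID is stored big-endian and unsigned.
    """
    if len(value) < HEADER_SIZE:
        raise ValueError(f"value too short for header: {len(value)} bytes")
    # ">" disables padding, so "BI" reads exactly 1 + 4 bytes.
    magic, schema_id = struct.unpack(">BI", value[:HEADER_SIZE])
    if magic != MAGIC_BYTE:
        raise ValueError(f"unexpected magic byte: {magic:#04x}")
    # Everything after the header is the regular Avro encoding.
    return ConfluentFrame(schema_id, value[HEADER_SIZE:])
```

A deserializer that does not know about this header would try to decode the first five bytes as Avro data. Removing them first leaves bytes that a plain Avro decoder can read with the matching schema.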

      This format causes no problem for the Confluent Kafka deserializer, which respects it. For the Hive Kafka handler, however, it is a problem to deserialize the Kafka value correctly, because Hive uses its own deserializer from bytes to objects and ignores the Kafka consumer ser/deser classes provided via table property.

      It would be nice to support the Confluent format with the magic byte.

      It would also be great to support Schema Registry.

        Attachments

        1. HIVE-21218.patch
          12 kB
          Milan Baran
        2. HIVE-21218.2.patch
          13 kB
          Milan Baran

          Issue Links

            Activity

              People

              • Assignee:
                milan.baran Milan Baran
                Reporter:
                milan.baran Milan Baran
              • Votes:
                2
                Watchers:
                4

                Dates

                • Created:
                  Updated:

                  Time Tracking

                  Estimated:
                  Not Specified
                  Remaining:
                  0h
                  Logged:
                  3h 40m