[KAFKA-5330] Use per-task converters in Connect - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 0.11.0.0
Fix Version/s: 1.0.0
Component/s: connect
Labels:
None

Description

Because Connect started with a worker-wide model of data formats, we currently allocate a single Converter per worker and only allocate an independent one when the user overrides the converter.

This can lead to performance problems when the worker-level default converter is used by a large number of tasks because converters need to be threadsafe to support this model and they may spend a lot of time just on synchronization.

We could, instead, simply allocate one converter per task. There is some overhead involved, but generally it shouldn't be that large. For example, Confluent's Avro converters will each have their own schema cache and have to make their on calls to the schema registry API, but these are relatively small, likely inconsequential compared to any normal overhead we would already have for creating and managing each task.

Attachments

Issue Links

links to

GitHub Pull Request #3196

Activity

People

Assignee:: Unassigned

Reporter:: Ewen Cheslack-Postava

Reviewer:: Ewen Cheslack-Postava

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 26/May/17 05:14

Updated:: 22/Sep/17 03:13

Resolved:: 22/Sep/17 03:12

Time Tracking

Estimated:

24h

Remaining:

24h

Logged:

Not Specified