[KAFKA-8410] Strengthen the types of Processors, at least in the DSL, maybe in the PAPI as well - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.7.0
Component/s: streams
Labels:
- kip
- tech-debt

Description

KIP-478: https://cwiki.apache.org/confluence/display/KAFKA/KIP-478+-+Strongly+typed+Processor+API

Presently, it's very difficult to have confidence when adding to or modifying processors in the DSL. There are a lot of raw types, duck-typing, and casting that contribute to this problem.

The root, though, is that the generic types on `Processor<K,V>` refer only to the input key and value types. No information is captured or verified about what the output types of a processor are. For example, this leads to widespread confusion in the code base about whether a processor produces `V`s or `Change<V>`s. The type system actually makes matters worse, since we use casts to make the processors conform to declared types that are in fact wrong, but are never checked due to erasure.
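
To make the erasure problem concrete, here is a minimal, self-contained sketch. It does not use the real Kafka Streams classes: `Processor`, `UntypedContext`, and `Change` below are simplified, hypothetical stand-ins. It assumes a context whose `forward` accepts any object, so an upstream processor can emit `Change<V>` while the downstream one is declared to receive plain `V`, and nothing complains until the value is actually used.

```java
import java.util.ArrayList;
import java.util.List;

final class ErasureDemo {
    // Simplified stand-in for the DSL's Change<V> (new/old value pair); hypothetical.
    static final class Change<V> {
        final V newValue;
        final V oldValue;
        Change(V newValue, V oldValue) { this.newValue = newValue; this.oldValue = oldValue; }
    }

    // Only the INPUT types are captured, as with Processor<K, V> in the description.
    interface Processor<K, V> {
        void process(K key, V value);
    }

    // Hypothetical untyped context: forward() accepts anything.
    static final class UntypedContext {
        final List<Object> forwarded = new ArrayList<>();
        void forward(Object key, Object value) { forwarded.add(value); }
    }

    @SuppressWarnings({"rawtypes", "unchecked"})
    static String run() {
        UntypedContext context = new UntypedContext();
        List<Integer> seen = new ArrayList<>();

        // Upstream is declared over Integer inputs but silently emits Change<Integer>.
        Processor<String, Integer> upstream = (k, v) -> context.forward(k, new Change<>(v, null));
        // Downstream is declared (wrongly) to receive plain Integer values.
        Processor<String, Integer> downstream = (k, v) -> seen.add(v + 1);

        upstream.process("a", 1);

        // Wiring through a raw type compiles with only an unchecked warning.
        Processor raw = downstream;
        try {
            for (Object value : context.forwarded) {
                raw.process("a", value);
            }
            return "no error";
        } catch (ClassCastException e) {
            // The mismatch only surfaces at runtime, far from the faulty declaration.
            return "ClassCastException";
        }
    }

    public static void main(String[] args) {
        System.out.println(run());
    }
}
```

The compiler accepts all of this because the declared types are never checked against what is actually forwarded; the failure shows up only when the downstream code touches the value.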

We can start to make some headway on this tech debt by adding some types to the ProcessorContext that bound the `<K,V>` that may be passed to `context.forward`. Then, we can build on this by fully specifying the input and output types of the Processors, which in turn would let us eliminate the majority of unchecked casts in the DSL operators.
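
The idea in the paragraph above can be sketched roughly as follows. This is an illustration under assumptions, not the interfaces proposed in the KIP: the four-parameter `Processor`, the two-parameter `ProcessorContext`, and the `ToChange` operator are hypothetical, simplified names chosen for this example.

```java
import java.util.ArrayList;
import java.util.List;

final class TypedForwardSketch {
    // Simplified stand-in for the DSL's Change<V>; hypothetical.
    static final class Change<V> {
        final V newValue;
        final V oldValue;
        Change(V newValue, V oldValue) { this.newValue = newValue; this.oldValue = oldValue; }
    }

    // The context's type parameters bound what may be passed to forward().
    interface ProcessorContext<KOut, VOut> {
        void forward(KOut key, VOut value);
    }

    // Both input and output types are part of the processor's signature.
    interface Processor<KIn, VIn, KOut, VOut> {
        void init(ProcessorContext<KOut, VOut> context);
        void process(KIn key, VIn value);
    }

    // Hypothetical operator that wraps each value in a Change; its output type says so.
    static final class ToChange<K, V> implements Processor<K, V, K, Change<V>> {
        private ProcessorContext<K, Change<V>> context;

        @Override
        public void init(ProcessorContext<K, Change<V>> context) { this.context = context; }

        @Override
        public void process(K key, V value) {
            context.forward(key, new Change<>(value, null));
            // context.forward(key, value); // would not compile: V is not Change<V>
        }
    }

    static List<Change<Integer>> run() {
        List<Change<Integer>> out = new ArrayList<>();
        ToChange<String, Integer> processor = new ToChange<>();
        // The downstream sink must accept Change<Integer>; no cast is needed anywhere.
        processor.init((key, value) -> out.add(value));
        processor.process("a", 1);
        return out;
    }

    public static void main(String[] args) {
        System.out.println(run().get(0).newValue);
    }
}
```

With the output types declared, forwarding a bare `V` where a `Change<V>` is expected becomes a compile error instead of a latent runtime failure, which is what would allow most unchecked casts in the DSL operators to be removed.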

I'm not sure whether adding these generic types to the existing ProcessorContext and Processor interfaces, which would also affect the PAPI, has any utility, or whether we should make this a purely internal change by introducing GenericProcessorContext and GenericProcessor peer interfaces for the DSL to use.

Issue Links

is duplicated by

KAFKA-12532 Migrate Stream operators to new Processor API

Resolved

relates to

KAFKA-8396 Clean up Transformer API

Resolved

links to

GitHub Pull Request #6833

GitHub Pull Request #6856

GitHub Pull Request #8414

GitHub Pull Request #8595

GitHub Pull Request #9346

GitHub Pull Request #10381

GitHub Pull Request #10507

GitHub Pull Request #10744

GitHub Pull Request #10994

Sub-Tasks

1.	Introduce the KIP-478 processors with shims	Resolved	John Roesler
2.	Implement the KIP-478 StreamBuilder#addGlobalStore()	Resolved	John Roesler
3.	Implement KIP-478 Topology changes	Resolved	John Roesler
4.	KIP-478: Implement test-utils changes	Resolved	John Roesler
5.	KIP-478: Implement StateStoreContext and Record	Resolved	John Roesler
6.	KIP-478: Implement KStream changes	Resolved	John Roesler
7.	Convert KStreamImpl filters to new PAPI	Resolved	Jorge Esteban Quilcate Otoya
8.	Convert KStreamImpl maps to new PAPI	Resolved	Jorge Esteban Quilcate Otoya
9.	Convert KStreamImpl joins to new PAPI	Resolved	Jorge Esteban Quilcate Otoya
10.	Convert KStream aggregations to new PAPI	Resolved	Jorge Esteban Quilcate Otoya
11.	Convert KTable filters to new PAPI	Resolved	John Roesler
12.	Convert KTable suppress to new PAPI	Resolved	Jorge Esteban Quilcate Otoya
13.	Convert KTable maps to new PAPI	Resolved	Jorge Esteban Quilcate Otoya
14.	Convert KTable joins to new PAPI	Resolved	Jorge Esteban Quilcate Otoya
15.	Convert KTable aggregations to new PAPI	Resolved	Jorge Esteban Quilcate Otoya
16.	KIP-478: Deprecate the old PAPI interfaces	Resolved	John Roesler
17.	KIP-478: Delegate the store wrappers to the new init method	Resolved	John Roesler
18.	KIP-478: deprecate the replaced Processor API members	Resolved	John Roesler
19.	After migrating processors, search the codebase for missed migrations	Open	Unassigned
20.	After processors, migrate TupleForwarder and CacheFlushListener	Resolved	Jorge Esteban Quilcate Otoya

People

Assignee: John Roesler

Reporter: John Roesler

Votes: 1

Watchers: 9

Dates

Created: 22/May/19 20:17

Updated: 28/Dec/22 22:59

Resolved: 08/Feb/22 02:18