[FLINK-19896] Improve first-n-row fetching in the rank operator - ASF JIRA

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.14.0
Component/s: Table SQL / Runtime
Labels:
- auto-deprioritized-major
- pull-request-available

Description

Currently, the Deduplicate operator supports only first-row deduplication (ordered by proc-time). For first-n-rows deduplication, the planner has to fall back to the Rank operator. However, the Rank operator is less efficient than Deduplicate because it keeps larger state and accesses state more often.

This issue proposes extending DeduplicateKeepFirstRowFunction to support first-n-rows deduplication. The original first-row deduplication then becomes the special case n = 1.
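To illustrate the idea, here is a minimal sketch in plain Java, not the code from the attached patch. It assumes rows arrive in proc-time order, so keeping the first n rows per key only needs a per-key counter rather than buffered, sorted rows. The class name `FirstNRowsDedupSketch`, the `accept` and `dedup` methods, and the `HashMap` standing in for Flink keyed state are all hypothetical names chosen for this example.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch of first-n-rows deduplication; not Flink's actual operator. */
class FirstNRowsDedupSketch {
    // Stand-in for keyed state (assumption): one counter per key, no buffered rows.
    private final Map<String, Integer> emittedPerKey = new HashMap<>();
    private final int n;

    FirstNRowsDedupSketch(int n) {
        if (n < 1) {
            throw new IllegalArgumentException("n must be >= 1");
        }
        this.n = n;
    }

    /** Returns true if this row is among the first n seen for its key. */
    boolean accept(String key) {
        int seen = emittedPerKey.getOrDefault(key, 0);
        if (seen >= n) {
            return false; // key already has n rows: drop without updating state
        }
        emittedPerKey.put(key, seen + 1);
        return true;
    }

    /** Filters rows of the form {key, value}, given in arrival (proc-time) order. */
    static List<String> dedup(List<String[]> rows, int n) {
        FirstNRowsDedupSketch function = new FirstNRowsDedupSketch(n);
        List<String> out = new ArrayList<>();
        for (String[] row : rows) {
            if (function.accept(row[0])) {
                out.add(row[0] + ":" + row[1]);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String[]> rows = new ArrayList<>();
        rows.add(new String[] {"a", "1"});
        rows.add(new String[] {"b", "1"});
        rows.add(new String[] {"a", "2"});
        rows.add(new String[] {"a", "3"});
        rows.add(new String[] {"b", "2"});
        System.out.println(dedup(rows, 2)); // first two rows per key
        System.out.println(dedup(rows, 1)); // n = 1 is plain first-row deduplication
    }
}
```

With n = 1 the counter degenerates to a "seen before" flag, which is how first-row deduplication falls out as a special case in this sketch.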

Attachments

flink-19896.patch (15 kB, 04/Nov/20 03:39, Jun Zhang)

Issue Links

links to GitHub Pull Request #13921

People

Assignee: Jun Zhang
Reporter: Jun Zhang
Votes: 0
Watchers: 6

Dates

Created: 30/Oct/20 09:00
Updated: 28/Aug/21 12:14
Resolved: 03/Jun/21 02:21