Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-19896

Improve first-n-row fetching in the rank operator

    XMLWordPrintableJSON

Details

    Description

      Currently Deduplicate operator only supports first-row deduplication (ordered by proc-time). In scenario of first-n-rows deduplication, the planner has to resort to Rank operator. However, Rank operator is less efficient than Deduplicate due to larger state and more state access.

      This issue proposes to extend DeduplicateKeepFirstRowFunction to support first-n-rows deduplication. And the original first-row deduplication would be a special case of first-n-rows deduplication.

      Attachments

        1. flink-19896.patch
          15 kB
          Jun Zhang

        Issue Links

          Activity

            People

              junz Jun Zhang
              junz Jun Zhang
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: