Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-36270

DDB Streams Connector performance issue due to splitsAvailableForAssignment function

    XMLWordPrintableJSON

Details

    Description

      In DDB Streams connector, while testing we found out that when we are spending a lot of time in markAsFinished function because we are calling splitsAvailableForAssignment which is O(N), and given n shards can be marked as finished concurrently, the algorithm becomes O(n^2). Change the algo to assign only child shards when a parent is finished. We can start tracking child shards of a shard in SplitTracker

      Attachments

        Issue Links

          Activity

            People

              gguptp Abhi Gupta
              gguptp Abhi Gupta
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: