Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-29663

Further improvements of adaptive batch scheduler

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Done
    • None
    • 1.17.0
    • Runtime / Coordination
    • None
    • Hide
      In 1.17, we further enhanced the adaptive batch scheduler, mainly including:
      (1) Support for evenly distributing data to downstream tasks
      (2) Remove the limitation that the decided parallelism of vertices can only be a power of 2
      Show
      In 1.17, we further enhanced the adaptive batch scheduler, mainly including: (1) Support for evenly distributing data to downstream tasks (2) Remove the limitation that the decided parallelism of vertices can only be a power of 2

    Description

      In Flink 1.15, we introduced the adaptive batch scheduler to automatically decide parallelisms of job vertices for batch jobs.  In this issue, we will  further optimize it by changing the subpartition range division algorithm: change it from dividing according to the number of subpartitions(the number of subpartitions within each subpartition range is basically the same) to dividing according to the amount of data in subpartition ranges (the amount of data within each subpartition range is basically the same).

      More details see https://docs.google.com/document/d/1Qyq3qDkBCUNupajVJpFTKp3fHQJtwIu9luM7T52k1Oo

       This is the umbrella ticket for the improvements.

      Attachments

        Issue Links

          Activity

            People

              wanglijie Lijie Wang
              wanglijie Lijie Wang
              Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: