Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-24898 Some further improvements of sort-shuffle
  3. FLINK-21790

Shuffle data directories to make directory selection of different TaskManagers fairer

    XMLWordPrintableJSON

Details

    Description

      Currently, different TaskManagers select data directory in the same order and if there are multiple disk, some disks may stores more data than others which is bad for performance. A simple improvement is that each TaskManager shuffles the given data directories randomly and select the data directory in different order.

      Attachments

        Issue Links

          Activity

            People

              tanyuxin Yuxin Tan
              kevin.cyj Yingjie Cao
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: