Flink / FLINK-28374 Some further improvements of blocking shuffle / FLINK-28551

Store the number of bytes instead of the number of buffers in index entry for sort-shuffle

Details

    Description

      Currently, each index entry in the sort-shuffle index file has a field that stores the number of buffers in the corresponding data region. The problem is that the data boundary of a region is hard to determine before reading the file. To solve this, we can store the number of bytes instead of the number of buffers in the index entry. Based on this change, we can make further optimizations, for example, reading more data than a single buffer at a time for better sequential IO, as mentioned in FLINK-28373.
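A minimal sketch of the idea, not Flink's actual implementation: the class name `RegionIndexSketch`, its methods, and the 16-byte entry layout (an 8-byte file offset followed by an 8-byte region size) are all assumptions made for illustration. It shows how a byte count in the index entry gives the region boundary directly, which in turn allows planning sequential reads larger than one buffer.

```java
import java.nio.ByteBuffer;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch; names and layout are illustrative, not Flink's actual classes. */
public class RegionIndexSketch {

    /** Assumed entry layout: 8-byte file offset followed by 8-byte region size in bytes. */
    static final int INDEX_ENTRY_SIZE = Long.BYTES + Long.BYTES;

    /** Appends one index entry for a data region to the index buffer. */
    static void writeEntry(ByteBuffer index, long regionOffset, long regionBytes) {
        index.putLong(regionOffset);
        index.putLong(regionBytes);
    }

    /** Reads the entry at entryIndex and returns {startOffset, endOffset} of the region. */
    static long[] regionBounds(ByteBuffer index, int entryIndex) {
        int pos = entryIndex * INDEX_ENTRY_SIZE;
        long offset = index.getLong(pos);
        long numBytes = index.getLong(pos + Long.BYTES);
        // With a byte count, the boundary is known without touching the data file.
        return new long[] {offset, offset + numBytes};
    }

    /** Splits [start, end) into sequential reads of at most maxReadBytes; returns {offset, length} pairs. */
    static List<long[]> planReads(long start, long end, long maxReadBytes) {
        List<long[]> reads = new ArrayList<>();
        for (long pos = start; pos < end; pos += maxReadBytes) {
            reads.add(new long[] {pos, Math.min(maxReadBytes, end - pos)});
        }
        return reads;
    }

    public static void main(String[] args) {
        ByteBuffer index = ByteBuffer.allocate(2 * INDEX_ENTRY_SIZE);
        writeEntry(index, 0L, 100_000L);
        writeEntry(index, 100_000L, 50_000L);

        // Plan reads for the second region using a read size chosen by the caller.
        long[] bounds = regionBounds(index, 1);
        for (long[] read : planReads(bounds[0], bounds[1], 32_768L)) {
            System.out.println("read offset=" + read[0] + " length=" + read[1]);
        }
    }
}
```

With a buffer count in the entry, the reader would instead have to parse each buffer header in turn to find where the region ends, so the read size could not be decided up front.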

            People

              tanyuxin Yuxin Tan
              kevin.cyj Yingjie Cao
              Votes: 0
              Watchers: 2

              Dates

                Created:
                Updated:
                Resolved: