Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-13244

Delayed Scheduler in Flink

Attach filesAttach ScreenshotAdd voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      Currently with the flink scheduler and local splits

      • Suppose i use LocalSplitter to create splits for per hostname
      • Now a host requests for a local splits, there are chances that this node will get a local split but in case of no local split present , this node might get remote split and hence the data locality is of no use in these cases because there might be just some other node just round the corner who was just going to ask for its own local split but because this node asked for the split first , both the hosts/node will be given remote splits and hence the overall throughput of the system might decrease.
      • Proposal is to use Delayed scheduling. This has shown to be quite effective against these cases and might help us increase the overall throughput given the latency difference between execution of local and remote split is significant.
      • https://cs.stanford.edu/~matei/papers/2010/eurosys_delay_scheduling.pdf
        *

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            mridulv09 Mridul Verma

            Dates

              Created:
              Updated:

              Slack

                Issue deployment