Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-1618

Add parallel time discretisation for time-window transformations

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Implemented
    • None
    • None
    • API / DataStream
    • None

    Description

      Currently discretizers for all windowing policies including time are executed with parallelism 1 when they define global windows. (for instance: sum of the last 10 minutes)

      While this is necessary for arbitrary policies like delta based or user-defined policies. Some discretizers such as Time can be implemented in a distributed fashion.

      Distributed time discretisers (and other types) can be implemented in the following way:

      -The discretisers should create StreamWindow s with incrementally increasing ID-s starting from the same value so that it is possible to merge them after the transformation

      • The partitioner for each discretizer should send the number of partitions created to the merger (the merger should be aware of the number of partitioners present to wait for all the information)
      • Based on all the partitioning info the merger can merge the windows properly afterwards

      Attachments

        Activity

          People

            gyfora Gyula Fora
            gyfora Gyula Fora
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: