Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-1621

Create a generalized combine function

    XMLWordPrintableJSON

Details

    Description

      Flink allows combiners which accept a type I and combine the values of this type into type O. In Google Dataflow, combiners are more generalized. They accept an Input I, produce an intermediate combine value of T, and finally an output O. Flink's combiners are like the SimpleCombineFn in Google Dataflow.

      Right now, we translate the KeyedCombineFn into a SortPartition followed by a MapPartition to emulate the Combiner's behavior. Rudimentary performance tests showed that this behavior causes a significant increase in run time compared to the proper Combine implementation.

      Let's implement a more generalized Combiner to create a better mapping from Google Dataflow to Flink.

      Attachments

        Issue Links

          Activity

            People

              mxm Maximilian Michels
              mxm Maximilian Michels
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: