Flink / FLINK-3133

Introduce collect()/count()/print() methods in DataStream API

Details

    Description

      The DataSet API's methods collect(), count(), and print() should be mirrored to the DataStream API.

      The semantics of the calls differ, because a stream is unbounded and cannot be collected in full. We need to be able to sample part of a stream, e.g. by passing a time period as an argument to the methods. Users would retrieve the results through the JobClient.

      StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
      
      DataStream<DataType> streamData = env.addSource(..).map(..);
      
      JobClient jobClient = env.executeWithControl();
      
      Iterable<DataType> sampled = jobClient.sampleStream(streamData, Time.seconds(5));
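
The difference in semantics can be illustrated without Flink. The following is a minimal sketch in plain Java, not the proposed implementation: `StreamSampling`, `sampleFor`, and the `maxElements` cap are hypothetical names introduced here for illustration only. It shows why collect()/count() on an unbounded source needs a bound such as a time window, and how a sample taken over that window can be returned as an ordinary collection.

```java
import java.time.Duration;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.stream.Stream;

// Hypothetical illustration only; none of these names are part of the Flink API.
public class StreamSampling {

    /**
     * Pulls elements from a possibly unbounded source until the time window
     * elapses, the element cap is reached, or the source is exhausted.
     */
    public static <T> List<T> sampleFor(Iterator<T> source, Duration window, int maxElements) {
        long start = System.nanoTime();
        long windowNanos = window.toNanos();
        List<T> sampled = new ArrayList<>();
        // Check the cap and the clock before hasNext(), so a finished window
        // never triggers another read from the source.
        while (sampled.size() < maxElements
                && System.nanoTime() - start < windowNanos
                && source.hasNext()) {
            sampled.add(source.next());
        }
        return sampled;
    }

    public static void main(String[] args) {
        // An unbounded source: calling collect() on it without a bound would never return.
        Iterator<Long> unbounded = Stream.iterate(0L, n -> n + 1).iterator();

        List<Long> sampled = sampleFor(unbounded, Duration.ofMillis(20), 1000);

        // The count()/print() analogues operate on the bounded sample, not on the stream.
        System.out.println("sampled element count: " + sampled.size());
        sampled.stream().limit(5).forEach(System.out::println);
    }
}
```

The element cap is an extra assumption of this sketch: it keeps memory bounded when the source produces elements faster than expected within the window.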
      

            People

              Assignee: Unassigned
              Reporter: Maximilian Michels (mxm)
              Votes: 0
              Watchers: 10
