Flink / FLINK-1038

Adding a collection output format

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.7.0-incubating
    • Component/s: None
    • Labels: None

    Description

      Similar to the existing LocalCollectionOutputFormat or Spark's collect() method, it would be nice to have a CollectionOutputFormat that also works when jobs run on a cluster. This output format gathers the results of a sink from all TaskManagers in the JVM that submitted the job plan and provides them as a collection, similar to accumulators. This would avoid the tedious task of going to HDFS to read and parse the individual result files.

      PS. We have already created such an output format and can contribute it.
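
      The gathering idea described above can be illustrated with a minimal, Flink-free sketch. It is not the output format offered in this issue and uses no Flink API: the names CollectionSinkSketch, TaskLocalSink and runJob are hypothetical, and threads in a single JVM stand in for TaskManagers. Each parallel task buffers its records locally and hands the buffer back when it closes; the submitting side merges all partial results into one collection.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class CollectionSinkSketch {

    /** Hypothetical per-task sink: buffers records instead of writing files. */
    static final class TaskLocalSink<T> {
        private final List<T> buffer = new ArrayList<>();

        void writeRecord(T record) {
            buffer.add(record);
        }

        /** Returns everything this task has written. */
        List<T> close() {
            return buffer;
        }
    }

    /**
     * Runs `parallelism` simulated sink tasks (assumption: threads stand in
     * for TaskManagers) and merges their partial results on the caller side.
     */
    static List<Integer> runJob(int parallelism, int recordsPerTask) {
        ExecutorService pool = Executors.newFixedThreadPool(parallelism);
        List<Future<List<Integer>>> partials = new ArrayList<>();
        try {
            for (int task = 0; task < parallelism; task++) {
                final int taskIndex = task;
                partials.add(pool.submit(() -> {
                    TaskLocalSink<Integer> sink = new TaskLocalSink<>();
                    for (int i = 0; i < recordsPerTask; i++) {
                        sink.writeRecord(taskIndex * recordsPerTask + i);
                    }
                    return sink.close();
                }));
            }
            // The "client" gathers the partial results, much like an
            // accumulator that is merged after the job has finished.
            List<Integer> merged = new ArrayList<>();
            for (Future<List<Integer>> partial : partials) {
                merged.addAll(partial.get());
            }
            return merged;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while collecting", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("a sink task failed", e);
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        List<Integer> results = runJob(4, 3);
        System.out.println(results.size() + " records collected: " + results);
    }
}
```

      In this sketch the merged collection lives entirely in the memory of the submitting JVM, so the approach only suits results that are small enough to fit there.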

          People

            Assignee: Unassigned
            Reporter: Sebastian Kruse (sekruse)
            Votes: 0
            Watchers: 5

            Dates

              Created:
              Updated:
              Resolved: