Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-8180

Refactor driver outputs

    XMLWordPrintableJSON

Details

    Description

      The change in 1.4 of algorithm results from Tuples to POJOs broke the writing of results as csv. Testing this was and is a challenge so was not done. There are many additional improvements which can be made based on recent improvements to the Gelly framework.

      Result hash and analytic results should always be printed to the screen. Results can optionally be written to stdout or to a file. In the latter case the result hash and analytic results (and schema) will also be written to a top-level file.

      The "verbose" output strings can be replaced with json which is just as human-readable but also machine readable. In addition to csv and json it may be simple to support xml, etc. Computed fields will be optionally printed to screen or file (currently these are always printed to screen but never to file).

      Testing will be simplified since formats are now a separate concern from the stream.

      Jackson is available to Gelly as a dependency provided in the Flink distribution but we may want to build flink-gelly-examples as an uber jar in order to include additional modules (which may require a direct dependency on Jackson, which would require checkstyle suppressions around the unshaded jackson imports).

      Attachments

        Activity

          People

            Unassigned Unassigned
            greghogan Greg Hogan
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: