Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-3664

Create a method to easily Summarize a DataSet

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.1.0
    • None
    • None

    Description

      Here is an example:

      /**
       * Summarize a DataSet of Tuples by collecting single pass statistics for all columns
       */
      public Tuple summarize()
      
      Dataset<Tuple3<Double, String, Boolean>> input = // [...]
      Tuple3<DoubleColumnSummary,StringColumnSummary,BooleanColumnSummary> summary = input.summarize()
      
      summary.getField(0).stddev()
      summary.getField(1).maxStringLength()
      
      

      Attachments

        Activity

          People

            tlisonbee Todd Lisonbee
            tlisonbee Todd Lisonbee
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: