Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-17253

Adding SUMMARY statement to HPL/SQL

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 3.0.0
    • hpl/sql
    • None

    Description

      Adding SUMMARY statement to HPL/SQL to describe a data set (table, query result) similar to Python and R.

      For each column output the data type, number of distinct values, non-NULL rows, mean, std, percentiles, min, max. Output additional stats for categorical columns. This helps perform quick and easy exploratory data analysis for SQL devs and business users. http://hplsql.org/summary

      Attachments

        1. HIVE-17253.1.patch
          17 kB
          Dmitry Tolpeko

        Activity

          People

            dmtolpeko Dmitry Tolpeko
            dmtolpeko Dmitry Tolpeko
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: