  1. Hive
  2. HIVE-24990

Support distinct in window aggregation in vectorized mode


Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: UDF, Vectorization
    • Labels: None

    Description

      The PTF operator cannot be vectorized if the query uses a windowing function with DISTINCT, because the DISTINCT variants of the vectorized aggregate functions are not implemented yet.

      SELECT sum(DISTINCT a) OVER (PARTITION BY b) FROM t1;
      

      The only exception is count.
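To make the expected result concrete, here is a minimal Python sketch of what the query above computes. It is an illustration of the semantics only, not Hive code: the function name `sum_distinct_over_partition` and the sample rows are hypothetical, and it assumes standard SQL behavior (no ORDER BY, so the window frame is the whole partition; NULL inputs are ignored; a partition with no non-NULL values yields NULL).

```python
from collections import defaultdict


def sum_distinct_over_partition(rows):
    """Emulate sum(DISTINCT a) OVER (PARTITION BY b) for rows of (a, b).

    Returns one value per input row, in input order.
    """
    # First pass: collect the distinct non-NULL values of a per partition key b.
    distinct_by_partition = defaultdict(set)
    for a, b in rows:
        if a is not None:
            distinct_by_partition[b].add(a)

    # Sum each partition's distinct values once.
    totals = {b: sum(values) for b, values in distinct_by_partition.items()}

    # Second pass: every row receives its partition's total;
    # partitions with no non-NULL value get None (SQL NULL).
    return [totals.get(b) for _, b in rows]


# Hypothetical sample data: partition "x" contains a duplicate value of a.
rows = [(1, "x"), (1, "x"), (2, "x"), (5, "y"), (None, "z")]
result = sum_distinct_over_partition(rows)
print(result)
```

The point of the sketch is the deduplication step per partition: a vectorized DISTINCT variant would need to track the values already seen within each partition before aggregating, which the non-DISTINCT vectorized aggregates do not do.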

      Functions that have a vectorized version but no vectorized DISTINCT version:

      row_number
      rank
      dense_rank
      min
      max
      sum
      avg
      first_value
      last_value
      

          People

            Assignee: Unassigned
            Reporter: Krisztian Kasa (kkasa)
            Votes: 0
            Watchers: 2

            Dates

              Created:
              Updated: