Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-4366 Aggregation Improvement
  3. SPARK-11028

When planning queries without partial aggregation support, we should try to use TungstenAggregate.

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Incomplete
    • None
    • None
    • SQL

    Description

      With SPARK-11017, we can run DeclarativeAggregate Functions in TungstenAggregate. So, when we plan queries having functions that do not support partial aggregation, we can use TungstenAggregate whenever possible. The reason that we only use SortBasedAggregate is that HiveUDAF is the only function that does not support partial aggregation and it is a DeclarativeAggregate function.

      Attachments

        Activity

          People

            Unassigned Unassigned
            yhuai Yin Huai
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: