[SPARK-43876] Enable fast hashmap for distinct queries - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: 3.5.0
Fix Version/s: 3.5.0
Component/s: SQL
Labels:
None

Description

Spark will enable fast hash map for primitive data types in HashAggregateExec.

Could we also enable this for distinct queries which bufferSchema is empty.

For example, we can also build a fast hash map with the key a + b for query

 SELECT distinct a, b from tab

Attachments

Activity

People

Assignee:: Wan Kun

Reporter:: Wan Kun

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 30/May/23 03:41

Updated:: 20/Jun/23 17:14

Resolved:: 20/Jun/23 17:14