Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-43876

Enable fast hashmap for distinct queries

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 3.5.0
    • 3.5.0
    • SQL
    • None

    Description

      Spark will enable fast hash map for primitive data types in HashAggregateExec.

      Could we also enable this for distinct queries which bufferSchema is empty.

      For example, we can also build a fast hash map with the key a + b for query

       SELECT distinct a, b from tab 

      Attachments

        Activity

          People

            wankun Wan Kun
            wankun Wan Kun
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: