Uploaded image for project: 'Apache Quickstep'
  1. Apache Quickstep
  2. QUICKSTEP-28

Make multiple aggregations work on a common hash table

    XMLWordPrintableJSON

Details

    Description

      Currently each aggregation operation uses a separate hash table. This has the following disadvantages:

      • Multiple probes happen for the same key, once per hash table.
      • Space for keys is duplicated across hash tables.
      • Mutexes are acquired once per aggregation operation for each value.

      A more efficient design is to have a common hash table where the hash table payload for each key is partitioned among multiple aggregation handles.
      Changes are needed both to the aggregate and merge operations.

      Preliminary experiments suggest up to 3x speedup for TPC-H Q1 that has 8 aggregation operations.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              rathijit Rathijit Sen
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: