Apache Arrow / ARROW-3978

[C++] Implement hashing, dictionary-encoding for StructArray

Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Component/s: C++

    Description

      This is a central requirement for hash aggregations such as:

      SELECT AGG_FUNCTION(expr)
      FROM table
      GROUP BY expr1, expr2, ...
      

      The materialized keys in the GROUP BY section form a struct, which can be incrementally hashed to produce dictionary codes suitable for computing aggregates or any other purpose.
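To make the idea concrete, here is a minimal sketch of incrementally dictionary-encoding struct-like keys and using the resulting codes to drive an aggregate. It deliberately uses only the C++ standard library, not the Arrow API; `GroupKey`, `GroupKeyHash`, `StructDictionaryEncoder`, and `SumByCode` are hypothetical names chosen for illustration, and the two-field key (an `int32_t` and a string) is an assumption standing in for `expr1, expr2`. Null handling is omitted.

```cpp
#include <cstddef>
#include <cstdint>
#include <functional>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical two-field key: one "row" of the struct formed by
// GROUP BY expr1, expr2.
struct GroupKey {
  int32_t expr1;
  std::string expr2;
  bool operator==(const GroupKey& other) const {
    return expr1 == other.expr1 && expr2 == other.expr2;
  }
};

// Combine the per-field hashes into a single hash for the whole row.
struct GroupKeyHash {
  std::size_t operator()(const GroupKey& key) const {
    std::size_t h = std::hash<int32_t>{}(key.expr1);
    // Boost-style hash_combine mixing step.
    h ^= std::hash<std::string>{}(key.expr2) + 0x9e3779b9 + (h << 6) + (h >> 2);
    return h;
  }
};

// Assigns a dense int32 code to each distinct key, in order of first
// appearance. State persists across calls, so batches can be encoded
// incrementally.
class StructDictionaryEncoder {
 public:
  std::vector<int32_t> Encode(const std::vector<GroupKey>& batch) {
    std::vector<int32_t> codes;
    codes.reserve(batch.size());
    for (const GroupKey& key : batch) {
      auto next_code = static_cast<int32_t>(dictionary_.size());
      auto result = codes_.emplace(key, next_code);
      if (result.second) dictionary_.push_back(key);  // first time seen
      codes.push_back(result.first->second);
    }
    return codes;
  }

  const std::vector<GroupKey>& dictionary() const { return dictionary_; }

 private:
  std::unordered_map<GroupKey, int32_t, GroupKeyHash> codes_;
  std::vector<GroupKey> dictionary_;
};

// SUM aggregate driven by dictionary codes: one accumulator slot per group.
std::vector<int64_t> SumByCode(const std::vector<int32_t>& codes,
                               const std::vector<int64_t>& values,
                               std::size_t num_groups) {
  std::vector<int64_t> sums(num_groups, 0);
  for (std::size_t i = 0; i < codes.size(); ++i) sums[codes[i]] += values[i];
  return sums;
}
```

Because the codes are dense and stable across batches, the aggregate state can be a plain array indexed by code, which is what makes this encoding attractive for GROUP BY.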

      There are a few subtasks related to this, such as efficiently constructing a record (one that can be hashed quickly) to identify each "row" in the struct. Maybe we should start with that.
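One possible shape for such a record, sketched under assumptions: each row's key fields are serialized into a single contiguous byte string, which can then be hashed in one pass. The names `KeyColumns`, `AppendFixed`, `EncodeRow`, and `HashRecord` are hypothetical, the column types are chosen only for illustration, and FNV-1a is used as a simple stand-in hash rather than whatever Arrow would actually use. Nulls are not handled.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <string>
#include <vector>

// Hypothetical columnar input: two key columns of equal length.
struct KeyColumns {
  std::vector<int32_t> expr1;
  std::vector<std::string> expr2;
};

// Append the raw bytes of a fixed-width value to the record.
template <typename T>
void AppendFixed(std::string* record, const T& value) {
  char bytes[sizeof(T)];
  std::memcpy(bytes, &value, sizeof(T));
  record->append(bytes, sizeof(T));
}

// Serialize row `i` into one contiguous byte record. The string field is
// length-prefixed so that field boundaries stay unambiguous if more
// variable-length fields are added later.
std::string EncodeRow(const KeyColumns& cols, std::size_t i) {
  std::string record;
  AppendFixed(&record, cols.expr1[i]);
  AppendFixed(&record, static_cast<uint32_t>(cols.expr2[i].size()));
  record.append(cols.expr2[i]);
  return record;
}

// 64-bit FNV-1a over the record bytes.
uint64_t HashRecord(const std::string& record) {
  uint64_t h = 0xcbf29ce484222325ULL;  // FNV offset basis
  for (unsigned char c : record) {
    h ^= c;
    h *= 0x100000001b3ULL;  // FNV prime
  }
  return h;
}
```

Rows with equal key values produce byte-identical records, so equality checks reduce to a byte comparison and hashing needs no per-field dispatch. The cost is one copy of the key data per row, which a real implementation would likely want to amortize with a shared buffer.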

            People

              Assignee: Unassigned
              Reporter: Wes McKinney (wesm)
              Votes: 0
              Watchers: 12