Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-12301

[C++][Compute] Use generic hash-aggregate for DictionaryArrays

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • C++
    • None

    Description

      When calculating unique for chunked DictionaryArrays we currently run through all chunks and unify their dictionaries and then collect chunk indices. We could avoid the dictionary unification by using a generic hash.

      See discussion here and here

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              rokm Rok Mihevc
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated: