Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-9842

[C++] Explore alternative strategy for Compare kernel implementation for better performance

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • C++
    • None

    Description

      The compiler may be able to vectorize comparison options if the bitpacking of results is deferred until the end (or in chunks). Instead, a temporary bytemap can be populated on a chunk-by-chunk basis and then the bytemaps can be bitpacked into the output buffer. This may also reduce the code size of the compare kernels (which are actually quite large at the moment)

      Attachments

        1. movemask.patch
          2 kB
          yibocai#1
        2. movemask-in-chunks.diff
          2 kB
          yibocai#1

        Activity

          People

            Unassigned Unassigned
            wesm Wes McKinney
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated: