Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-12873

[C++][Compute] Support tagging ExecBatches with arbitrary extra information

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • C++

    Description

      Ideally, ExecBatches could be tagged with arbitrary optional objects for tracing purposes and to transmit execution hints from one ExecNode to another.

      These should not be explicit members like ExecBatch::selection_vector is, since they may not originate from the arrow library. For an example within the arrow project: libarrow_dataset will be used to produce ScanNodes and a WriteNodes and it's useful to tag scanned batches with their Fragment of origin. However adding ExecBatch::fragment would result in a cyclic dependency.

      To facilitate this tagging capability, we would need a type erased container something like

      struct AnySet {
        void* Get(tag_t tag);
        void Set(tag_t tag, void* value, FnOnce<void(void*)> destructor);
      };
      

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              bkietz Ben Kietzman
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated: