Apache Drill
  1. Apache Drill
  2. DRILL-361

Optimization for aggregation functions workspace


    • Type: Improvement Improvement
    • Status: Open
    • Priority: Minor Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: Future
    • Component/s: None
    • Labels:


      Currently, when an aggregation operator populates its outgoing record batch, it goes through the value vectors in its workspace and produces the output value.. for instance if the aggregate was AVG, the aggregate function workspace variables are sum and count, and we compute the sum/count to populate the output record. However, this is sub-optimal for the case where the aggregate function is simply doing a SUM or COUNT etc. In those cases, we should be able to directly transfer the entire workspace value vector to the outgoing batch and get better performance. We could maintain some property in the aggregate function itself that indicates whether it is a candidate for such an operation.


        Tony Stevenson made changes -
        Workflow no-reopen-closed, patch-avail, testing [ 12860534 ] Drill workflow [ 12935739 ]
        Jacques Nadeau made changes -
        Priority Major [ 3 ] Minor [ 4 ]
        Jacques Nadeau made changes -
        Issue Type Bug [ 1 ] Improvement [ 4 ]
        Jacques Nadeau made changes -
        Fix Version/s Future [ 12326743 ]
        Jake Farrell made changes -
        Field Original Value New Value
        Workflow no-reopen-closed, patch-avail [ 12841087 ] no-reopen-closed, patch-avail, testing [ 12860534 ]
        Aman Sinha created issue -


          • Assignee:
            Aman Sinha
          • Votes:
            0 Vote for this issue
            1 Start watching this issue


            • Created: