Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-10043

[Rust] [DataFusion] Introduce support for DISTINCT by partially implementing COUNT(DISTINCT)

    XMLWordPrintableJSON

Details

    Description

      I am unsure where support for DISTINCT may be on the DataFusion roadmap, so I've filed this with the "Wish" type and "Minor" priority to reflect that this is a proposal:

      Introduce DISTINCT into DataFusion by partially implementing COUNT(DISTINCT). The ultimate goal is to fully support the DISTINCT keyword, but to get implementation started, limit the scope of this work to:

      • the COUNT() aggregate function
      • a single expression in COUNT(), i.e., COUNT(DISTINCT c1), but not COUNT(DISTINCT c1, c2)
      • only queries with a GROUP BY clause
      • integer types

      Attachments

        Issue Links

          Activity

            People

              drusso Daniel Russo
              drusso Daniel Russo
              Votes:
              1 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 9h 20m
                  9h 20m