Apache Drill / DRILL-5691

multiple count distinct query planning error at physical phase

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.9.0, 1.10.0

    Description

      I materialized the count distinct query results in a cache and added a plugin rule that rewrites (Aggregate, Aggregate, Project, Scan) or (Aggregate, Aggregate, Scan) into (Project, Scan) at the PARTITION_PRUNING phase. Then, once a user issues a count distinct query, it is translated into a query against the cache to get the result.
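
      As a rough illustration of the rewrite described above, here is a toy model. It is not the actual Drill/Calcite rule: operators are plain strings, and the names `MATCHED_SHAPES` and `rewrite_to_cache` are hypothetical.

```python
from typing import List

# Operator chains (root first) that the hypothetical rule matches,
# taken from the two shapes named in the description.
MATCHED_SHAPES = {
    ("Aggregate", "Aggregate", "Project", "Scan"),
    ("Aggregate", "Aggregate", "Scan"),
}


def rewrite_to_cache(plan: List[str]) -> List[str]:
    """Rewrite a matched chain to (Project, Scan); leave any other plan unchanged."""
    if tuple(plan) in MATCHED_SHAPES:
        # The new Scan stands for a scan over the materialized cache.
        return ["Project", "Scan"]
    return list(plan)
```

      In this sketch, a plan whose operator chain has any other shape passes through untouched, so the rewrite only fires when the whole chain matches one of the two patterns.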

      eg1: "select count, sum(a), count(distinct b) from t where dt=xx"
      eg2: "select count, sum(a), count(distinct b), count(distinct c) from t where dt=xxx"
      eg3: "select count(distinct b), count(distinct c) from t where dt=xxx"

      eg1 works and returns the result I expected, but eg2 fails at the physical planning phase. The error output is here: https://gist.github.com/weijietong/1b8ed12db9490bf006e8b3fe0ee52269
      eg3 fails with a similar error.

            People

              Unassigned
              Weijie Tong (weijie)
              Aman Sinha
              Votes: 0
              Watchers: 2
