Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-5657

TopN produces incorrect results with count(distinct)

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • None
    • 0.13.0
    • None
    • None

    Description

      Attached patch illustrates the problem.
      limit_pushdown test has various other cases of aggregations and distincts, incl. count-distinct, that work correctly (that said, src dataset is bad for testing these things because every count, for example, produces one record only), so something must be special about this.
      I am not very familiar with distinct- code and these nuances; if someone knows a quick fix feel free to take this, otherwise I will probably start looking next week.

      Attachments

        1. D13797.1.patch
          47 kB
          Phabricator
        2. D13797.2.patch
          31 kB
          Phabricator
        3. example.patch
          2 kB
          Sergey Shelukhin
        4. HIVE-5657.02.patch
          55 kB
          Sergey Shelukhin
        5. HIVE-5657.03.patch
          56 kB
          Sergey Shelukhin
        6. HIVE-5657.1.patch.txt
          27 kB
          Navis Ryu

        Activity

          People

            sershe Sergey Shelukhin
            sershe Sergey Shelukhin
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: