Hive / HIVE-3852

Multi-groupby optimization fails when the same distinct column is used two or more times

Details

    • Type: Bug
    • Status: Closed
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.11.0
    • Component/s: Query Processor
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      FROM INPUT
      INSERT OVERWRITE TABLE dest1 
      SELECT INPUT.key, sum(distinct substr(INPUT.value,5)), count(distinct substr(INPUT.value,5)) GROUP BY INPUT.key
      INSERT OVERWRITE TABLE dest2 
      SELECT INPUT.key, sum(distinct substr(INPUT.value,5)), avg(distinct substr(INPUT.value,5)) GROUP BY INPUT.key;
      

      The query above fails with the exception: FAILED: IndexOutOfBoundsException Index: 0, Size: 0
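
      To try the query in this report, the three tables it names have to exist first. The report does not give their schemas, so the definitions below are only a sketch: the column names val1 and val2 and all column types are assumptions, chosen so that the sum, count, and avg results have a place to land.

      ```sql
      -- Hypothetical schemas (not taken from the report).
      -- INPUT needs a key column and a string value column for substr().
      CREATE TABLE INPUT (key INT, value STRING);

      -- dest1 receives key, sum(distinct ...), count(distinct ...).
      CREATE TABLE dest1 (key INT, val1 DOUBLE, val2 BIGINT);

      -- dest2 receives key, sum(distinct ...), avg(distinct ...).
      CREATE TABLE dest2 (key INT, val1 DOUBLE, val2 DOUBLE);
      ```

      With tables of this shape in place, running the multi-insert statement above on an affected version should reach the planning stage where the exception is reported.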

      Attachments

        1. HIVE-3852.D7737.1.patch
          19 kB
          Phabricator

          People

            Assignee: Navis Ryu (navis)
            Reporter: Navis Ryu (navis)
            Votes: 0
            Watchers: 6
