Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-7780

Query with OVER clause return duplicate results[Spark Branch]

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Not A Problem
    • None
    • None
    • Spark
    • None

    Description

      A simple query with the OVER clause return duplicate results.

      hive> select address, count(id) over(partition by address) from test;
      Query ID = root_20140819150000_f5506fcc-4950-424b-a134-56fc5b06d6eb
      Total jobs = 1
      Launching Job 1 out of 1
      Number of reduce tasks determined at compile time: 1
      In order to change the average load for a reducer (in bytes):
        set hive.exec.reducers.bytes.per.reducer=<number>
      In order to limit the maximum number of reducers:
        set hive.exec.reducers.max=<number>
      In order to set a constant number of reducers:
        set mapreduce.job.reduces=<number>
      OK
      QD	1
      SH	2
      SH	2
      SZ	2
      SZ	2
      

      Attachments

        Issue Links

          Activity

            People

              chengxiang li Chengxiang Li
              chengxiang li Chengxiang Li
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: