Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-9052

Missing grouping rows when multi-insert

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 0.14.0
    • None
    • None
    • None

    Description

      Giving a table and data as below:

      create table score (class string, student string, score int) ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' ;
      -----------------Data--------------
      class1,Jack,7
      class1,Mike,8
      class2,Tom,7

      The HQL 'from score INSERT OVERWRITE DIRECTORY '/tmp/dpp/hql1' select class,student , count(score) group by class, student grouping sets ((class), (class,student)) '
      will get result like :
      ---------hql1-------------
      class1\N2
      class1Jack1
      class1Mike1
      class2\N1
      class2Tom1

      And the HQL 'from score INSERT OVERWRITE DIRECTORY '/tmp/dpp/hql2' select class,student , sum(score) group by class, student grouping sets ((class), (class,student)) '
      will get result like :
      ---------hql2-------------
      class1\N15
      class1Jack7
      class1Mike8
      class2\N7
      class2Tom7

      But, if you run the HQL with above two inserts, 'from score INSERT OVERWRITE DIRECTORY '/tmp/dpp/hql1' select class,student , count(score) group by class, student grouping sets ((class), (class,student)) INSERT OVERWRITE DIRECTORY '/tmp/dpp/hql2' select class,student , sum(score) group by class, student grouping sets ((class), (class,student))'
      , the results will miss some grouping rows as below:

      ---------hql1-------------
      class1Jack1
      class1Mike1
      class2Tom1

      ---------hql2-------------
      class1Jack7
      class1Mike8
      class2Tom7

      Attachments

        Activity

          People

            Unassigned Unassigned
            Jet Jet Guo
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: