Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-9112

Query may generate different results depending on the number of reducers

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.14.0
    • 1.0.0
    • Logical Optimizer
    • None

    Description

      Some queries may generate different results depending on the number of reducers, for example, tests like ppd_multi_insert.q, join_nullsafe.q, subquery_in.q, etc.

      Take subquery_in.q as example, if we add

      set mapred.reduce.tasks=3;
      

      to this test file, the result will be different (and wrong):

      @@ -903,5 +903,3 @@ where li.l_linenumber = 1 and
       POSTHOOK: type: QUERY
       POSTHOOK: Input: default@lineitem
       #### A masked pattern was here ####
      -108570 8571
      -4297   1798
      

      Attachments

        1. HIVE-9112.1.patch
          47 kB
          Ted Xu
        2. HIVE-9112.2.patch
          68 kB
          Chao Sun
        3. HIVE-9112.patch
          33 kB
          Ted Xu
        4. HIVE-9112-0.14.1-branch.patch
          37 kB
          Ashutosh Chauhan

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            tedxu Ted Xu Assign to me
            csun Chao Sun
            Votes:
            0 Vote for this issue
            Watchers:
            8 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment