Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-13932

Hive SMB Map Join with small set of LIMIT failed with NPE

Log workAgile BoardRank to TopRank to BottomVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.0.0, 2.0.0
    • Fix Version/s: 1.3.0, 2.1.1, 2.2.0
    • Component/s: None
    • Labels:
      None

      Description

      1) prepare sample data:
      a=1
      while [[ $a -lt 100 ]]; do echo $a ; let a=$a+1; done > data

      2) prepare source hive table:
      CREATE TABLE `s`(`c` string);
      load data local inpath 'data' into table s;

      3) prepare the bucketed table:
      set hive.enforce.bucketing=true;
      set hive.enforce.sorting=true;
      CREATE TABLE `t`(`c` string) CLUSTERED BY (c) SORTED BY (c) INTO 5 BUCKETS;
      insert into t select * from s;

      4) reproduce this issue:
      SET hive.auto.convert.sortmerge.join = true;
      SET hive.auto.convert.sortmerge.join.bigtable.selection.policy = org.apache.hadoop.hive.ql.optimizer.LeftmostBigTableSelectorForAutoSMJ;
      SET hive.auto.convert.sortmerge.join.noconditionaltask = true;
      SET hive.optimize.bucketmapjoin = true;
      SET hive.optimize.bucketmapjoin.sortedmerge = true;
      select * from t join t t1 on t.c=t1.c limit 1;

        Attachments

        1. HIVE-13932.1.patch
          5 kB
          Yongzhi Chen

          Activity

          $i18n.getText('security.level.explanation', $currentSelection) Viewable by All Users
          Cancel

            People

            • Assignee:
              ychena Yongzhi Chen Assign to me
              Reporter:
              ychena Yongzhi Chen

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment