HIVE-11845

Bucket map join requires exact number of files as number of buckets


Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • 1.3.0, 2.0.0
    • None
    • None
    • None

    Description

      I recently tried to write a test case for bucket map join in Tez on an ACID table but was unable to do so, because bucket map join expects the number of files in a table/partition to be exactly equal to the number of buckets.

      Steps to reproduce:
      create table src_acid(key string, value string) clustered by (key) into 4 buckets stored as orc tblproperties("transactional"="true");
      insert into table src_acid values("key1","value1");
      insert into table src_acid values("key2","value2");

      After inserting 2 rows into the src_acid table, there will be a total of 8 files in the table directory, which does not match the number of buckets (4).
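
The mismatch can be illustrated with a small sketch. It assumes (the report does not state this) that each insert writes its own delta directory containing one file per bucket, which would account for 2 x 4 = 8 files. The directory and file names are illustrative, and `strict_check` and `group_by_bucket` are hypothetical helpers, not Hive APIs.

```python
import re
from collections import defaultdict

NUM_BUCKETS = 4  # from "clustered by (key) into 4 buckets"

# Hypothetical listing of the table directory after the two inserts,
# ASSUMING one delta directory per insert and one file per bucket in each.
files = [
    f"delta_{txn:07d}_{txn:07d}/bucket_{b:05d}"
    for txn in (1, 2)
    for b in range(NUM_BUCKETS)
]


def strict_check(paths, num_buckets):
    """Check of the kind the report describes: file count must equal bucket count."""
    return len(paths) == num_buckets


def group_by_bucket(paths):
    """Illustrative alternative: group files by the bucket id in the file name."""
    groups = defaultdict(list)
    for path in paths:
        match = re.search(r"bucket_(\d+)$", path)
        if match:
            groups[int(match.group(1))].append(path)
    return dict(groups)


if __name__ == "__main__":
    print(len(files), "files for", NUM_BUCKETS, "buckets")
    print("strict check passes:", strict_check(files, NUM_BUCKETS))
    print("distinct bucket ids:", sorted(group_by_bucket(files)))
```

Under these assumptions the strict count comparison fails, while grouping by bucket id still yields exactly 4 groups. Whether Hive could safely rely on such a grouping for bucket map join is not established by this report.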

          People

            Assignee: Vikram Dixit K (vikram.dixit)
            Reporter: Prasanth Jayachandran (prasanth_j)
            Votes: 0
            Watchers: 5
