Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
1.3.0, 2.0.0
-
None
-
None
-
None
Description
I recently tried to write a test case for bucket map join in tez on ACID table but was unable to do so because bucket map join expects same number of files in table/partition as that of number of buckets.
Steps to reproduce:
create table src_acid(key string, value string) clustered by (key) into 4 buckets stored as orc tblproperties("transactional"="true");
insert into table src_acid values("key1","value1");
insert into table src_acid values("key2","value2");
After inserting 2 rows into the src_acid table, there will be total of 8 files in the table directory which does not match the number of buckets.