[HIVE-15899] Make CTAS with acid target table and insert into acid_tbl select ... union all ... work - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 3.0.0
Component/s: Transactions
Labels:
None

Target Version/s:

3.0.0

Description

Consider:

create table T stored as ORC TBLPROPERTIES('transactional'='true') as
      select a, b from A where a <= 5 union all select a, b from B where a >= 5

and

create table T (a int, b int) stored as ORC  TBLPROPERTIES ('transactional'='false';
insert into T(a,b) select a, b from T where a between 1 and 3 group by a, b union all select a, b from A where a between 5 and 7 union all select a, b from B where a >= 9

On Tez, there is an optimization that removes Union All operator writes the data into
subdirectories of T (in this case T is unpartitioned).

This also happens on MR but requires

hiveConf.setBoolVar(HiveConf.ConfVars.HIVE_OPTIMIZE_UNION_REMOVE, true);
hiveConf.setVar(HiveConf.ConfVars.HIVEFETCHTASKCONVERSION, "none");

Need to ensure that when target table is Acid, we generate unique ROW__IDs
When target is not acid, that we can convert it to Acid via Alter Table even when data layout includes subdirectories.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HIVE-15899.01.patch
09/Sep/17 02:46
30 kB
Eugene Koifman
HIVE-15899.02.patch
09/Sep/17 16:02
30 kB
Eugene Koifman
HIVE-15899.03.patch
11/Sep/17 14:23
30 kB
Eugene Koifman
HIVE-15899.04.patch
12/Sep/17 02:26
43 kB
Eugene Koifman
HIVE-15899.05.patch
15/Sep/17 03:33
55 kB
Eugene Koifman
HIVE-15899.07.patch
16/Sep/17 00:33
68 kB
Eugene Koifman
HIVE-15899.08.patch
16/Sep/17 16:10
68 kB
Eugene Koifman
HIVE-15899.09.patch
16/Sep/17 19:22
69 kB
Eugene Koifman
HIVE-15899.10.patch
17/Sep/17 04:06
72 kB
Eugene Koifman
HIVE-15899.11.patch
19/Sep/17 00:33
80 kB
Eugene Koifman
HIVE-15899.12.patch
19/Sep/17 16:07
91 kB
Eugene Koifman
HIVE-15899.13.patch
19/Sep/17 22:51
91 kB
Eugene Koifman

Issue Links

is blocked by

HIVE-17204 support un-bucketed tables in acid

Resolved

is related to

HIVE-18021 Insert overwrite on acid table with Union All optimizations

Open

HIVE-16177 non Acid to acid conversion doesn't handle _copy_N files

Closed

links to

Review Board

Activity

People

Assignee:: Eugene Koifman

Reporter:: Eugene Koifman

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 13/Feb/17 22:38

Updated:: 22/May/18 23:59

Resolved:: 20/Sep/17 17:48