[HIVE-9277] Hybrid Hybrid Grace Hash Join - ASF JIRA

We are proposing an enhanced hash join algorithm called “hybrid hybrid grace hash join”.

We can benefit from this feature as illustrated below:

The query will not fail even if the estimated memory requirement is slightly wrong.
Expensive garbage collection overhead can be avoided when the hash table grows.
The join can still be executed with a map join operator even when the small table doesn't fit in memory, because spilling some data from the build and probe sides is still cheaper than shuffling the large fact table.
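
The spilling behaviour described above can be sketched roughly as follows. This is a minimal illustration of the general hybrid grace hash join idea, not the code in the attached patches: the function name, the `num_partitions` and `memory_limit` parameters, and the use of Python lists as stand-ins for spill files are all assumptions made for the example. Memory is counted in rows for simplicity, and recursive repartitioning of a spilled partition that is still too large is omitted.

```python
from collections import defaultdict


def hybrid_grace_hash_join(build_rows, probe_rows, num_partitions=4, memory_limit=4):
    """Inner-join two iterables of (key, value) pairs.

    Hypothetical sketch: memory_limit is the number of build rows allowed
    in memory (assumed >= 0); lists stand in for spill files on disk.
    """
    def partition_of(key):
        return hash(key) % num_partitions

    in_memory = {p: defaultdict(list) for p in range(num_partitions)}
    sizes = {p: 0 for p in range(num_partitions)}  # rows held per in-memory partition
    spilled_build = {}                             # partition -> spilled build rows

    # Build phase: hash the small table, spilling whole partitions when over budget.
    for key, value in build_rows:
        p = partition_of(key)
        if p in spilled_build:
            spilled_build[p].append((key, value))
            continue
        in_memory[p][key].append(value)
        sizes[p] += 1
        while sum(sizes.values()) > memory_limit:
            victim = max(sizes, key=sizes.get)     # spill the largest partition
            table = in_memory.pop(victim)
            spilled_build[victim] = [(k, v) for k, vs in table.items() for v in vs]
            del sizes[victim]

    # Probe phase: join against in-memory partitions right away,
    # and set aside probe rows whose partition was spilled.
    results = []
    spilled_probe = defaultdict(list)
    for key, value in probe_rows:
        p = partition_of(key)
        if p in spilled_build:
            spilled_probe[p].append((key, value))
        else:
            for build_value in in_memory[p].get(key, ()):
                results.append((key, build_value, value))

    # Cleanup phase: reload each spilled partition pair and join it on its own.
    for p, rows in spilled_build.items():
        table = defaultdict(list)
        for key, value in rows:
            table[key].append(value)
        for key, value in spilled_probe.get(p, ()):
            for build_value in table.get(key, ()):
                results.append((key, build_value, value))
    return results
```

In this sketch an underestimated memory budget only causes more partitions to be spilled and joined later; the join still completes, and the large probe side is never shuffled as a whole.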

The design is based on Hadoop’s parallel processing capability and the significant amount of memory available.

High-leveldesignforHybridHybridGraceHashJoinv1.0.pdf (21/Jan/15 18:41, 453 kB, Wei Zheng)
HIVE-9277.01.patch (20/Feb/15 06:48, 89 kB, Wei Zheng)
HIVE-9277.02.patch (20/Feb/15 06:59, 86 kB, Wei Zheng)
HIVE-9277.03.patch (21/Feb/15 01:17, 84 kB, Wei Zheng)
HIVE-9277.04.patch (26/Feb/15 23:51, 95 kB, Wei Zheng)
HIVE-9277.05.patch (28/Feb/15 01:01, 87 kB, Wei Zheng)
HIVE-9277.06.patch (02/Mar/15 20:22, 86 kB, Wei Zheng)
HIVE-9277.07.patch (04/Mar/15 07:22, 88 kB, Wei Zheng)
HIVE-9277.08.patch (12/Mar/15 01:01, 114 kB, Wei Zheng)
HIVE-9277.13.patch (18/Mar/15 23:09, 138 kB, Wei Zheng)
HIVE-9277.14.patch (19/Mar/15 21:48, 138 kB, Wei Zheng)
HIVE-9277.15.patch (21/Mar/15 01:25, 138 kB, Wei Zheng)

is related to

HIVE-10287 Implement Hybrid Hybrid Grace Hash Join for Spark Branch [Spark Branch]

HIVE-10123 Hybrid grace Hash join : Use estimate key count from stats to initialize BytesBytesMultiHashMap

HIVE-10284 enable container reuse for grace hash join

HIVE-9789 Hybrid Hybrid Grace Hash Join: improve hashtable serialization

HIVE-9790 Hybrid Hybrid Grace Hash Join: improve side file serialization

HIVE-10072 Add vectorization support for Hybrid Grace Hash Join

HIVE-10403 Add n-way join support for Hybrid Grace Hash Join

is required by

HIVE-11306 Add a bloom-1 filter for Hybrid MapJoin spills

links to

Review Board
