[SPARK-12950] Improve performance of BytesToBytesMap - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.0.0
Component/s: SQL
Labels:
None

Epic Link:
Whole stage codegen

Description

When benchmark generated aggregate with grouping keys, the profiling show that lookup in BytesToBytesMap took about 90% of the CPU time, we should optimize it.

After profiling with jvisualvm, here are the things that take most of the time:

1. decode address from Long to baseObject and offset
2. calculate hash code
3. compare the bytes (equality check)

Attachments

Issue Links

links to

[Github] Pull Request #10877 (viirya)

[Github] Pull Request #11002 (viirya)

[Github] Pull Request #11010 (davies)

Activity

People

Assignee:: Davies Liu

Reporter:: Davies Liu

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 21/Jan/16 05:43

Updated:: 11/Feb/16 09:44

Resolved:: 10/Feb/16 00:44