[SPARK-21688] performance improvement in mllib SVM with native BLAS - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Minor
Resolution: Won't Fix
Affects Version/s: 2.2.0
Fix Version/s: None
Component/s: MLlib
Labels:
None
Environment:

4 nodes: 1 master node, 3 worker nodes
model name : Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz
Memory : 180G
num of core per node: 10

Description

in current mllib SVM implementation, we found that the CPU is not fully utilized, one reason is that f2j blas is set to be used in the HingeGradient computation. As we found out earlier (https://issues.apache.org/jira/browse/SPARK-21305) that with proper settings, native blas is generally better than f2j on the uni-test level, here we make the blas operations in SVM go with MKL blas and get an end to end performance report showing that in most cases native blas outperformance f2j blas up to 50%.
So, we suggest removing those f2j-fixed calling and going for native blas if available. If this proposal is acceptable, we will move on to benchmark other algorithms impacted.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

svm-mkl-2.png
10/Aug/17 06:11
419 kB
Vincent
svm-mkl-1.png
10/Aug/17 06:11
290 kB
Vincent
svm2.png
10/Aug/17 06:11
404 kB
Vincent
svm1.png
10/Aug/17 06:11
295 kB
Vincent
native-trywait.png
10/Aug/17 08:25
416 kB
Vincent
mllib svm training.png
10/Aug/17 05:45
7 kB
Vincent
ddot unitest.png
10/Aug/17 08:13
4 kB
Vincent

Issue Links

relates to

SPARK-21305 The BKM (best known methods) of using native BLAS to improvement ML/MLLIB performance

Resolved

links to

[Github] Pull Request #18936 (VinceShieh)

GitHub Pull Request #18936

Activity

People

Assignee:: Unassigned

Reporter:: Vincent

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 10/Aug/17 05:40

Updated:: 28/Dec/18 16:07

Resolved:: 28/Dec/18 16:07