[SYSTEMDS-1009] Avoid spark context creation on parfor optimization - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: SystemML 0.11
Component/s: None
Labels:
None

Description

Currently, every parfor script triggers the lazy spark context creation, independent of its input data size and script in order to obtain memory budgets and parallelism. On small data the the spark context creation dominates end-to-end execution time. We should improve this to a configuration-only analysis, which would avoid the context creation.

For example, here are the XS and S performance results for univariate statistics:

UnivariateStatistics on mbperftest/bivar/A_10k/data: 14
UnivariateStatistics on mbperftest/bivar/A_10k/data: 14
UnivariateStatistics on mbperftest/bivar/A_10k/data: 17
UnivariateStatistics on mbperftest/bivar/A_10k/data: 16

UnivariateStatistics on mbperftest/bivar/A_100k/data: 14
UnivariateStatistics on mbperftest/bivar/A_100k/data: 15
UnivariateStatistics on mbperftest/bivar/A_100k/data: 14
UnivariateStatistics on mbperftest/bivar/A_100k/data: 17

Attachments

Issue Links

Is contained by

SYSTEMDS-1010 Perftest 0.11 release and related improvements

Resolved

Activity

People

Assignee:: Matthias Boehm

Reporter:: Matthias Boehm

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 04/Oct/16 04:01

Updated:: 05/Oct/16 02:00

Resolved:: 05/Oct/16 00:40