[SYSTEMDS-845] Compare Performance of LeNet Scripts With & Without Using SystemML-NN - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: SystemML 0.11
Fix Version/s: SystemML 0.11
Component/s: Algorithms, Compiler
Labels:
None

Description

This JIRA issue tracks the comparison of the performance of the LeNet scripts with & without using SystemML-NN. The goal is that they should have equal performance in terms of both accuracy and time. Any difference will be indicate areas of engine improvement.

Scripts:

mnist_lenet-train.dml - LeNet script that does use the SystemML-NN library.
lenet-train.dml - LeNet script that does not use the SystemML-NN library.

Current Status - Forced Singlenode:

Equal performance when running the scripts in standalone mode with the -exec singlenode flag, 20GB of memory, and using data inputs in the SystemML binary format – see run.sh and perf.sh for information.

Results:

Run #1:

Script Time (s) Accuracy

mnist_lenet-train.dml 2987.400704441 99.32%

lenet-train.dml 2816.369435579 99.28%

Run #2:

Script Time (s) Accuracy

mnist_lenet-train.dml 2847.790531812 99.16%

lenet-train.dml 2950.520494210 99.18%

So, same accuracy, and same runtime in singlenode mode!

Current Status - Spark Local:

The two scripts now have the same performance in Spark local mode (non-singlenode), equivalent to the performance in forced singlenode mode due to the creation of only CP jobs!

—

To fully reproduce, I basically created a directory, placed the two attached bash scripts in it, grabbed a copy of the NN library and placed it into the directory, ran the examples/get_mnist_data.sh script from the library to get the data (placed into examples/data), then used the attached convert.dml to create binary copies of the data for both scripts, then ran run.sh. Also, I copied examples/data to the base directory as well. Adjust the EXEC and related variables in perf.sh to switch between standalone, Spark, memory sizes, explain, stats, etc.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

convert.dml
04/Aug/16 21:12
0.4 kB
Mike Dusenberry
lenet-train-spark-explain.log
05/Aug/16 00:14
55 kB
Mike Dusenberry
lenet-train-spark-explain-recompile-hops.log
06/Aug/16 00:17
831 kB
Mike Dusenberry
log08.03.16-1470268602.txt
04/Aug/16 21:12
40 kB
Mike Dusenberry
mnist_lenet-train-spark-explain.log
05/Aug/16 00:14
86 kB
Mike Dusenberry
mnist_lenet-train-spark-explain-recompile-hops.log
06/Aug/16 00:17
3.59 MB
Mike Dusenberry
perf.sh
04/Aug/16 21:12
2 kB
Mike Dusenberry
run.sh
04/Aug/16 21:12
0.2 kB
Mike Dusenberry

Issue Links

relates to

SYSTEMDS-618 Deep Learning DML Library

In Progress

Activity

People

Assignee:: Mike Dusenberry

Reporter:: Mike Dusenberry

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 04/Aug/16 21:01

Updated:: 01/Sep/16 22:27

Resolved:: 01/Sep/16 22:24

Script	Time (s)	Accuracy
mnist_lenet-train.dml	2987.400704441	99.32%
lenet-train.dml	2816.369435579	99.28%