# Tutorial for playing with Mahout's Spark shell

## Description

I have created a tutorial for setting up the Spark shell and implementing a simple linear regression algorithm. I'd love to make this part of the website. Could someone give it a review?

https://github.com/sscdotopen/krams/blob/master/linear-regression-cereals.md

PS: If you want to try out the code, you have to apply the patch from MAHOUT-1532 to your sources.

## Activity

Sebastian Schelter added a comment -

Updated tutorial to also mention caching.

Andrew Palumbo added a comment -

Sebastian Schelter, I followed the tutorial step by step and everything worked without any issues. I found it very easy to follow. The cut-and-paste OLS example worked for me right away. Very nice!

Dmitriy Lyubimov added a comment -

This is super cool.

One note I would add is that X is assumed to be tall and skinny (so that X'X fits in memory but X does not). Otherwise it looks like everything happens in-core, which is not really the case.
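To illustrate the tall-and-skinny point, here is a minimal sketch in plain Python. It is not Mahout's API or the tutorial's code; the function names and the toy data are hypothetical. It assumes the rows of X can be streamed one at a time, so only the small k x k matrix X'X and the k-vector X'y are ever held in memory, and the in-core solve runs only on those.

```python
def accumulate_normal_equations(rows):
    """Stream (features, target) pairs, keeping only X'X (k x k) and X'y (k) in memory."""
    xtx, xty = None, None
    for x, y in rows:
        if xtx is None:
            k = len(x)
            xtx = [[0.0] * k for _ in range(k)]
            xty = [0.0] * k
        for i in range(k):
            xty[i] += x[i] * y
            for j in range(k):
                xtx[i][j] += x[i] * x[j]
    return xtx, xty


def solve(a, b):
    """Solve the small dense system a * beta = b (Gaussian elimination, partial pivoting)."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix [a | b]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    beta = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(m[i][j] * beta[j] for j in range(i + 1, n))
        beta[i] = (m[i][n] - s) / m[i][i]
    return beta


def toy_rows(n):
    """Hypothetical data: y = 2 + 3 * x, with a leading 1.0 as the bias feature."""
    for i in range(n):
        x = float(i)
        yield [1.0, x], 2.0 + 3.0 * x


# X has 1000 rows here but is never materialized; only a 2 x 2 matrix and a 2-vector are.
xtx, xty = accumulate_normal_equations(toy_rows(1000))
beta = solve(xtx, xty)
print(beta)
```

In a distributed setting the accumulation step would be the part spread across workers, while the final solve stays on a single machine because its size depends only on the number of columns.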

Dmitriy Lyubimov added a comment -

We should probably also modify the shell so that this import is not needed:

`import org.apache.mahout.math.Vector`

Sebastian Schelter added a comment -

Added to the website. I also added a new top navigation point called "Spark". Shout if you don't like that naming.

Dmitriy Lyubimov added a comment -

Sebastian Schelter, do you mind if I rewrite the math symbols in LaTeX/MathJax?

Sebastian Schelter added a comment -

No, go ahead, that's a great idea.

Dmitriy Lyubimov added a comment -

Done in staging, but for some reason it doesn't publish the site for me. CMS infra problems again, perhaps. Staging looks fine.

