Apache Spark is a fast and general cluster computing system. It provides high-level APIs in Scala, Java, Python and R, and an optimized engine that supports general computation graphs. It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, MLLib for machine learning, GraphX for graph processing, and Spark Streaming.
For more information, see:
The Spark Homepage
The Spark Wiki and How to Contribute to Spark
Spark's Github Repository

Activity Stream

Project Lead
matei Matei Zaharia
Last week most active
hyukjin.kwon jiangxb1987 smilegator actuaryzhang q79969786