Apache Spark is a fast and general cluster computing system. It provides high-level APIs in Scala, Java, Python and R, and an optimized engine that supports general computation graphs. It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, MLLib for machine learning, GraphX for graph processing, and Spark Streaming.
For more information, see:
The Spark Homepage
The Spark Wiki and How to Contribute to Spark
Spark's Github Repository
Project Lead
matei Matei Alexandru Zaharia
Last week most active
chengpan Wayne Guo mihailo.milosevic allisonwang-db podongfeng