Accompanying code examples for Apache Mahout: Beyond MapReduce. Distributed Algorithm Design.
Spark in Action, 2nd edition - chapter 1 - Introduction
A collection of test cases and related reference material gathered while working with Scala and Spark
A repository to keep track of all the code that I end up writing for my blog posts.
Mirror of Apache Mahout
The Internals of Apache Spark
Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters.
The Internals of Spark Structured Streaming
The Internals of Spark SQL
Mirror of Apache Samoa (Incubating)
Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteo...
PySpark-Tutorial provides basic algorithms using PySpark
Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside...
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
Apache Spark & Python (PySpark) tutorials for Big Data Analysis and Machine Learning as IPython /...