Accompanying code examples for Apache Mahout: Beyond MapReduce. Distributed Algorithm Design.
The Internals of Apache Spark
MapReduce, Spark, Java, and Scala for Data Algorithms Book
A collection of test cases and related reference material gathered while using Scala and Spark
Spark in Action, 2nd edition - chapter 1 - Introduction
Mirror of Apache Samoa (Incubating)
Mirror of Apache Mahout
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside...
Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteo...
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython /...
The Internals of Spark SQL
A repository to keep track of all the code that I end up writing for my blog posts.
PySpark-Tutorial provides basic algorithms using PySpark
The Internals of Spark Structured Streaming
Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters.