MapReduce, Spark, Java, and Scala for Data Algorithms Book
OTHER License
Spark in Action, 2nd edition - chapter 1 - Introduction
Practice tasks in Python programming language using Hadoop, MRJob, PySpark for Big Data Analytics.
The Internals of Apache Spark
Data analytics pipeline built with Apache Spark and Hadoop for processing and analyzing large-sca...
The Internals of Spark SQL
PySpark-Tutorial provides basic algorithms using PySpark
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
Accompanying code examples for Apache Mahout: Beyond MapReduce. Distributed Algorithm Design.
Apache Spark Course Material
大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。
This repo contains implementations of PySpark for real-world use cases for batch data processing,...
Big-Data with Apache Spark and Python.
Apache Spark™ and Scala Workshops
Scalable Data Science, course sets in big data Using Apache Spark over databricks and their mathe...
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]