MapReduce, Spark, Java, and Scala for Data Algorithms Book
This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Spark in Action, 2nd edition - chapter 1 - Introduction
Apache Spark Course Material
Accompanying code examples for Apache Mahout: Beyond MapReduce. Distributed Algorithm Design.
Apache Spark™ and Scala Workshops
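A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more.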
Scalable Data Science, course sets in big data using Apache Spark over Databricks and their mathe...
Data analytics pipeline built with Apache Spark and Hadoop for processing and analyzing large-sca...
Practice tasks in the Python programming language using Hadoop, MRJob, and PySpark for big data analytics.
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
The Internals of Apache Spark
Big data with Apache Spark and Python.
This repo contains implementations of PySpark for real-world use cases for batch data processing,...
The Internals of Spark SQL
PySpark-Tutorial provides basic algorithms using PySpark