Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython /...
Notes on Apache Spark (pyspark)
Use dplyr to analyze Big Data
A repository to keep track of all the code that I end up writing for my blog posts.
PySpark-Tutorial provides basic algorithms using PySpark
Practice tasks in Python programming language using Hadoop, MRJob, PySpark for Big Data Analytics.
Big-Data with Apache Spark and Python.
大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。
Apache Spark Machine Learning project using MLlib and Linear Regression on Databricks!
Fundamentals of Spark with Python (using PySpark), code examples
Scalable Data Science, course sets in big data Using Apache Spark over databricks and their mathe...
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Data analytics pipeline built with Apache Spark and Hadoop for processing and analyzing large-sca...
Implementing core components of a data-driven architecture using Spark: Data Management and Data ...
Accompanying code examples for Apache Mahout: Beyond MapReduce. Distributed Algorithm Design.