Practice tasks in the Python programming language using Hadoop, MRJob, and PySpark for Big Data Analytics.
MIT License
Official content for Harvard CS109
Code material for a data science tutorial
Apache Spark Machine Learning project using MLlib and Linear Regression on Databricks!
Some notebook examples related to Apache Spark, IPython / Jupyter, and Zeppelin
This repository contains code files, specifically IPython notebooks, for the assignments in the cou...
DataHunt is a comprehensive collection of essential resources for data science
[WIP] Learning resources and practical tips on how to use Jupyter notebooks for fun & profit.