A simple Spark TDD example
MIT License
Statistics for this project are still being loaded, please check back later.
Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside...
This repo contains implementations of PySpark for real-world use cases for batch data processing,...
Spark examples
Performance Observability for Apache Spark
A free tutorial for Apache Spark.
Basic framework utilities to quickly start writing production ready Apache Spark applications
A pure python mock of pyspark's RDD
pyspark🍒🥭 is delicious,just eat it!😋😋
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
APACHE SPARK: Data Analysis, Transformation, and Visualisation with PySpark, IPL Data Analysis
Apache Spark - A unified analytics engine for large-scale data processing
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
PySpark-Tutorial provides basic algorithms using PySpark
Apache Spark (PySpark) Practice on Real Data
Includes notes on using Apache Spark in general, notes on using Spark for Physics, how to run TPC...