PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
CC0-1.0 License
WASP is a framework to build complex real time big data applications. It relies on a kind of Kapp...
Spark data source for Cognite Data Fusion
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for P...
Make Structs Easy (MSE)
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
🐍 Quick reference guide to common patterns & functions in PySpark.
OSM planet dump high performance data loader. Transform OpenStreetMap World/Region PBF dump into ...
A structured streaming was applied to the robot data from ROS-Gazebo simulation environment using...
pyspark methods to enhance developer productivity 📣 👯 🎉
Data Sketches for Apache Spark
The Almaren Framework provides a simplified consistent minimalistic layer over Apache Spark. Whil...
A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
Rapid ETL/ELT-connectors/pipeline development leveraged on top of Apache Spark
A library for building structured LLM responses with Spark
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is c...