pyspark methods to enhance developer productivity 📣 👯 🎉
APACHE-2.0 License
Apache Spark Connect Client for Rust
A whitespace formatter for different query languages
Quill for Scala 3
A library for building structured LLM responses with Spark
Data Sketches for Apache Spark
ORM for Apache Spark and DataFrames schema manager
Python SQL Parser and Transpiler
A library to transform Scala product types and Schemes from different systems into other Schemes....
The Almaren Framework provides a simplified consistent minimalistic layer over Apache Spark. Whil...
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for P...
Visualize column-level data lineage in Spark SQL
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars cod...
Make Structs Easy (MSE)
OSM planet dump high performance data loader. Transform OpenStreetMap World/Region PBF dump into ...
🐍 Quick reference guide to common patterns & functions in PySpark.