A Spark Atlas connector to track data lineage in Apache Atlas
APACHE-2.0 License
Visualize column-level data lineage in Spark SQL
A simple Spark-powered ETL framework that just works 🍺
DataStax Connector for Apache Spark to Apache Cassandra
Make Structs Easy (MSE)
Fundamentals of Spark with Python (using PySpark), code examples
ETL pipeline using pyspark (Spark - Python)
This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. T...
Apache Spark - A unified analytics engine for large-scale data processing
Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside...
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
A Spark plugin for reading and writing Excel files
Extensible streaming ingestion pipeline on top of Apache Spark
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to ha...
A tool for monitoring and tuning Spark jobs for efficiency.
Spark Structured Streaming State Tools