Apache Spark on AWS Lambda
APACHE-2.0 License
Spark examples
Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside...
A Python package to submit and manage Apache Spark applications on Kubernetes.
A tool for monitoring and tuning Spark jobs for efficiency.
Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
This construct builds some elements for you to quickly launch an EMR Serverless application. Afte...
ETL pipeline using pyspark (Spark - Python)
Easy to use library to bring Tensorflow on Apache Spark
This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. T...
Haskell on Apache Spark.
Apache Spark - A unified analytics engine for large-scale data processing
scala、spark使用过程中,各种测试用例以及相关资料整理
A simple Spark-powered ETL framework that just works 🍺
A RPC framework leveraging Spark RPC module