Operator for managing Spark clusters on Kubernetes and OpenShift.
APACHE-2.0 License
Haskell on Apache Spark.
A Python package to submit and manage Apache Spark applications on Kubernetes.
ETL pipeline using PySpark (Spark - Python)
The gateway component to make Spark on K8s much easier for Spark users.
Apache Spark Docker image
An RPC framework leveraging the Spark RPC module
Performance Observability for Apache Spark
A tool for monitoring and tuning Spark jobs for efficiency.
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
A collection of test cases and related materials gathered while using Scala and Spark
This project provides Kotlin bindings and several extensions for Apache Spark. We are looking to ha...
Spark examples
This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. T...
Apache Spark Kubernetes Operator
Apache Spark - A unified analytics engine for large-scale data processing