A lightweight operator to automatically expose Spark UI manage its ingress when running Spark on Kubernetes
MIT License
Apache Spark docker image
REST job server for Apache Spark
Operator for managing the Spark clusters on Kubernetes and OpenShift.
Master's thesis on Big Data
Jupyter magics and kernels for working with remote Spark clusters
End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to sche...
Apache Spark Kubernetes Operator
A Python package to submit and manage Apache Spark applications on Kubernetes.
This construct builds some elements for you to quickly launch an EMR Serverless application. Afte...
The gateway component to make Spark on K8s much easier for Spark users.
Spark examples
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. T...
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames ...
Apache Cloudstack Kubernetes Provider