A Spark library for Amazon SageMaker.
APACHE-2.0 License
WASP is a framework for building complex real-time big data applications. It relies on a kind of Kapp...
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks...
A Spark plugin for reading and writing Excel files
Simple and Distributed Machine Learning
Fundamentals of Spark with Python (using PySpark), code examples
Service for extracting tables from the CCAO system-of-record and uploading them to the Data Depar...
ETL pipeline using PySpark (Spark - Python)
This construct builds some elements for you to quickly launch an EMR Serverless application. Afte...
A library for building structured LLM responses with Spark
Spark data source for Cognite Data Fusion
Policy diffusion in the US legislature
The Almaren Framework provides a simplified, consistent, minimalistic layer over Apache Spark. Whil...
Apache Spark Machine Learning project using MLlib and Linear Regression on Databricks!
REST web service for true real-time scoring (<1 ms) of Scikit-Learn, R, and Apache Spark models
This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. T...