Avro Schema Evolution made easy
APACHE-2.0 License
Apache Parquet Format
Spark data source for Cognite Data Fusion
This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. T...
ORM for Apache Spark and DataFrames schema manager
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitate...
Apache Software Foundation Parent POM
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
Spark Connector to read and write with Pulsar
A simple Spark-powered ETL framework that just works 🍺
The Almaren Framework provides a simplified consistent minimalistic layer over Apache Spark. Whil...
A Spark Atlas connector to track data lineage in Apache Atlas
Harry for Apache Cassandra®
Rapid ETL/ELT-connectors/pipeline development leveraged on top of Apache Spark
OSM planet dump high performance data loader. Transform OpenStreetMap World/Region PBF dump into ...
Apache Sling Feature Model Analyser