Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model which can express both batch and stream transformations.
APACHE-2.0 License
Apache Flink
Visualize column-level data lineage in Spark SQL
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simpl...
Basic framework utilities to quickly start writing production ready Apache Spark applications
Haskell on Apache Spark.
A cohesive & pragmatic framework of FP centric Scala libraries
Apache Software Foundation Parent POM
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to ha...
Apache Flink Stateful Functions
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars cod...
Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing...
Apache Spark - A unified analytics engine for large-scale data processing
Apache Pekko Kafka Connector - Pekko-Connectors is a Reactive Enterprise Integration library for ...
Some Data Science examples using Groovy