Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display
APACHE-2.0 License
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditab...
A full-featured license tool to check and fix license headers and resolve dependencies' licenses.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Apache Maven Doxia Sitetools
High performance data store solution
Apache Software Foundation Parent POM
Apache IoTDB
Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS
Apache Druid: a high performance real-time analytics database.
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a ric...
An open-source storage framework that enables building a Lakehouse architecture with compute engi...
TiSpark is built for running Apache Spark on top of TiDB/TiKV
Dataproc templates and pipelines for solving simple in-cloud data tasks
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
Apache OpenDAL: access data freely.