The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
OTHER License
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
The open source high performance ELT framework powered by Apache Arrow
Dataproc templates and pipelines for solving simple in-cloud data tasks
Open Source Feature Flagging and A/B Testing Platform
The Open Data QnA python library enables you to chat with your databases by leveraging LLM Agents...
Airbyte made simple (no UI, no database, no cluster)
Cloud Code for Visual Studio Code: Issues, Documentation and more
Marketing Analytics Jumpstart consists of an easy, extensible and automated implementation of an ...
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipe...
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack. Leading Reverse ETL and Custom...
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python...
A repo for Apigee X/hybrid samples
Firehose is an extensible, no-code, and cloud-native service to load real-time streaming data fro...
CLI tool for dbt users to simplify creation of staging models (yml and sql) files