BigQuery Ecosystem

Google BigQuery enables companies to handle large amounts of data without having to manage infrastructure. Google’s documentation describes it as a « serverless architecture (that) lets you use SQL queries to answer your organization’s biggest questions with zero infrastructure management. BigQuery’s scalable, distributed analysis engine lets you query terabytes in seconds and petabytes in minutes. » Its client libraries allow the use of widely known languages such as Python, Java, JavaScript, and Go. Federated queries are also supported, making it flexible to read data from external sources.

📖 A highly rated canonical book on it is « Google BigQuery: The Definitive Guide », a comprehensive reference. Another enriching read on the subject is the inside story told in the article by the founding product manager of BigQuery celebrating its 10th anniversary.

Released
May 19, 2010
Community Repos
1,600
Total GitHub Stars
171,040
Core Projects
More
Sample cloud-first application with 10 microservices showcasing Kubernetes, Istio, and gRPC
terraformer
11,825
CLI tool to generate terraform files from existing infrastructure (reverse Terraform)
Labs and demos for courses for GCP Training (http://cloud
Popular Projects 
More

cube

📊 Cube — The Semantic Layer for Building Data Applications

16 Sep 2018 17,825

redash

Make Your Company Data Driven

28 Oct 2013 25,090

cloudquery

The open source high performance ELT framework powered by Apache Arrow

18 Nov 2020 5,575

sqlfluff

A modular SQL linter and auto-formatter with support for multiple dialects and templated code

01 Nov 2018 7,226

bigrquery

An interface to Google's BigQuery from R

22 May 2013 512

growthbook

Open Source Feature Flagging and A/B Testing Platform

07 May 2021 6,084

briefer

Dashboards and notebooks in a single place

09 Sep 2024 3,168

airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses

27 Jul 2020 13,821

google-cloud-php

Google Cloud Client Library for PHP

04 Oct 2015 1,080

python-bigquery-pandas

Google BigQuery connector for pandas

08 Feb 2017 424

nodejs-bigquery

Node

26 Jul 2017 458

python-bigquery-sqlalchemy

SQLAlchemy dialect for BigQuery

07 Jun 2017 423

graphql-engine

Blazing fast, instant realtime GraphQL APIs on your DB with fine grained access control, also trigger webhooks on database events

18 Jun 2018 30,790

ibis

the portable Python dataframe library

17 Apr 2015 4,709

dataform

Dataform is a framework for managing SQL based data operations in BigQuery

03 Sep 2018 837

rudder-server

Privacy and Security focused Segment-alternative, in Golang and React

19 Jul 2019 3,910

bigquery-emulator

BigQuery emulator server implemented in Go

20 Jun 2022 804

scio

A Scala API for Apache Beam and Google Cloud Dataflow

26 Mar 2015 2,520

ingestr

ingestr is a CLI tool to copy data between any databases with a single command seamlessly

12 Feb 2024 2,315

elementary

The dbt-native data observability solution for data & analytics engineers

30 Aug 2021 1,725
Up and Coming Projects 
More

promptweaver

PromptWeaver streamlines prompt development and management in Generative AI workflows

09 Sep 2024 5

briefer

Dashboards and notebooks in a single place

09 Sep 2024 3,168

bigquery-helper

A helper package for Google BigQuery operations

29 Aug 2024 7

vertex-ai-creative-studio

Creative Studio is a Vertex AI generative media example user experience to highlight the use of Imagen and other generative media APIs on Google Cloud

15 Aug 2024 8

Google-Cloud-Professional-Data-Engineer-ACompleteGuide

This Repo contains all study, lab and supportive materials for Udemy course on "Google Cloud Professional Data Engineer - A Complete Guide"

11 Aug 2024 3

energy-data-analysis

This is the cloud model analyzing real world dataset with BigQuery and other big-data analyzing tools

04 Aug 2024 0

terraform-gcp-datadog-integration

Terraform code to make the Google Cloud Platform to Datadog log collection integration easier

04 Aug 2024 5

terraform-google-apphub

Creates and manages AppHub resources

31 Jul 2024 1

reCAPTCHA-PLD

29 Jul 2024 4

ra_dbt_to_dataform

An open-source tool that partially automates the migration of dbt packages to Dataform

27 Jul 2024 4

pypi-package-stats

Project for ingest pypi packages data from BigQuery and send to DataDog for analysis and insights with dashboards, monitors and more

25 Jul 2024 0

tag-automator

Simple tool to easily visualize and manage tag bindings across organization, folder or project

18 Jul 2024 5

bq-schema-sync

A tool to synchronize BigQuery table schemas with local definitions

11 Jul 2024 0

purdue

The internship was broadly to understand if the topics/events are being covered differently in the different countries and how they affect stock market returns

26 Jun 2024 0

accelerated-platforms

26 Jun 2024 5

bigquery-jupyter-plugin

BigQuery JupyterLab Plugin

24 Jun 2024 2