Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Apache-2.0 License
Code-Native Data Pipelines
Simple Python client for interacting with Google BigQuery.
Visualize column-level data lineage in Spark SQL
Find data quality issues and clean your data in a single line of code with a Scikit-Learn compati...
High-level wrapper around BCP for high performance data transfers between pandas and SQL Server. ...
This library is inspired by the Great Expectations library. The library has made the various expe...
Simple GraphQL Client
CLI tool for dbt users that simplifies the creation of staging model files (YAML and SQL)
Molecular Processing Made Easy.
Software and instructions for setting up and running a self-driving lab (autonomous experimentati...