Apache datasketches
APACHE-2.0 License
Dataproc templates and pipelines for solving simple in-cloud data tasks
Core C++ Sketch Library
High performance native memory access for Java.
Apache Drill Test Framework
Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverle...
macOS development environment setup: Easy-to-understand instructions with automated setup script...
Website for DataSketches.
Apache Atlas
A commandline tool for analysis of big biological data sets for distributed HPC clusters.
Apache Spark - A unified analytics engine for large-scale data processing
Sketch adaptors for Hive.
Java Sketch Characterization Code.
Auxiliary testing data files for Apache GraphAr (Incubating).