datasketches-bigquery

Apache datasketches

APACHE-2.0 License

Stars
2
Committers
3

Apache DataSketches library functions for Google BigQuery

User-Defined Aggregate Functions (UDAFs) and non-aggregate (scalar) functions (UDFs) for BigQuery SQL engine.

Please visit the main Apache DataSketches website for more information about DataSketches library.

If you are interested in making contributions to this project please see our Community page for how to contact us.

Requirements

Building, Installing, and Testing

  • Requires setting environment variables
    • GCS_BUCKET: to hold compiled artifacts
    • BQ_DATASET: location of stored SQL functions (routines)
make          # performs compilation
make install  # upload to $GCS_BUCKET & create functions in $BQ_DATASET
make test     # runs predefined tests in BQ

The above steps can be executed in the root directory to install everything, or can be run from an individual sketch directory to install only that particular sketch.

Examples

For examples see test directories in individual sketch directories.