Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS
Apache-2.0 License
Apache Software Foundation Parent POM
A collection of tools and best practices to take ShardingSphere into the cloud
Dataproc templates and pipelines for solving simple in-cloud data tasks
This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. T...
Mirror of Apache Myriad (Incubating)
Apache Maven Project Parent POMs
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Ka...
An open source framework for building data analytic applications.
Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverle...
Apache NiFi - MiNiFi C++
CloudStack Terraform Provider
A full-featured license tool to check and fix license headers and resolve dependencies' licenses.
A Grafana-based application to assist Big Data infrastructure optimization initiatives where Spar...
Official Kubernetes operator for Apache Solr
Harry for Apache Cassandra®