Quickstart PySpark with Anaconda on AWS/EMR
MIT License
Apache Spark Machine Learning project using MLlib and Linear Regression on Databricks!
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
Spark examples
A commandline tool for analysis of big biological data sets for distributed HPC clusters.
The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances...
A simple Spark TDD example
A Grafana-based application to assist Big Data infrastructure optimization initiatives where Spar...
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, Qu...
Explore the capabilities of Amazon EMR Serverless by processing semi-structured review data with ...
This construct builds some elements for you to quickly launch an EMR Serverless application. Afte...