The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
APACHE-2.0 License
No README available, please check again later.
Quickstart PySpark with Anaconda on AWS/EMR
This code is used to build & run a Docker container for performing predictions against a Spark M...
Apache Spark on AWS Lambda
Reference Implementation for AWS IoT FleetWise
A data engineering training project to build an end-to-end pipline for a real-time processing of ...
This construct builds some elements for you to quickly launch an EMR Serverless application. Afte...
A best practices guide for using AWS EMR. The guide will cover best practices on the topics of co...
Study Guide for AWS Big Data Speciality Certification
The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engine...
👷🌇 Set up and build a big data processing pipeline with Apache Spark, 📦 AWS services (S3, EMR, EC...
This project demonstrates data cleaning, processing with Apache Spark and Apache Flink, both loca...
Best practices and recommendations for getting started with Amazon EMR on EKS.
AWS SDK for the Rust Programming Language
A Terraform module for create ECS on Spot Fleet.