A personal project that builds an end-to-end data pipeline using the 2024 Olympics data.
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage...
Extract data from many databases of Labor, Invalids and Social Affairs sectors and convert to app...
Master's thesis on Big Data
Data Lakehouse local stack with PySpark, Trino, and Minio. Includes an example to process Raygun ...
Udacity Data Engineering Nanodegree Program, Data Pipeline with Airflow project using MinIO and P...
Nyc_Taxi_Data_Pipeline - DE Project
Welcome to the Spotify Insights Data Pipeline Project where I analyze data from my Spotify listen...
End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to sche...
Udacity Data Engineering Nano Degree (DEND)
Welcome to StreamlineDE, an end-to-end data engineering project designed to demonstrate real-time...
Utilizing my background and love for Apache Airflow and Data to build a real-time data streaming ...
This is a comprehensive solution for real-time football analytics, leveraging Apache Spark execut...
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
This project integrates real-time data processing and analytics using Apache NiFi, Kafka, Spark, ...