Big Data Ecosystem Docker
Dockerizing an Apache Spark Standalone Cluster
Generalist E-Commerce model for testing data pipelines and projects best practices
Desenvolvimento de uma Pipeline de Dados utilizando Azure Synapse
Projeto que completa a criação de um ambiente para extração, armazenamento e processamento de dad...
ezpz pyspark dev environment with docker
50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Je...
base docker compose to setup the data engineering env in local
https://spark.apache.org/docs/latest/sql-getting-started.html#starting-point-sparksession
Hadoop deployment made easy: HA and Kerberos secured cluster in 1 command
Master's thesis on Big Data
Study project for big data (Hadoop, Zookeeper, Kafka, Flink, Spark)
Learn and understand Docker&Container technologies, with real DevOps practice!
Welcome to StreamlineDE, an end-to-end data engineering project designed to demonstrate real-time...
This project integrates real-time data processing and analytics using Apache NiFi, Kafka, Spark, ...
Personal docker images for various data science software stacks