hadoop_system

✅ Sets up a Hadoop ecosystem and builds a data pipeline.


Ecosystems: Apache Spark

Commit Statistics

Avg. Commits Per Committer: 13.67

Issue Statistics

Time to Close Issues: 12 days

Related Projects

cdhproject

Usage of the various Hadoop components, continuously updated

21 Nov 2017 896

hdfs-stream-processing

Streaming data processing using Hadoop HDFS, Spark, Kafka, Minio, Elasticsearch

21 Jul 2024 1

apache-spark-docker

Dockerizing an Apache Spark Standalone Cluster

19 Jul 2021 40

Sales-Analytics-Pipeline

Data analytics pipeline built with Apache Spark and Hadoop for processing and analyzing large-sca...

17 Jul 2024 0

LearningSparkV2

This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

10 Feb 2019 1,178

DIY-A-Cluster

How to Do-It-Yourself A Cluster for Spark & Hadoop

16 Sep 2016 11

learning-hadoop-and-spark

Companion to the Learning Hadoop and Learning Spark courses on LinkedIn Learning

22 Jun 2019 182

sparkini

Base Docker Compose setup for a local data engineering environment

21 Jul 2024 0

pyspark-maestro

This repo contains implementations of PySpark for real-world use cases for batch data processing,...

23 Jul 2024 1

eat_pyspark_in_10_days

pyspark🍒🥭 is delicious, just eat it!😋😋

24 Dec 2020 684

spark

Apache Spark - A unified analytics engine for large-scale data processing

25 Feb 2014 38,255

utils4s

A collection of test cases and related materials gathered while using Scala and Spark

24 Sep 2015 1,089

spark-workshop

Apache Spark™ and Scala Workshops

10 Mar 2016 260

spark-py-notebooks

Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython /...

06 May 2015 1,614

bigdata-playground

A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Stre...

12 Dec 2017 208