tez

Apache Tez

APACHE-2.0 License

Stars

472

Committers

Ecosystems: Cordova, Apache Spark, Java, Groovy, Apache Cassandra, Maven

Commit Statistics

Past Year

All Time

Total Commits

2,993

Total Committers

106

Avg. Commits Per Committer

2.76 (Past Year)

28.24 (All Time)

Bot Commits

Issue Statistics

Past Year

All Time

Total Pull Requests

165

Merged Pull Requests

102

Total Issues

Time to Close Issues

N/A

Related Projects

oozie

Mirror of Apache Oozie

14 Sep 2011 708

StreamlineDE-

Welcome to StreamlineDE, an end-to-end data engineering project designed to demonstrate real-time...

07 Sep 2024 0

DataStreamingETL

Utilizing my background and love for Apache Airflow and Data to build a real-time data streaming ...

21 Jun 2024 0

data-engineering-interview-questions

More than 2,000 data engineer interview questions.

08 Aug 2021 1,060

BigData-Interview

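[Big data interview questions] A collection of big-data interview questions gathered online, with the author's own summarized answers. Currently covers interview knowledge summaries for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks.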

12 Aug 2019 1,574

hcatalog

Mirror of Apache HCatalog

14 Apr 2011 61

atlas

Apache Atlas

22 Jul 2017 1,820

Sales-Analytics-Pipeline

Data analytics pipeline built with Apache Spark and Hadoop for processing and analyzing large-sca...

17 Jul 2024 0

sparksql-for-hbase

Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside...

31 Aug 2017 69

Spark-with-Python

Fundamentals of Spark with Python (using PySpark), code examples

20 Aug 2018 328

hdfs-stream-processing

Streaming data processing using Hadoop HDFS, Spark, Kafka, Minio, Elasticsearch

21 Jul 2024 1

pig

Mirror of Apache Pig

21 May 2009 678

Spark-with-Python---My-learning-notes-

ETL pipeline using pyspark (Spark - Python)

13 Mar 2017 106

cdap

An open source framework for building data analytic applications.

02 Aug 2014 735

End-to-end-realtime-data-streaming

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage...

23 May 2024 0