Mirror of Apache Pig
APACHE-2.0 License
crawl GooglePlay data with Nutch, ETL with Pig, analyze with Hive
Apache Tez
Apache Hive
Mirror of Apache Oozie
Apache Spark - A unified analytics engine for large-scale data processing
Personal development repository to prepare contributions and patches for Apache Mahout
Fork of Paper for 1.8.8 focused on improved performance and stability.
Apache Iceberg
Apache Beam is a unified programming model for Batch and Streaming data processing.
A collection of Hadoop Pig utilities.
Apache Flink