Mirror of Apache Oozie
Apache-2.0 License
An open source framework for building data analytics applications.
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditab...
Mirror of Apache POI
Apache Spark - A unified analytics engine for large-scale data processing
Apache ZooKeeper
Service for extracting tables from the CCAO system-of-record and uploading them to the Data Depar...
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. Lo...
Apache Tez
Application Observability Platform for the Cloud Native Era
Hadoop deployment made easy: an HA, Kerberos-secured cluster in one command
A distributed data integration framework that simplifies common aspects of big data integration s...
Distributed scheduled job
REST job server for Apache Spark
Apache Software Foundation Parent POM
Mirror of Apache Pig