Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
APACHE-2.0 License
📚 Awesome list for Data Lake
More than 2000+ Data engineer interview questions.
Utilizing my background and love for Apache Airflow and Data to build a real-time data streaming ...
Includes notes on using Apache Spark in general, notes on using Spark for Physics, how to run TPC...
Apache ShenYu is a Java native API Gateway for service proxy, protocol conversion and API governa...
Hadoop deployment made easy: HA and Kerberos secured cluster in 1 command
A library that brings useful functions from various modern database management systems to Apache ...
YTsaurus is a scalable and fault-tolerant open-source big data platform.
Apache Kyuubi Shaded Dependencies.
Fundamentals of Spark with Python (using PySpark), code examples
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehou...
Apache Kylin
Kylo is a data lake management software platform and framework for enabling scalable enterprise-c...
This project integrates real-time data processing and analytics using Apache NiFi, Kafka, Spark, ...
Dockerizing an Apache Spark Standalone Cluster