The Apache Gora open source framework provides an in-memory data model and persistence for big data.
APACHE-2.0 License
Fundamentals of Spark with Python (using PySpark), code examples
More than 2,000 data engineer interview questions.
📚 Awesome list for Data Lake
(Archived) Please visit the "apache/hugegraph" repo instead.
Apache HoraeDB (Incubating) Golang Client.
Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside...
Self-managed third-party dependencies for Apache Doris
A distributed data integration framework that simplifies common aspects of big data integration s...
An open source framework for building data analytic applications.
Apache Doris is an easy-to-use, high-performance, and unified analytics database.
Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing...
A large-scale entity and relation database supporting aggregation of properties
GoCQL Driver for Apache Cassandra®
Mirror of Apache HCatalog
YTsaurus is a scalable and fault-tolerant open-source big data platform.