Optimized joins using Bloom filters on Hadoop via Cascading.
Apache-2.0 License
hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE
Provides support to increase developer productivity in Java when using Apache Cassandra. Uses fa...
Apache Spark - A unified analytics engine for large-scale data processing
The Hypersistence Utils library (previously known as Hibernate Types) gives you Spring and Hibern...
Dead-simple vertical partitioning, compression, appends, and consolidation of data on a distribut...
Taps for Cascalog
Apache Nutch is an extensible and scalable web crawler
Mirror of Apache HBase Third Party Libs
Apache Hive
HBase as a JSON Document Database
Adapters to write to ElephantDB using Cascading
Apache Causeway™ software is a framework for rapidly developing domain-driven apps in Java. This ...
Distributed database specialized in exporting key/value data from Hadoop
HiBench is a big data benchmark suite.