Word Count using Apache Hadoop 3+
Statistics for this project are still being loaded, please check back later.
This tool extracts word vectors from Lucene index.
Superword is a Java open source project dedicated in the study of English words analysis and auxi...
Fastest word count in Java
自定制的精准短文本搜索服务
A basic generative text algorithm based on Markov Chains. Processes a given text file of words to...
基于Hadoop和HBase的大规模海量数据去重
Reference implementations of data-intensive algorithms in MapReduce and Spark
Projet shavadoop
Apache Tez
jsearch:高性能的全文检索工具包
Java分布式中文分词组件 - word分词
Java example of analyzing twitter data with hadoop map reduce.
HiBench is a big data benchmark suite.
Sonatype helps open source projects to set up Maven repositories on https://oss.sonatype.org/