btree4j

Disk-based B+-tree written in Pure Java

APACHE-2.0 License

Stars

212

Committers

View Code on GitHub

Ecosystems: Java

btree4j: Disk-based Prefix B+-tree written in Pure Java

What's Btree4j

Btree4j is a disk-based Prefix B+-tree written in Pure Java.

It's pretty fast and 100k ops/sec is expected even on laptop.

Using btree4j

<dependency>
    <groupId>io.github.myui</groupId>
    <artifactId>btree4j</artifactId>
    <version>0.9.1</version>
</dependency>

Find usage in unit tests.

Features and Strength

Applied many improvements over the original Xindice's implementation as follows:

Implementes Prefix B+-tree in which prefixes are selected carefully to minimize their length. In prefix B+-tree, key prefixes are managed by a TRIE-like smart algorithm.

Rudolf Bayer and Karl Unterauer. "Prefix B-trees", Proc. ACM Trans. Database Syst. 2, 1, pp.11-26), March 1977. [DOI]

Pointers are compressed using Variable Byte Codes so that more keys/values are fit in memory.
Support both unique and non-unique indexing. Storing duplicate keys is allowed for non-unique indexing.
Index file based on B+-tree is supported. Multiple values per a key is also supported. BTreeIndex stores <bytes[] KEY, byte[] VALUE> with VALUE stored on distinct data pages and pointers to them are managed by B+-Tree to avoid consuming many disk pages for large values in B+-tree.
Support variable-length key/value
Prefix and range search is supported in addition to exact match using a flexible callback handler. Support LIKE match using wildcards.
Paging (virtual memory) support using LRU cache replacement policy and Freespace management.
Deletion and updates are, of course, supported.
Support efficient Bulk-loading.
Minimum dependencies to external libraries. Runs on Java 8 or later.

Sponsors

No sponsors yet. Will you be the first?

It will be my motivation to continue working on this project.

Credits

Copyright 2006 and onwards Makoto Yui Copyright 1999-2007 The Apache Software Foundation

This software is originally developed for XBird based on Apache Xindice.

Package Rankings

Top 28.41% on Repo1.maven.org

Badges

Extracted from project README

Build Status

License

Related Projects

RoaringBitmap

A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, ...

17 Jun 2013 3,504

SwayDB

Persistent and in-memory key-value storage engine for JVM that scales on a single machine.

11 Feb 2018 293

java-knowledge-mind-map

【🌱🌱Java服务端知识技能图谱】用思维脑图梳理汇总Java服务端知识技能

26 Dec 2018 904

bucket4j

Java rate limiting library based on token-bucket algorithm.

12 Oct 2014 2,260

JCSprout

👨‍🎓 Java Core Sprout : basic, concurrent, algorithm

17 Dec 2017 27,064

JavaKeeper

✍️ Java 工程师必备架构体系知识总结：涵盖分布式、微服务、RPC等互联网公司常用架构，以及数据存储、缓存、搜索等必备技能

03 Sep 2019 1,866

paicoding

⭐️一款好用又强大的开源社区，基于 Spring Boot、MyBatis-Plus、MySQL、Redis、ElasticSearch、MongoDB、Docker、RabbitMQ 等主流技...

06 Jul 2022 2,015

bean-searcher

🔥🔥🔥 A read-only ORM focusing on advanced query, naturally supports joined tables, and avoids DTO/...

13 Jun 2017 1,129

BPlusTree

数据库应用 B+ 树(Java)

treetops

Fast LightGBM tree model inference Java library which is based on ASM dynamic code generation fra...

JavaNotes

🧱 「Java学习」一份涵盖大部分Java程序员所需要掌握的核心知识。JDK 源码分析 & Java 新特性 & Java 并发编程 & Java 虚拟机 & SpringBoot 2.x 系列

28 May 2020 473

jdbi

The Jdbi library provides convenient, idiomatic access to relational databases in Java and other ...

17 Jun 2009 1,966

tree-api

Provide an api for trees

hibernate-search

Hibernate Search: full-text search for domain model

15 Oct 2010 488

prune

A tree library for Java 8 with functional sensibilities.