Daniel Lemire

Daniel Lemire is a computer science professor. His research is on software performance in data engineering.

Ecosystems: Node.js, Java, C++, C, Go, Linux, JSON, macOS, Apache Spark

Projects

RoaringBitmap

A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others

Java - Released: 17 Jun 2013 - 3,504

roaring

Roaring bitmaps in Go (golang), used by InfluxDB, Bleve, DataDog

Go - Released: 10 Jul 2014 - 2,475

simdjson

Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

C++ - Released: 23 Mar 2018 - 18,905

bitset

Go package implementing bitsets

Go - Released: 11 May 2011 - 1,295

JavaFastPFOR

A simple integer compression library in Java

Java - Released: 06 Jul 2012 - 531

fast_float

Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, Redis and WebKit/Safari

C++ - Released: 19 Oct 2020 - 1,452

CRoaring

Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations: used by Apache Doris, ClickHouse, and StarRocks

C - Released: 01 Dec 2015 - 1,468

ada

WHATWG-compliant and fast URL parser written in modern C++, part of Node.js, Clickhouse, Redpanda, Kong, Telegram and Cloudflare Workers.

C++ - Released: 17 Dec 2022 - 1,348

simdutf

Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension. Part of Node.js, WebKit/Safari and Bun.

C++ - Released: 04 Feb 2021 - 1,119

FastPriorityQueue.js

a fast heap-based priority queue in JavaScript

JavaScript - Released: 30 Sep 2015 - 355

javaewah

A compressed alternative to the Java BitSet class

Java - Released: 04 Oct 2010 - 559

Code-used-on-Daniel-Lemire-s-blog

This is a repository for the code posted on my blog

C - Released: 06 Apr 2012 - 796

FastPFor

The FastPFOR C++ library: Fast integer compression

C++ - Released: 26 Jun 2012 - 855

fast_double_parser

Fast function to parse strings into double (binary64) floating-point values, enforces the RFC 7159 (JSON standard) grammar: 4x faster than strtod

C++ - Released: 09 Mar 2020 - 621

EWAHBoolArray

A compressed bitmap class in C++.

C++ - Released: 01 Nov 2010 - 440

rcppsimdjson

Rcpp Bindings for the 'simdjson' Header Library

C++ - Released: 18 Jan 2020 - 116

externalsortinginjava

External-Memory Sorting in Java

Java - Released: 24 Mar 2014 - 250

fastmod

A C/C++ header file for fast 32-bit division remainders (and divisibility tests) on 64-bit hardware.

C++ - Released: 08 Feb 2019 - 290

roaring-node

Roaring for NodeJS

C++ - Released: 03 May 2018 - 108

FastBitSet.js

Speed-optimized BitSet implementation for modern browsers and JavaScript engines

JavaScript - Released: 18 Sep 2015 - 158

docker_programming_station

A simple script to help programmers who want to work within Docker

Shell - Released: 22 May 2020 - 150

fastbase64

SIMD-accelerated base64 codecs

C - Released: 30 Nov 2016 - 422

simdcomp

A simple C library for compressing lists of integers using binary packing

C - Released: 05 Feb 2014 - 480

FastIntegerCompression.js

Fast integer compression library in JavaScript

JavaScript - Released: 14 Apr 2016 - 58

streamvbyte

Fast integer compression in C using the StreamVByte codec

C - Released: 14 Apr 2016 - 364