lance

Modern columnar data format for ML and LLMs, implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, and PyArrow, with more integrations coming.

APACHE-2.0 License

Downloads
814.7K
Stars
3.8K


lance - v0.2.0 Dataset Versioning, DuckDB extension built with CUDA

Published by eddyxu almost 2 years ago

Highlights

  • Lance Dataset versioning support
  • DuckDB extension supports building against PyTorch with CUDA
  • Revamped README and documentation

What's Changed

Full Changelog: https://github.com/eto-ai/lance/compare/v0.1.5...v0.2.0

lance - v0.1.5 Pandas Extension Type, Jupyter Notebook and Document Improvements

Published by eddyxu almost 2 years ago

What's Changed

Full Changelog: https://github.com/eto-ai/lance/compare/v0.1.4...v0.1.5

What's Changed

Full Changelog: https://github.com/eto-ai/lance/compare/v0.1.3...v0.1.4

lance - Document improvements and bug fixes

Published by eddyxu about 2 years ago

What's Changed

Full Changelog: https://github.com/eto-ai/lance/compare/v0.1.2...v0.1.3

lance - v0.1.2

Published by changhiskhan about 2 years ago

  1. Lance now supports projection of nested columns (e.g., "annotations.name").
  2. A fast path for CountRows gets the record count by reading metadata.
  3. Lance now supports writing optional key-value metadata (pa.Table.schema.metadata).
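As a rough illustration of what projecting a nested column by a dotted path such as "annotations.name" means, here is a minimal plain-Python sketch. It does not use the Lance API: the `project` helper and the sample `row` are hypothetical, and the record is assumed to be a simple nested dict rather than an Arrow struct column.

```python
def project(record, paths):
    """Keep only the requested dotted paths, preserving the nesting.

    Hypothetical helper for illustration; not part of Lance.
    """
    out = {}
    for path in paths:
        parts = path.split(".")
        src = record
        dst = out
        # Walk down to the parent of the requested leaf field.
        for key in parts[:-1]:
            src = src[key]
            dst = dst.setdefault(key, {})
        dst[parts[-1]] = src[parts[-1]]
    return out


# Assumed sample record with one nested struct-like field.
row = {
    "image_id": 7,
    "annotations": {"name": "cat", "bbox": [0, 0, 10, 10]},
}

projected = project(row, ["annotations.name"])
print(projected)  # {'annotations': {'name': 'cat'}}
```

Only the requested leaf is materialized; sibling fields such as "annotations.bbox" are left out, which is the point of projecting a nested column in a columnar format.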

What's Changed

Full Changelog: https://github.com/eto-ai/lance/compare/v0.1.1...v0.1.2

lance - v0.1.1

Published by changhiskhan about 2 years ago

Fix up the Mac wheel to enable extension types on macOS

What's Changed

New Contributors

Full Changelog: https://github.com/eto-ai/lance/compare/v0.1.0...v0.1.1

lance - v0.1.0

Published by changhiskhan about 2 years ago

Highlights

  1. Documentation is now live and a Quickstart Notebook is available
  2. Lance is now integrated with PyTorch and supports multiple workers.
  3. Vision-specific extension types: Box2d provides vectorized IoU, and Image types make it easy to perform IO and convert between bytes, PIL, NumPy, and tensors.
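For readers unfamiliar with IoU (intersection over union), the quantity mentioned for Box2d can be sketched in plain Python as follows. This is not Lance's implementation, which the notes describe as vectorized; the `iou` and `pairwise_iou` names and the (xmin, ymin, xmax, ymax) box layout are assumptions made for illustration.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (xmin, ymin, xmax, ymax).

    Hypothetical helper; the box layout is an assumption.
    """
    # Width and height of the overlap region, clamped at zero when disjoint.
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def pairwise_iou(boxes_a, boxes_b):
    """IoU of every box in boxes_a against every box in boxes_b."""
    return [[iou(a, b) for b in boxes_b] for a in boxes_a]


overlap = iou((0, 0, 2, 2), (1, 1, 3, 3))  # overlap area 1, union area 7
print(overlap)
```

A vectorized version would compute the same quantity over whole arrays of boxes at once instead of looping pair by pair.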

What's Changed

Full Changelog: https://github.com/eto-ai/lance/compare/v0.0.5...v0.1.0

lance - Support extension types, fixed size list and fixed size binary

Published by eddyxu about 2 years ago

What's Changed

Full Changelog: https://github.com/eto-ai/lance/compare/v0.0.4...v0.0.5

lance - Benchmarks, bug fixes, and writer improvements

Published by changhiskhan about 2 years ago

Adds benchmarks vs. Parquet and raw JSON data.
Fixes #112, which caused Lance datasets to be written with duplicated Arrow buffers.
Adds support for large binary, boolean, and temporal types.

What's Changed

Full Changelog: https://github.com/eto-ai/lance/compare/v0.0.3...v0.0.4