parquet2

Fastest and safest Rust implementation of Parquet. `unsafe`-free. Integration-tested against pyarrow.

License: Other
Downloads: 1.2M
Stars: 352
Committers: 26

parquet2 - v0.17.0 Latest Release

Published by jorgecarleitao almost 2 years ago

A new release is out there!

Thanks to everyone for the fixes and improvements that resulted in this more stable, easier-to-use, and faster version of parquet2!

Breaking changes:

New features:

Fixed bugs:

Enhancements:

parquet2 - v0.16.0

Published by jorgecarleitao about 2 years ago

Yet another release of parquet2, mostly focused on avoiding panics and out-of-memory (OOM) errors. It has no impact on performance but improves reliability.

v0.16.1 (2022-08-17)

Full Changelog

Fixed bugs:

v0.16.0 (2022-08-17)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

Documentation updates:

parquet2 - v0.15.0

Published by jorgecarleitao about 2 years ago

We have a new release of parquet2 available!

Breaking changes:

New features:

  • Added feature flag to use zlib-ng backend for gzip #165 (ritchie46)

Fixed bugs:

Enhancements:

Testing updates:

parquet2 - v0.14.2

Published by jorgecarleitao about 2 years ago

A couple of bug fixes by @jhorstmann and @v0y4g3r.

Full Changelog

Fixed bugs:

Enhancements:

parquet2 - v0.14.1

Published by jorgecarleitao over 2 years ago

A small but important release to support legacy lz4, by @dantengsky 🚀

Full Changelog

New features:

Enhancements:

parquet2 - v0.14.0

Published by jorgecarleitao over 2 years ago

A new release is here and on crates.io! 🎉🎉🎉

Full Changelog

Breaking changes:

  • split_buffer should return Result #156
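
To illustrate why a fallible signature matters here, the following is a minimal sketch and not parquet2's actual code. It assumes a simplified page layout in which each level section is preceded by a 4-byte little-endian length. The names `SplitError`, `take_prefixed`, and `split_buffer_checked` are hypothetical. A truncated or malformed buffer produces an `Err` instead of a panic from an out-of-bounds slice.

```rust
/// Errors that can occur while splitting a page buffer (hypothetical type).
#[derive(Debug, PartialEq)]
pub enum SplitError {
    /// The buffer ended before a length prefix or a section was complete.
    OutOfSpec(&'static str),
}

/// Reads a 4-byte little-endian length prefix and returns the section it
/// describes together with the bytes that follow it.
fn take_prefixed(buffer: &[u8]) -> Result<(&[u8], &[u8]), SplitError> {
    let prefix = buffer
        .get(..4)
        .ok_or(SplitError::OutOfSpec("missing length prefix"))?;
    let length = u32::from_le_bytes([prefix[0], prefix[1], prefix[2], prefix[3]]) as usize;
    let rest = &buffer[4..];
    if length > rest.len() {
        // A panicking version would index past the end here.
        return Err(SplitError::OutOfSpec("section longer than buffer"));
    }
    Ok(rest.split_at(length))
}

/// Splits a page buffer into (repetition levels, definition levels, values).
/// Sections that are absent are returned as empty slices.
pub fn split_buffer_checked(
    buffer: &[u8],
    has_rep: bool,
    has_def: bool,
) -> Result<(&[u8], &[u8], &[u8]), SplitError> {
    let (rep, buffer) = if has_rep {
        take_prefixed(buffer)?
    } else {
        (&buffer[..0], buffer)
    };
    let (def, values) = if has_def {
        take_prefixed(buffer)?
    } else {
        (&buffer[..0], buffer)
    };
    Ok((rep, def, values))
}
```

Callers can then propagate the error with `?` and reject a corrupt file, rather than crashing the process on untrusted input.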

Fixed bugs:

Enhancements:

parquet2 - v0.13.0

Published by jorgecarleitao over 2 years ago

Another release of parquet2 is here!

We can now control the level of both GZIP and BROTLI compression, thanks to @TurnOfACard 🙇

Thank you to everyone who contributed to this release!

Breaking changes:

New features:

  • Added new_with_page_meta to PageReader #136 (ygf11)
  • Added compression options/levels for GZIP and BROTLI codecs. #132 (TurnOfACard)
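
The idea of an optional, validated compression level can be sketched as follows. This is a self-contained illustration, not the crate's API: `CodecChoice` and `resolve_level` are hypothetical names, and the default levels (6 for gzip, 11 for brotli) and maxima (9 and 11) are assumptions made for the example.

```rust
/// A codec together with an optional user-requested level (hypothetical type).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum CodecChoice {
    Uncompressed,
    Gzip(Option<u32>),
    Brotli(Option<u32>),
}

/// Resolves the level to use: the requested one if given, otherwise an
/// assumed default. Levels above the codec's assumed maximum are rejected.
/// Returns `Ok(None)` when no compression is applied.
pub fn resolve_level(choice: CodecChoice) -> Result<Option<u32>, String> {
    let (name, requested, default, max) = match choice {
        CodecChoice::Uncompressed => return Ok(None),
        CodecChoice::Gzip(level) => ("gzip", level, 6, 9),
        CodecChoice::Brotli(level) => ("brotli", level, 11, 11),
    };
    let level = requested.unwrap_or(default);
    if level > max {
        return Err(format!("{} level {} exceeds maximum {}", name, level, max));
    }
    Ok(Some(level))
}
```

Validating the level when the option is built, rather than when a page is compressed, means an invalid setting is reported once, before any data is written.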

Fixed bugs:

  • Async FileStreamer does not write statistics #139
  • Fixed error in compressing lz4raw with large offsets #140 (jorgecarleitao)

Enhancements:

Testing updates:

parquet2 - v0.12.1

Published by jorgecarleitao over 2 years ago

Full Changelog

Fixed bugs:

parquet2 - v0.12.0

Published by jorgecarleitao over 2 years ago

Full Changelog

Breaking changes:

  • Add CompressionOptions, which allows for zstd compression levels. #128 (TurnOfACard)

Enhancements:

parquet2 - v0.11.0

Published by jorgecarleitao over 2 years ago

Here we are for a new release of parquet2. This release has 3 main features:

  • added optional support for LZ4 compression and decompression in WASM builds (via LZ4-flex by @PSeitz)
  • added support to read bloom filters
  • added support to read and write page indexes

A summary of the Full Changelog is available below.

Thank you to everyone who contributed to this release! (Credits to individual PRs are below.)

Breaking changes:

New features:

  • Added support for page-level filter pushdown (indexes) #102
  • Added support for bloom filters #98
  • Added optional support for LZ4 via LZ4-flex crate (thus enabling wasm) #124 (jorgecarleitao)
  • Added support for page-level filter pushdown (column and offset indexes) #107 (jorgecarleitao)
  • Added support to read column and page indexes #100 (jorgecarleitao)
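
As a rough illustration of what page-level filter pushdown buys, the sketch below selects only the pages whose min/max range can overlap a query range. It does not use parquet2's types: `PageStats` and `select_pages` are hypothetical and stand in for the per-page statistics a column index carries and the row positions an offset index carries.

```rust
/// Per-page statistics and row location (hypothetical, for illustration).
#[derive(Debug, Clone, PartialEq)]
pub struct PageStats {
    /// Smallest value stored in the page.
    pub min: i64,
    /// Largest value stored in the page.
    pub max: i64,
    /// Index of the page's first row within the row group.
    pub first_row: usize,
    /// Number of rows in the page.
    pub num_rows: usize,
}

/// Returns `(first_row, num_rows)` for every page that may contain a value
/// in the inclusive range `lo..=hi`. Pages whose range cannot overlap the
/// query are skipped without being read or decompressed.
pub fn select_pages(pages: &[PageStats], lo: i64, hi: i64) -> Vec<(usize, usize)> {
    pages
        .iter()
        .filter(|page| page.max >= lo && page.min <= hi)
        .map(|page| (page.first_row, page.num_rows))
        .collect()
}
```

The selection is conservative: a selected page may still contain no matching row, so the filter must be applied again to the decoded values.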

Fixed bugs:

Enhancements:

Documentation updates:

Testing updates:

parquet2 - v0.10.1

Published by jorgecarleitao over 2 years ago

Full Changelog

Enhancements:

v0.10.0 (2022-02-02)

parquet2 - v0.10.0

Published by jorgecarleitao over 2 years ago

A new release is now available on crates.io.

Breaking changes:

Fixed bugs:

parquet2 - v0.9.2

Published by jorgecarleitao over 2 years ago

Fixed bugs:

parquet2 - v0.9.0

Published by jorgecarleitao almost 3 years ago

A new release is here, with a performance improvement to bitpacking by @danburkert, an important fix to writing data, and one backward-incompatible change that allows serializing and writing to happen on separate thread pools (e.g. via tokio_rayon).

Breaking changes:

Enhancements:

  • bitpacking: use stack-allocated temporary buffer #76 (danburkert)
  • Added constructor to RowGroupMetaData and ColumnChunkMetaData #74 (yjshen)
  • Improved performance of reading multiple pages #73 (jorgecarleitao)

Fixed bugs:

Full Changelog

parquet2 - v0.8.1

Published by jorgecarleitao almost 3 years ago

Full Changelog

Fixed bugs:

parquet2 - v0.8.0

Published by jorgecarleitao almost 3 years ago

Full Changelog

Breaking changes:

Fixed bugs:

parquet2 - v0.7.0

Published by jorgecarleitao almost 3 years ago

Full Changelog

Breaking changes:

  • Use i64s for delta-bitpacked's interface #67 (kornholi)
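
A simplified sketch of delta encoding over `i64` follows, to show why a 64-bit interface is the natural fit. It is not the crate's implementation: it omits the bit-packing and block/miniblock structure of the real encoding, and `DeltaBlock`, `delta_encode`, and `delta_decode` are hypothetical names. Deltas are computed with wrapping arithmetic so that extreme 64-bit values still round-trip.

```rust
/// Simplified delta-encoded block (hypothetical type): the first value,
/// the minimum delta, and each delta minus that minimum.
#[derive(Debug, Clone, PartialEq)]
pub struct DeltaBlock {
    pub first: i64,
    pub min_delta: i64,
    pub adjusted: Vec<u64>,
}

/// Encodes `values`; returns `None` for an empty slice.
/// Note that `n` values yield `n - 1` deltas, an easy place for an
/// off-by-one error.
pub fn delta_encode(values: &[i64]) -> Option<DeltaBlock> {
    let (&first, rest) = values.split_first()?;
    let mut deltas = Vec::with_capacity(rest.len());
    let mut previous = first;
    for &value in rest {
        deltas.push(value.wrapping_sub(previous));
        previous = value;
    }
    let min_delta = deltas.iter().copied().min().unwrap_or(0);
    // Subtracting the minimum makes every stored delta non-negative.
    let adjusted = deltas
        .iter()
        .map(|delta| delta.wrapping_sub(min_delta) as u64)
        .collect();
    Some(DeltaBlock { first, min_delta, adjusted })
}

/// Reconstructs the original values from a block.
pub fn delta_decode(block: &DeltaBlock) -> Vec<i64> {
    let mut out = Vec::with_capacity(block.adjusted.len() + 1);
    let mut current = block.first;
    out.push(current);
    for &stored in &block.adjusted {
        let delta = (stored as i64).wrapping_add(block.min_delta);
        current = current.wrapping_add(delta);
        out.push(current);
    }
    out
}
```

With a 32-bit interface, deltas between distant 64-bit values would not fit, which is presumably the motivation for the change above.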

New features:

Fixed bugs:

  • Fix off-by-one error in delta-bitpacked decoder #66 (kornholi)

parquet2 - v0.6.0

Published by jorgecarleitao about 3 years ago

Another release is here. It contains 3 breaking changes and a couple of bug fixes (backported to 0.5).

Full Changelog

Breaking changes:

Fixed bugs:

Enhancements:

  • Added writing reduced statistics for FixedLenByteArray #55 (potter420)

parquet2 - v0.5.2

Published by jorgecarleitao about 3 years ago

Full Changelog

Fixed bugs:

Enhancements:

  • Added writing reduced statistics for FixedLenByteArray #55 (potter420)

parquet2 - v0.5.1

Published by jorgecarleitao about 3 years ago

Full Changelog

Fixed bugs: