Dataframes powered by a multithreaded, vectorized query engine, written in Rust
OTHER License
Published by ritchie46 over 2 years ago
num_cpus from polars by @dandxy89 in https://github.com/pola-rs/polars/pull/2890
polars-time. by @ritchie46 in https://github.com/pola-rs/polars/pull/2918
value_counts and unique_counts expression by @ritchie46 in https://github.com/pola-rs/polars/pull/2947
n param to 1 by @cnpryer in https://github.com/pola-rs/polars/pull/3090
n and frac are both passed by @cnpryer in https://github.com/pola-rs/polars/pull/3091
~2-25x improvement by @ritchie46 in https://github.com/pola-rs/polars/pull/3143
semi and anti joins. by @ritchie46 in https://github.com/pola-rs/polars/pull/3149
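For readers unfamiliar with the semi and anti joins mentioned above, here is a minimal sketch of their semantics using only the Rust standard library. It is not the polars implementation, and the function names `semi_join` and `anti_join` are hypothetical helpers invented for illustration. Both return rows from the left side only, which is what distinguishes them from an inner join.

```rust
use std::collections::HashSet;

/// Semi join (illustrative): keep each left key that has at least one
/// match on the right. Left rows are never duplicated, even when the
/// right side contains the key several times.
fn semi_join(left: &[i32], right: &[i32]) -> Vec<i32> {
    let keys: HashSet<i32> = right.iter().copied().collect();
    left.iter().copied().filter(|k| keys.contains(k)).collect()
}

/// Anti join (illustrative): keep each left key that has no match on the right.
fn anti_join(left: &[i32], right: &[i32]) -> Vec<i32> {
    let keys: HashSet<i32> = right.iter().copied().collect();
    left.iter().copied().filter(|k| !keys.contains(k)).collect()
}
```

With left keys `[1, 2, 2, 3]` and right keys `[2, 3, 3, 4]`, the semi join keeps `[2, 2, 3]` and the anti join keeps `[1]`.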
Full Changelog: https://github.com/pola-rs/polars/compare/rust-polars-v0.20.0...rust-polars-v0.21.
Published by ritchie46 over 2 years ago
This release of 286 commits is here thanks to the contributions of (in no specific order):
Did I forget your contribution? Please ping me; I do this manually 🙈
Most notable changes are:
Made representation of groups tuples more cache friendly #2431
Remove Seek requirement of readers
Add groupby_rolling as a new entrance to the expression API.
Improve CSV parser stability and performance in several places
Horizontal aggregations are parallelized #2454
Reduce pivot code bloat and improve performance #2458
Struct data type added.
Extend methods that allow modification of the same memory if Arc::ref_count == 1
Avro readers and writers.
Improved rules of window expressions.
Support for us (microsecond) time unit.
Parquet: use statistics in query optimizations.
Optimize projections in lazy computations. (Mostly useful when you deal with a large number of columns e.g. millions).
Improve performance and flexibility of melt operation #2799
new expressions
See the 0.10.0 release for all upstream improvements.
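The Extend item above describes modifying memory in place when an Arc has a single owner. As a rough sketch of that idea with the standard library only (not the polars code; `extend_shared` is a hypothetical helper), `Arc::get_mut` succeeds only when the Arc is uniquely owned, and `Arc::make_mut` mutates in place in that case and clones the data otherwise. The standard library exposes the count as `Arc::strong_count`; the `Arc::ref_count` in the note is presumably shorthand for it.

```rust
use std::sync::Arc;

/// Append `extra` to a shared buffer (illustrative helper, not a polars API).
/// Returns true when the existing allocation was modified in place,
/// false when the data had to be cloned first because it was shared.
fn extend_shared(buf: &mut Arc<Vec<i32>>, extra: &[i32]) -> bool {
    // `get_mut` yields `Some` only if no other strong or weak references exist.
    let in_place = Arc::get_mut(buf).is_some();
    // `make_mut` mutates in place when unique, and clones the inner data otherwise.
    Arc::make_mut(buf).extend_from_slice(extra);
    in_place
}
```

Calling `extend_shared` on a freshly created `Arc<Vec<i32>>` modifies it in place; after `Arc::clone`, the same call copies the data first, so the other handle keeps seeing the old contents.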