Dataframes powered by a multithreaded, vectorized query engine, written in Rust
OTHER License
Bot releases are visible (Hide)
Published by ritchie46 almost 2 years ago
Thank you to all our contributors for making this release possible!
@alexander-beedie, @ghuls, @ritchie46 and @stinodego
Published by github-actions[bot] almost 2 years ago
timedelta
with duration-type arguments (#5487)Series
name when exporting to pandas
(#5498)Thank you to all our contributors for making this release possible!
@alexander-beedie, @braaannigan, @ghuls, @ritchie46, @sorhawell and @zundertj
Published by github-actions[bot] almost 2 years ago
IPython
users (#5477)fill_null
with temporal literals (#5440)DataFrame
and LazyFrame
API docs, misc design improvements (#5433)Thank you to all our contributors for making this release possible!
@alexander-beedie, @dannyvankooten, @ritchie46, @s1ck, @slonik-az, @stinodego and @universalmind303
Published by github-actions[bot] almost 2 years ago
width
property to LazyFrame
(#5431)Series.dot
method and related interop (#5428)DataFrame
init from generators (#5424)Series
init from generator (#5411)Thank you to all our contributors for making this release possible!
@CalOmnie, @alexander-beedie, @ghuls, @ritchie46, @slonik-az, @stinodego and @universalmind303
Published by github-actions[bot] almost 2 years ago
Series
from python range
object (#5397)DataFrame
operators (#5394)Thank you to all our contributors for making this release possible!
@YuRiTan, @alexander-beedie, @braaannigan, @owrior, @ritchie46 and @zundertj
Published by github-actions[bot] almost 2 years ago
Most notable mention this release is the start of Out Of Core support in polars, meaning we are able to process larger than RAM datasets. This is currently supported for parts of queries that read from csv
or parquet
and are limited to select
, filter
, and groupby
operations. Many more operations will follow in next releases.
See https://github.com/pola-rs/polars/pull/5139#issuecomment-1274687634 where we were able to process a 80GB dataset on a laptop with only 16GB RAM.
Thanks to everyone who contributed to another release! 🙌
n_chars
functionality for utf8 strings (#5252)Config
formatting option set_tbl_column_data_type_inline
, fixed reading of env vars, improved interaction between formatting options (#5243)extract
conversion for Time
datatype (#5161)time
object (#5152)date
>> time
cast (add 00:00:00
fast-path) (#5149)ewm_mean(alpha=1)
(#5019)release-drafter
to draft releases with changelogs (#5033)polars-lazy
crate. (#5020)Thank you to all our contributors for making this release possible!
@AlecZorab, @YuRiTan, @alexander-beedie, @cjermain, @dannyvankooten, @dpatton-gr, @egorchakov, @ghuls, @hpux735, @matteosantama, @mcrumiller, @owrior, @ritchie46, @slonik-az, @sorhawell, @stinodego, @thatlittleboy, @universalmind303 and @zundertj
Published by github-actions[bot] almost 2 years ago
Thank you to all our contributors for making this release possible!
@AlecZorab, @alexander-beedie, @ghuls and @ritchie46
Published by github-actions[bot] almost 2 years ago
Thank you to all our contributors for making this release possible!
@alexander-beedie, @ghuls, @ritchie46, @thatlittleboy, @universalmind303 and @zundertj
Published by github-actions[bot] almost 2 years ago
- ~85%
(#5287)n_chars
functionality for utf8 strings (#5252)Config
formatting option set_tbl_column_data_type_inline
, fixed reading of env vars, improved interaction between formatting options (#5243)BatchedCsvReader.next_batches()
is None befor… (#5256)Thank you to all our contributors for making this release possible!
@alexander-beedie, @ghuls, @owrior and @ritchie46
Published by github-actions[bot] about 2 years ago
Thank you to all our contributors for making this release possible!
@ritchie46
Published by github-actions[bot] about 2 years ago
polars_type_to_constructor
works with tz-aware Datetime
dtypes (#5239)tuple[bool, bool]
instead of Sequence[bool]
for Expr.is_between
(#5094)Thank you to all our contributors for making this release possible!
@YuRiTan, @alexander-beedie, @cjermain, @matteosantama, @ritchie46 and @stinodego
Published by github-actions[bot] about 2 years ago
DataFrame
init with Datetime dtypes that specify a timezone (#5174)n_unique()
that can count unique rows or col/expr subsets (#5165)extract
conversion for Time
datatype (#5161)time
object (#5152)list
types are better defined as Sequence
(#5164)Thank you to all our contributors for making this release possible!
@alexander-beedie, @dannyvankooten, @ghuls, @ritchie46 and @sorhawell
Published by github-actions[bot] about 2 years ago
Thank you to all our contributors for making this release possible!
@alexander-beedie, @owrior, @ritchie46 and @slonik-az
Published by github-actions[bot] about 2 years ago
Thank you to all our contributors for making this release possible!
@mcrumiller, @ritchie46 and @zundertj
Published by github-actions[bot] about 2 years ago
Series
init as struct from @dataclass
and annotated NamedTuple
(#5057)from_arrow
(#5065)show_graph
(#5059)release-drafter
to draft releases with changelogs (#5033)Thank you to all our contributors for making this release possible!
@alexander-beedie, @egorchakov, @matteosantama, @ritchie46, @slonik-az, @stinodego and @zundertj
Published by stinodego about 2 years ago
Published by stinodego about 2 years ago
Published by ritchie46 about 2 years ago
This is the release of rust polars 0.24.0. This release comes with a lot of bug fixes, performance improvements and added functionality. The changes that stand out are larger than RAM memory mapping of IPC files and a new common-subplan-optimization that prunes duplicated sub-plan from the query plan and thereby potentially save a lot of duplicated work.
See the 0.14.0 release for all upstream improvements.
Full Changelog: https://github.com/pola-rs/polars/compare/rust-polars-v0.23.0...rust-polars-v0.24.0
Published by ritchie46 about 2 years ago
ljust
and rjust
expressions by @ritchie46 in https://github.com/pola-rs/polars/pull/3603
scan_ipc/parquet
can scan from fsspec sources e.g. s3
. by @ritchie46 in https://github.com/pola-rs/polars/pull/3626
py-polars
by @ryanrussell in https://github.com/pola-rs/polars/pull/3700
polars-lazy
readability improvements by @ryanrussell in https://github.com/pola-rs/polars/pull/3701
DataFrame.hstack()
by @adamgreg in https://github.com/pola-rs/polars/pull/3697
Series
to DataFrame.with_columns()
argument annotation by @adamgreg in https://github.com/pola-rs/polars/pull/3696
contains
check that opts-in to contains_literal
fast-path by @alexander-beedie in https://github.com/pola-rs/polars/pull/3736
arg_where
expression by @ritchie46 in https://github.com/pola-rs/polars/pull/3757
(tpch 2/7) ~5%
faster by @ritchie46 in https://github.com/pola-rs/polars/pull/3774
DataFrame
, LazyFrame
, and Series
by @alexander-beedie in https://github.com/pola-rs/polars/pull/3791
date_range
to produce date
ranges as well as datetime
by @alexander-beedie in https://github.com/pola-rs/polars/pull/3798
chunked_array
readability improvements by @ryanrussell in https://github.com/pola-rs/polars/pull/3810
/polars/polars-core/src/frame/
readability by @ryanrussell in https://github.com/pola-rs/polars/pull/3813
~35-40%
by @ritchie46 in https://github.com/pola-rs/polars/pull/3821
agg_list
/not_aggregated
combination by @ritchie46 in https://github.com/pola-rs/polars/pull/3835
null_probability
functionality for dataframes/series test strategies. by @alexander-beedie in https://github.com/pola-rs/polars/pull/3860
clone
ops by @alexander-beedie in https://github.com/pola-rs/polars/pull/3883
take_every
by @alexander-beedie in https://github.com/pola-rs/polars/pull/3888
tests_parametric
dir by @alexander-beedie in https://github.com/pola-rs/polars/pull/3899
__setitem__
and take
by @stinodego in https://github.com/pola-rs/polars/pull/3910
~3x
by @ritchie46 in https://github.com/pola-rs/polars/pull/3924
with_columns
to allow **kwargs style named expressions by @alexander-beedie in https://github.com/pola-rs/polars/pull/3917
assert_frame_equal
and assert_series_equal
for NaN values by @alexander-beedie in https://github.com/pola-rs/polars/pull/3941
See Also
docstring formatting, quietened the last warnings coming from doctests
by @alexander-beedie in https://github.com/pola-rs/polars/pull/3932
LazyFrame
(efficient computation paths only) by @alexander-beedie in https://github.com/pola-rs/polars/pull/3970
~-22%
by @ritchie46 in https://github.com/pola-rs/polars/pull/4006
orient
type hint by @stinodego in https://github.com/pola-rs/polars/pull/3961
literal
param to string-replace functions, optimized replace
performance in small-string regime (30-80% faster) by @alexander-beedie in https://github.com/pola-rs/polars/pull/4057
orient
argument by @stinodego in https://github.com/pola-rs/polars/pull/4065
>4x
performance improvement by @ritchie46 in https://github.com/pola-rs/polars/pull/4078
ChunkedArray
. by @ritchie46 in https://github.com/pola-rs/polars/pull/4105
fill_nan
preserve name by @ritchie46 in https://github.com/pola-rs/polars/pull/4119
is_in
for categoricals by @ritchie46 in https://github.com/pola-rs/polars/pull/4153
pyproject.toml
by @matteosantama in https://github.com/pola-rs/polars/pull/4211
Full Changelog: https://github.com/pola-rs/polars/compare/rust-polars-v0.22.1...rust-polars-v0.23.0
Published by ritchie46 over 2 years ago
PartitionedWriter
for disk partitioning. by @illumination-k in https://github.com/pola-rs/polars/pull/3331
rolling_{min/max/sum/mean}
prerformance ~3.4x
by @ritchie46 in https://github.com/pola-rs/polars/pull/3444
rolling_var
performance by @ritchie46 in https://github.com/pola-rs/polars/pull/3470
~15x
perf gain. by @ritchie46 in https://github.com/pola-rs/polars/pull/3489
Experimental
Allow rolling_<agg>
expressions to determine window size by another {Date, Datetime}
series. by @ritchie46 in https://github.com/pola-rs/polars/pull/3514
~10x
perf gain (window of 100 elements) by @ritchie46 in https://github.com/pola-rs/polars/pull/3515
sorted_merge_join
by @ritchie46 in https://github.com/pola-rs/polars/pull/3505
~2x json
parsing improvement by @ritchie46 in https://github.com/pola-rs/polars/pull/3588
Full Changelog: https://github.com/pola-rs/polars/compare/rust-polars-v0.21.1...rust-polars-v0.22.1