Dataframes powered by a multithreaded, vectorized query engine, written in Rust
OTHER License
Bot releases are visible (Hide)
Published by github-actions[bot] over 1 year ago
item
method to optionally take row/col indices (#8412)List
dtypes (#8400)Config
object in context-manager context (#8394)Series.is_integer
(#8383)Series
initialised with nested tuple data as Object
dtype (#8401)iter_rows
doesn't return nested Timestamp
values (#8359)__hash__
support to Field
, include "time_zone" in Datetime
hash, fix Struct
hash (#8354)window_size
user input in rolling_expr (#8318)read_excel
(#8300)List
dtypes (#8400)duration
docstring/example (#8392)strptime
(#8345)Thank you to all our contributors for making this release possible!
@JoonHong-Kim, @MarcoGorelli, @StefanBRas, @alexander-beedie, @avimallu, @grantmcdermott, @jonashaag, @rben01, @ritchie46, @stinodego and @universalmind303
Published by github-actions[bot] over 1 year ago
top_k
fast path (#8275)concat_owned_array_unchecked
when possible (#8274)Thank you to all our contributors for making this release possible!
@MarcoGorelli, @alexander-beedie, @ritchie46, @stinodego, @zaynetro and @zundertj
Published by github-actions[bot] over 1 year ago
DataFrame
init from pydantic
model data (#8178)fmt
is provided (#8111)arg_min
/arg_max
(via argminmax
) (#8074)arr.eval
run on groupby expression engine when possible (#8199)DataFrame
init from pydantic
models (#8181)use_earliest
argument to replace_time_zone
for dealing with ambiguous datetimes (#8087)series OP expr
-> pl.lit(series) OP expr
where OP is arithmetic (#8225)LazyFrame
(#8220)DataFrame
init from nested dataclass
, pydantic
, and NamedTuple
objects (#8185)approx_unique()
(#7937)describe
methods (#8169)DataFrame
init from pydantic
model data (#8178)strptime
/strftime
args (#8221)Expr.list
to implode
(#8165)Thank you to all our contributors for making this release possible!
@MarcoGorelli, @alexander-beedie, @avimallu, @borchero, @chitralverma, @clickingbuttons, @ghuls, @josh, @jvdd, @rben01, @ritchie46, @stinodego and @universalmind303
Published by github-actions[bot] over 1 year ago
DataFrame
generation (#8149)pct_change
(#8137)log1p
to list of mathematical functions (#8102)import polars
speed (#8151)map
lenghts (#8147)UInt64
values that exceed Int64
upper bound (#8146)is_in
(#8139)DataFrame
generation (#8149)Thank you to all our contributors for making this release possible!
@alexander-beedie, @borchero, @dependabot, @dependabot[bot], @jonashaag, @ritchie46 and @stinodego
Published by github-actions[bot] over 1 year ago
n
expression passed to Expr.head/tail (#8098)show_versions
util (#8096)scan_parquet/ipc
and fsspec
(#8071)Thank you to all our contributors for making this release possible!
@MarcoGorelli, @StefanBRas, @alexander-beedie, @josh, @n8henrie, @rben01, @ritchie46, @stinodego, @universalmind303 and @zundertj
Published by github-actions[bot] over 1 year ago
toggle_string_cache
to enable_string_cache
(#7970)sort
, top_k
, sort_by
, and arg_sort_by
, raise if descending
is a sequence and its length doesn't match the number of columns to sort by (#7957)time_unit
/time_zone
instead of tu
/tz
(#7910)struct
, concat_str
, and arg_sort_by
(#7308)shift_and_fill
and add default… (#7192)func
to function
(#7139)Series
/Expr
methods to keyword-only (#7860)FromParalleIter<Option<str>> for Utf8Chunked
~1.9x
(#8058)~2.5x
(#8057)~2x
. (#8053)into_groups
materialization ~-25%
(#8036)~25%
(#7980)DataFrame
init from pyarrow RecordBatch
objects, and improve init from Array
(#8011)write_ipc
to take file=None
(returning BytesIO
) (#7997)Config
methods, reference POLARS_MAX_THREADS in threadpool_size
docstring (#7965)struct
, concat_str
, and arg_sort_by
(#7308)sort
, top_k
, sort_by
, and arg_sort_by
, raise if descending
is a sequence and its length doesn't match the number of columns to sort by (#7957)toggle_string_cache
to enable_string_cache
(#7970)time_unit
/time_zone
instead of tu
/tz
(#7910)shift_and_fill
and add default… (#7192)func
to function
(#7139)Series
/Expr
methods to keyword-only (#7860)Thank you to all our contributors for making this release possible!
@MarcoGorelli, @StefanBRas, @alexander-beedie, @ghuls, @rben01, @ritchie46, @stinodego and @universalmind303
Published by github-actions[bot] over 1 year ago
~10%
(#7938)f
to function
in reduce docstring (#7925)Thank you to all our contributors for making this release possible!
@MarcoGorelli, @alexander-beedie, @alonme, @ankane, @dependabot, @dependabot[bot], @lorentzenchr, @rben01, @ritchie46 and @zundertj
Published by github-actions[bot] over 1 year ago
Series.pow()
(#7898)lit
values (#7879)write_excel
(#7871)Unknown
dtype to proceed as if dtype is None
, to allow inference (#7830)to_repr
methods to DataFrame and Series (#7802)DataFrame
(#7775)map_dict
. (#7797)from_repr
function that reconstructs a DataFrame from its table repr (#7781)aggregation_function
being 'first'
in pivot
. In a future version, it will default to None
(#7784)check_exact
for temporal types in assert_series_equal
(#7896)_repr_html_
escapes column names in addition to data/body elements (#7877)is_between
(#7835)venv
folder to .venv
(#7790)make requirements
option to install/refresh dependencies without having to recreate the venv
(#7792)ruff
target version (#7791)Thank you to all our contributors for making this release possible!
@LdRoW, @MarcoGorelli, @Newtoniano, @advoet, @alexander-beedie, @duskmoon314, @foxcroftjn, @ghuls, @jonashaag, @ritchie46, @stinodego and @zundertj
Published by github-actions[bot] over 1 year ago
min/max/sum
and impl SIMD mean
~2-5x
(#7702)str_replace
for same length replacements ~2x
(#7580)str.replace_n
and add n
argument ~10x
(#7575)replace_literal_all
of single byte replacements ~15x
. (#7565)-25%
(#7501)~20%
(#7500)-40%
(#7498)-~0.15%
(#7494)~3.85x
. (#7478)~2x
improvement (#7347)arr.count_match
expression and optimize arr.sum
for List<Boolean>
(#7023)~5-15%
(#6959)~8-18%
group tuples. (#6956)~2x
(#6861)str.contains
literal matching in the small-string regime (#6811)arg_min/arg_max
(#6799)DataFrame
(#7775)aggregation_function
being 'first'
in pivot
. In a future version, it will default to None
(#7784)qcut
(#7724)maintain_order
option to Series.cut
(#7723)maintain_order
in arr.unique
(#7721)DataFrame.top_k/ LazyFrame.top_k
(#7720)set_fmt_float
value in Config
load/save state (#7696)is_leap_year
to temporal expressions (#7618)groupby/unique
of groupby on integer keys (#7604)LazyFrame.unique
(#7470)LazyFrame.unique
(#7466)low_memory=True
. (#7394)LazyFrame.explode
streamable. (#7341)is_in
exprs (#7169)arr.count_match
expression and optimize arr.sum
for List<Boolean>
(#7023)NullArray
values to python row tuple (#7346)pl.struct
mappable (#7299)str.parse_int
(#7072)vector_hasher
module (#7807)DataFrame.get_columns_mut
as unsafe (#7557)eprintln!
instead of eprint!
(#7473)prec
to precision
(#7401)Expr
(#7258)num
with num-traits
+ a few minor maintenance fixes (#7201)Thank you to all our contributors for making this release possible!
@CloseChoice, @Hofer-Julian, @LdRoW, @MarcoGorelli, @MatveyF, @SauravMaheshkar, @Trippy3, @Vincenthays, @adamgreg, @advoet, @aldanor, @alexander-beedie, @borchero, @chitralverma, @cjackal, @coinflip112, @csko, @datapythonista, @dependabot, @dependabot[bot], @didriksg, @duskmoon314, @ecashin, @foxcroftjn, @ghuls, @iamsmkr, @igmriegel, @jakob-keller, @jonashaag, @josemasar, @josh, @juba, @jvdd, @kngwyu, @minimav, @moritzwilksch, @mslapek, @nrebena, @oysols, @ozgrakkurt, @papparapa, @ptiza, @rben01, @ritchie46, @romanovacca, @s-banach, @sorhawell, @stinodego, @universalmind303, @vincev, @xhochy, @xyning and @zundertj
Published by github-actions[bot] over 1 year ago
Datetime
or Duration
dtype timeunit (#7768)Thank you to all our contributors for making this release possible!
@alexander-beedie, @ghuls, @ritchie46 and @universalmind303
Published by github-actions[bot] over 1 year ago
min/max/sum
and impl SIMD mean
~2-5x
(#7702)write_excel
that adds a row-wise total column using structured references (#7751)min/max
(#7742)concat_list
(#7745)Series.hist
(#7727)qcut
(#7724)maintain_order
option to Series.cut
(#7723)maintain_order
in arr.unique
(#7721)DataFrame.top_k/ LazyFrame.top_k
(#7720)set_fmt_float
value in Config
load/save state (#7696)add
operator-equivalent expression (#7667)is_leap_year
to temporal expressions (#7618)scan_csv
to take a list of column names in a new_columns
param (#7642)groupby/unique
of groupby on integer keys (#7604)is_in
expressions (#7613)Series
init regression from list of np.arange
objects (#7692)__version__
attribute (#7680)Series
init with integer 1/0 values (#7619)Expr.pipe
API docs link (#7734)wrap_x
utils to utils
module (#7672)expr
parsing to utils (#7661)internals
(#7650)internals
(#7649)Thank you to all our contributors for making this release possible!
@MarcoGorelli, @alexander-beedie, @borchero, @chitralverma, @didriksg, @ghuls, @jakob-keller, @minimav, @ritchie46, @stinodego, @universalmind303 and @zundertj
Published by github-actions[bot] over 1 year ago
str_replace
for same length replacements ~2x
(#7580)DataFrame
init by implementing dynamic singledispatch
registration (#7559)str.replace_n
and add n
argument ~10x
(#7575)replace_literal_all
of single byte replacements ~15x
. (#7565)DataFrame
init by implementing dynamic singledispatch
registration (#7559)head
/tail
(#7554)BatchedCsvReader
from public API (#7546)internals
(#7597)lru_cache
to the apply
docstrings (#7593)pli
in type hints (part 2) (#7587)pli
in type hints (part 1) (#7586)fmt
tests to test_fmt
(#7555)sep
arg to separator
(#7533)Thank you to all our contributors for making this release possible!
@MarcoGorelli, @Vincenthays, @alexander-beedie, @ritchie46, @stinodego, @universalmind303 and @vincev
Published by github-actions[bot] over 1 year ago
-25%
(#7501)~20%
(#7500)-40%
(#7498)-~0.15%
(#7494)Decimal
dtype (#7511)show_versions
with xlsxwriter (and add as optional dependency) (#7507)LazyFrame
init in docs (#7508)Thank you to all our contributors for making this release possible!
@CloseChoice, @MarcoGorelli, @alexander-beedie, @ecashin, @ritchie46 and @stinodego
Published by github-actions[bot] over 1 year ago
~3.85x
. (#7478)LazyFrame.unique
(#7470)LazyFrame.unique
(#7466)row_heights
on Excel export (#7447)Excel
export when all data in a multi-column conditional format is contiguous (#7427)Excel
table column/range (#7411)low_memory=True
. (#7394)Excel
export (allows for heatmaps) (#7379)Excel
export (#7380)DataFrame
rendering compatible with quarto and pandoc (#7455)DataFrame
table rendering issue in some Jupyter environments (#7450)Excel
export improvements/fixes (#7363)read_x
functions arg file
to source
(#7460)utils
module (#7435)prec
to precision
(#7401)_base_type
util (#7410)from_x
to data
(#7407)schema
keyword description from `pl.… (#7400)cfg
module to config
(#7385)datatypes
module (#7357)Thank you to all our contributors for making this release possible!
@Hofer-Julian, @MarcoGorelli, @SauravMaheshkar, @aldanor, @alexander-beedie, @cjackal, @ghuls, @josh, @juba, @nrebena, @rben01, @ritchie46, @stinodego and @universalmind303
Published by github-actions[bot] over 1 year ago
~2x
improvement (#7347)LazyFrame.explode
streamable. (#7341)NullArray
values to python row tuple (#7346)write_excel
API docs link (#7338)Thank you to all our contributors for making this release possible!
@alexander-beedie, @ritchie46 and @s-banach
Published by github-actions[bot] over 1 year ago
write_excel
IO method (#7251)Excel
tables (#7333)write_database
(#7322)DataFrame.write_database
) (#7318)expr.apply
streamable in selection context (#7316)unnest
args (#7310)write_excel
IO method (#7251)describe_(optimized)_plan
in favor of explain
(#7264)is_in
exprs (#7169)**named_exprs
input for struct
(#7208)pl.struct
mappable (#7299)str.parse_int
(#7072)every
type is properly normalised (for groupby_dynamic
and groupby_rolling
) (#7238)cols=int
definition respects allowed_dtypes
(#7213)read/write_database
tests (#7327)scan_ds
to scan_pyarrow_dataset
(#7320)read_sql
to read_database
(#7315)git2
vulnerability (#7309)DataFrame.pearson_corr
(#7307)write_excel
doctests (#7306)pytest-xdist
with worksteal (#7304)io
module per type (#7295)_html
module to dataframe module (#7256)strict
for ruff TCH
lints (#7234)DataFrame
and LazyFrame
init params don't diverge (#7214)Thank you to all our contributors for making this release possible!
@MarcoGorelli, @aldanor, @alexander-beedie, @coinflip112, @csko, @dependabot, @dependabot[bot], @ghuls, @josemasar, @josh, @mslapek, @nrebena, @ozgrakkurt, @papparapa, @ptiza, @rben01, @ritchie46, @sorhawell, @stinodego, @universalmind303, @xyning and @zundertj
Published by github-actions[bot] over 1 year ago
sequence_to_pydf
(#7044)LazyFrame
init (same params as DataFrame
) (#7122)base_type
method to DataType
(#7166)explode
args (#7115)_unpack_schema
to prevent potential TypeError (#7128)Thank you to all our contributors for making this release possible!
@MarcoGorelli, @Trippy3, @alexander-beedie, @foxcroftjn, @ghuls, @iamsmkr, @jakob-keller, @josh, @mslapek, @papparapa, @ritchie46, @romanovacca, @stinodego, @universalmind303 and @zundertj
Published by github-actions[bot] over 1 year ago
clear
(#7095)drop
args (#7063)partition_by
args (#7065)exclude
args (#7082)from_records
(#7033)aggregate_fn
to aggregate_function
(#7059)TYPE_CHECKING
lints (#7070)f
/func
to function
(#7032)type: ignore
(#7028)arr.count_match()
(#7029)Thank you to all our contributors for making this release possible!
@MarcoGorelli, @alexander-beedie, @coinflip112, @datapythonista, @jakob-keller, @moritzwilksch, @ritchie46, @stinodego, @universalmind303 and @zundertj
Published by github-actions[bot] over 1 year ago
arr.count_match
expression and optimize arr.sum
for List<Boolean>
(#7023)selection_to_pyexpr_list
(#7020)LazyFrame.with_columns()
(#7019)expr_to_lit_or_expr
for arguments of type Expr
by ~80% (#6967)~5-15%
(#6959)~8-18%
group tuples. (#6956)arr.count_match
expression and optimize arr.sum
for List<Boolean>
(#7023)coalesce
args (#6989)agg
args (#6982)packaging
and/or distutils
dependency with a minimal version parser utility (#6972)over
args (#6986)upper_bound
and lower_bound
methods to Series
(#6990)col
args (#6996)sort
args (#6896)map_dict
method for Series (#6946)pl.lit
value (#6991)median
-> mean
(#6960).struct.to_frame
(#6958)Thank you to all our contributors for making this release possible!
@MarcoGorelli, @MatveyF, @alexander-beedie, @jakob-keller, @mslapek, @ozgrakkurt, @papparapa, @ritchie46, @sorhawell, @stinodego, @xhochy and @zundertj
Published by github-actions[bot] over 1 year ago
is_between
method for Series (#6933)Utf8
to polars dtype (#6885)groupby
args (#6872)date
=> object
typing in to_pandas
method (#6902)PYTHONPATH
from bleeding into polars venv
(#6888)unit.io
tests directory with python io
module (#6889)unit.io
tests directory with python io
module (#6889)datelike
as temporal
, and support Time dtype in Series.to_numpy
(#6881)Self
type more consistently (#6882)Thank you to all our contributors for making this release possible!
@MarcoGorelli, @adamgreg, @alexander-beedie, @josh, @jvdd, @ritchie46 and @stinodego