Python library for using dplyr like syntax with pandas and SQL
MIT License
Published by machow about 1 year ago
Full Changelog: https://github.com/machow/siuba/compare/v0.4.3...v0.4.4
Published by machow about 1 year ago
Full Changelog: https://github.com/machow/siuba/compare/v0.4.2...v0.4.3
Published by machow almost 2 years ago
Full Changelog: https://github.com/machow/siuba/compare/v0.4.1...v0.4.2
Published by machow almost 2 years ago
Full Changelog: https://github.com/machow/siuba/compare/v0.4.0...v0.4.1
Published by machow about 2 years ago
See https://github.com/machow/siuba/releases/tag/v0.4.0rc1 for details
Published by machow about 2 years ago
This PR mainly implements the functions tbl(), across(), pivot_longer(), pivot_wider(). It ensures verbs work with grouped data. As part of implementing across(), it refactored tidyselection and SQL support.
tbl() function for pandas and sqla (https://github.com/machow/siuba/pull/446)
pivot_longer() and pivot_wider() (https://github.com/machow/siuba/pull/440)
across() (https://github.com/machow/siuba/pull/452)
count() and add_count() support the name argument. (#452)
Fx, now exposed as a top level import. This is used to represent operations in across() and the pivot functions. (#452)
lambda _: _.startswith("abc") was equivalent to _.startswith("abc")
When select() is given a callable function, it passes each column of data to it, and expects back a boolean.
select(cars, lambda ser: ser.dtype == "int")
fct_inseq(), fct_infreq() (https://github.com/machow/siuba/pull/435)
"*" when possible (https://github.com/machow/siuba/pull/344)
pipe() and call() functions (https://github.com/machow/siuba/pull/413)
pipe() allows you to manually create a pipe
call() allows you to represent a call to an external function (e.g. call(pd.to_datetime, _.some_col))
LazyTbl.last_select to simplify queries (https://github.com/machow/siuba/pull/449).
Calling arrange() twice now resets the order_by variables set by the first call (matching dbplyr behavior) (#452).
add_count() now correctly handles named arguments, and is tested for most cases count() is tested on (#452).
Full Changelog: https://github.com/machow/siuba/compare/v0.3.0...v0.4.0rc1
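The select() note in this release says a callable receives each column and returns a boolean. A minimal sketch of that rule in plain pandas follows. The helper select_where and the small cars frame are hypothetical stand-ins for illustration, not siuba's implementation, and is_integer_dtype plays the role of the lambda in the note.

```python
import pandas as pd
from pandas.api.types import is_integer_dtype

# Hypothetical stand-in for the `cars` data mentioned in the release note.
cars = pd.DataFrame({
    "cyl": [6, 4, 8],
    "mpg": [21.0, 22.8, 18.7],
    "hp": [110, 93, 175],
})


def select_where(df, predicate):
    """Keep the columns for which predicate(series) is truthy.

    Hypothetical helper that mimics the described behavior in plain pandas.
    """
    keep = [name for name, ser in df.items() if predicate(ser)]
    return df[keep]


# The predicate is called once per column, with that column as a Series.
int_cols = select_where(cars, is_integer_dtype)
print(list(int_cols.columns))
```

Because the predicate sees whole columns, it can test dtypes or values, which a name-based selector cannot.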
Published by machow over 2 years ago
head() over grouped data by @machow in https://github.com/machow/siuba/pull/420
win_over by cleaning up sql Over clause code @machow in https://github.com/machow/siuba/pull/412
Full Changelog: https://github.com/machow/siuba/compare/v0.2.3...v0.3.0
Published by machow over 2 years ago
Also adds more tests of calling numpy functions over symbolics
Published by machow over 2 years ago
Full Changelog: https://github.com/machow/siuba/compare/v0.2.1...v0.2.2
Published by machow over 2 years ago
Full Changelog: https://github.com/machow/siuba/compare/v0.2.0...v0.2.1
Published by machow over 2 years ago
@mean.register
def _mean(col: SqlColumn): ...
@mean.register
def _mean(codata: SqlColumn, col): ...
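The two registrations above differ in whether the function takes a leading codata argument. As a rough illustration of registering an implementation by the type of the first argument, here is a sketch using the standard library's functools.singledispatch. The SqlColumn class, the mean function, and the returned string are all hypothetical, and it is an assumption that siuba's dispatch behaves the same way.

```python
from functools import singledispatch


class SqlColumn:
    """Hypothetical marker type standing in for a SQL column."""


@singledispatch
def mean(codata, col):
    # Fallback used when nothing is registered for type(codata).
    raise NotImplementedError(f"mean not supported for {type(codata).__name__}")


@mean.register
def _mean_sql(codata: SqlColumn, col):
    # Chosen because the first argument is a SqlColumn;
    # `col` is the value the translation operates on.
    return f"avg({col})"


result = mean(SqlColumn(), "x")
print(result)
```

In this sketch the first argument only selects the implementation, and the second carries the data being translated.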
Published by machow over 2 years ago
This is a dev release to support...
Published by machow over 2 years ago
Published by machow over 2 years ago
⚠️ Note: this is a re-release of alpha v1.0.0a3 to be v0.1.1. This will allow users to easily install these releases using pip. Changelog copied below.
Published by machow over 2 years ago
⚠️ Note: this is a re-release of alpha v1.0.0a2 to be v0.1.0. This will allow users to easily install these releases using pip. Changelog copied below.
This is an alpha release for v1.0.0. There will likely be extensive changes through January as I work to refactor the core API.
ops submodule (and removed spec submodule).
func class method. This generates a constructor for a specific sql translation.
Published by machow almost 3 years ago
Published by machow almost 3 years ago
This is an alpha release for v1.0.0. There will likely be extensive changes through January as I work to refactor the core API.
ops submodule (and removed spec submodule).
func class method. This generates a constructor for a specific sql translation.
Published by machow over 3 years ago
This release adds support for SQLAlchemy 1.4, does some light refactoring, and fixes a slicing issue.
_[1:2, _.a:_.b]). #325
regroup(), broadcast_agg(), is_compatible()). #310
pipe(_.some_col)). #325
Published by machow about 4 years ago
Much of this release is setting up for...
- in sql arrange now produces the correct DESC, rather than being a - operation (#280)
Published by machow about 4 years ago
anti_join(df1, df2, on = "some_col") #264
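For readers unfamiliar with anti joins: the call above keeps the rows of df1 whose some_col value has no match in df2. A minimal sketch of the same result in plain pandas follows, using merge with indicator=True. The example frames are made up, and this is not how siuba implements anti_join.

```python
import pandas as pd

# Made-up example data; only the column name some_col comes from the note.
df1 = pd.DataFrame({"some_col": [1, 2, 3], "x": ["a", "b", "c"]})
df2 = pd.DataFrame({"some_col": [2, 3, 4]})

# Left-merge against the distinct keys of df2, recording where each row matched.
keys = df2[["some_col"]].drop_duplicates()
merged = df1.merge(keys, on="some_col", how="left", indicator=True)

# Keep rows present only in df1, then drop the helper column.
anti = merged.loc[merged["_merge"] == "left_only"].drop(columns="_merge")
print(anti)
```

Merging against only the de-duplicated key column avoids multiplying rows of df1 and keeps df2's other columns out of the result.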