plydata

A grammar for data manipulation in Python

BSD-3-CLAUSE License

Downloads
1.6K
Stars
276
plydata - v0.4.3 Latest Release

Published by has2k1 almost 4 years ago

Depends on Pandas >= 1.1.5. This version of pandas fixes a bug that made plydata unusable for pandas versions v1.1.0, v1.1.1, v1.1.2, v1.1.3 and v1.1.4.

plydata - v0.4.2

Published by has2k1 almost 4 years ago

Depends on pandas < 1.1.0.

plydata - v0.4.1

Published by has2k1 over 4 years ago

v0.4.1 is a maintenance release with 2 bugs fixed and no new features. See the changelog.

plydata - v0.4.0

Published by has2k1 over 4 years ago

This is a big release that includes two new submodules;

  1. plydata.tidy for verbs that transform between short form and long form dataframes.
  2. plydata.cat_tools for methods that manipulate categoricals.

See the changelog for details.

plydata - v0.3.3

Published by has2k1 about 6 years ago

v0.3.3 is a maintenance release; no new features; one bug fixed.

plydata - v0.3.1

Published by has2k1 almost 7 years ago

A few bug fixes. See the changelog.

plydata - v0.3.0

Published by has2k1 almost 7 years ago

This release a number significant additions and bug fixes. Below is a copy of the changelog.


Bug Fixes

  • Fixed define (mutate) and create (transmute), make them work with group_by.
  • Fixed tally to work with external arrays.
  • Fixed tally to sort in descending order.
  • Fixed the nth function of summarize to return NaN when the requested value is out of bounds.
  • The contains and matches parameters of select can now accept a tuple of values.
  • Fixed verbs that create columns (i.e create, define and do) so that they can create categorical columns.
  • The join verbs gained left_on and right_on parameters.
  • Fixed verb reuse. You can create a verb and assign it to a variable and pipe to the same variable in different operations.
  • Fixed issue where select does maintain the order in which the columns are listed.

New Features

  • Added special verb call, it allows one to use external functions that accept a dataframe as the first argument.

  • For define, create and group_by, you can now use the special function n() to count the number of elements in current group.

  • Added the single table helper verbs:

    • add_count
    • add_tally
    • arrange_all
    • arrange_at
    • arrange_if
    • create_all
    • create_at
    • create_if
    • group_by_all
    • group_by_at
    • group_by_if
    • mutate_all
    • mutate_at
    • mutate_if
    • query_all
    • query_at
    • query_if
    • rename_all
    • rename_at
    • rename_if
    • summarize_all
    • summarize_at
    • summarize_if
  • Added pull verb.

  • Added slice_rows verb.

API Changes

  • Using internal function for summarize that counts the number of elements in the current group changed from {n} to n().
  • You can now use piping with the two table verbs (the joins).
  • modify_where and define_where helper verbs have been removed. Using the new expression helper functions case_when and if_else is more readable.
  • Removed dropna and fillna in favour of using call with pandas.DataFrame.dropna() and pandas.DataFrame.fillna().

plydata - Version 0.1.1

Published by has2k1 over 7 years ago

Re-release of version 0.1.0 without the universal wheels since the project does not support Python 2.

plydata - Version 0.1.0

Published by has2k1 over 7 years ago

First release. All the expected data manipulation verbs for the pandas dataframe are implemented.