Koalas: pandas API on Apache Spark
APACHE-2.0 License
Bot releases are hidden (Show)
Published by ueshin over 5 years ago
We refined the package management and pushed to conda-forge as well as PyPI. Now we can install Koalas with the conda package manager:
conda install koalas -c conda-forge
We also added the following features:
koalas:
koalas.DataFrame:
koalas.Series:
Along with the following improvements:
Index
/MultiIndex
corresponding to pandas', instead of reusing Series
. (#341)Published by rxin over 5 years ago
We rapidly improved Koalas in documentation and added new functionalities in the past week. As of this release, all functions are documented. We also added the following features:
koalas:
koalas.DataFrame:
koalas.Series:
Along with the following improvements:
Published by ueshin over 5 years ago
We fixed a critical bug for Python 3.5 introduced in v0.2.0. #241
Also we have added the following features:
koalas.DataFrame:
koalas.Series:
and improvements:
koalas.Series:
__add__
and __radd__
now supports string concatenationkoalas.groupby.GroupBy:
agg()
now preserves the group keys as indicesand a lot of code and document cleanups.
Published by rxin over 5 years ago
We have implemented a lot of major functionalities in the past week. Here's a summary of what's new in release v0.2.0.
spark.DataFrame:
koalas.DataFrame:
koalas.Series:
Significantly improved documentation of the project.
Last but not least, we have done some major refactoring of the codebase and its infrastructure to make it more amenable to changes in the future, e.g.
Published by rxin over 5 years ago
We rewrote the internals of Koalas to make it more extensible for upcoming features. We also laid down the foundation for API reference docs in this release.
Published by thunterdb over 5 years ago
This version significantly expands the amount of functions available. It is still meant to be a technology preview, and users are encouraged to report issues that they encounter with their current pandas code.
Noteworthy features:
We thank all the contributors who have contributed to this release.
Published by thunterdb over 5 years ago
This is the initial release outside Databricks.
This release is meant to be a technology preview. See the README.md file for more information.