kmodes

Python implementations of the k-modes and k-prototypes clustering algorithms, for clustering categorical data

MIT License

Downloads
274.5K
Stars
1.2K
Committers
19

Bot releases are hidden (Show)

kmodes - 0.12.2 Latest Release

Published by nicodv about 2 years ago

What's changed

Full Changelog: https://github.com/nicodv/kmodes/compare/0.12.1...0.12.2

kmodes - 0.12.1

Published by nicodv over 2 years ago

What's changed

Full Changelog: https://github.com/nicodv/kmodes/compare/0.12.0...0.12.1

kmodes - 0.12.0

Published by nicodv over 2 years ago

What's changed

Full Changelog: https://github.com/nicodv/kmodes/compare/0.11.1...0.12.0

kmodes - 0.11.1

Published by nicodv about 3 years ago

What's Changed

Full Changelog: https://github.com/nicodv/kmodes/compare/0.11.0...0.11.1

kmodes - 0.11.0

Published by nicodv over 3 years ago

  • Python 3.9 support
  • Minimum sklearn version upgrade to 0.22
  • Default init method for k-prototypes is now the Cao method (same as k-modes and in line with documentation), courtesy of @larroy
  • Optimizations
kmodes - 0.10.2

Published by nicodv over 4 years ago

  • Added Jaccard dissimilarity function, courtesy of @BikashPandey17 (#129 )
  • Return the costs per epoch after training, courtesy of @daffidwilde (#79 )
  • Python 3.8 now supported
  • Python 3.4 no longer supported because sklearn dropped it too
  • Various bugfixes and improvements
kmodes - 0.9

Published by nicodv over 4 years ago

  • Bugfixes
kmodes - 0.7

Published by nicodv over 4 years ago

  • Categorical variables are now automatically encoded and decoded between original data values and integers (used internally by k-modes). User does not have to use to the categorical variable mapping anymore when looking at the cluster centroids.
  • Support for custom dissimilarity measures
  • Python 3.6 support
  • More robust manual initialization
kmodes - 0.8

Published by nicodv over 4 years ago

  • Huge speedup for k-prototypes, especially for large numbers of samples (#45). A k-prototypes benchmark script is included in examples now.
  • Offer an implementation of Ng's dissimilarity measure, which could improve convergence (#37).
  • Allow pandas DataFrames to be presented to the algorithm, instead of just numpy arrays (#40).
  • Improved handling of dependencies (#49, #53).
  • Various small bugfixes and improvements.
kmodes - 0.10.0

Published by nicodv over 4 years ago

  • Support for more than 256 clusters
  • Optional parallel execution of the multiple initialization runs (courtesy of @rphes )
  • Enhanced error checking when using pandas DataFrames as inputs to the algorithms
  • Various bug fixes and improvements
  • Semantic versioning from now on
kmodes - 0.10.1

Published by nicodv over 4 years ago

  • Improved pandas compatibility, courtesy of @Genie-Liu
  • Forward compatible with future scikit-learn versions that will no longer include joblib, courtesy of @trevorstephens
Package Rankings
Top 16.85% on Conda-forge.org
Top 34.1% on Anaconda.org
Top 17.2% on Spack.io
Top 1.45% on Pypi.org
Related Projects