tpot

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

LGPL-3.0 License

Downloads

43.7K

Stars

9.5K

Committers

View Code on GitHub Visit Website

Ecosystems: scikit-learn, Python

Bot releases are hidden (Show)

tpot - v0.12.2 Latest Release

Published by jay-m-dev 8 months ago

What's Changed

estimator type by @perib in https://github.com/EpistasisLab/tpot/pull/1319
Update requirements by @gatl in https://github.com/EpistasisLab/tpot/pull/1335
Bump torch from 1.3.1 to 1.13.1 by @dependabot in https://github.com/EpistasisLab/tpot/pull/1336
update sklearn version to 1.1.3 by @perib in https://github.com/EpistasisLab/tpot/pull/1337
remove deprecated imp, fix docstring warning by @perib in https://github.com/EpistasisLab/tpot/pull/1331
update compatibility with scikitlearn 1.4 by @perib in https://github.com/EpistasisLab/tpot/pull/1343
Improve error message by @gatl in https://github.com/EpistasisLab/tpot/pull/1338
Mate operator fix by @perib in https://github.com/EpistasisLab/tpot/pull/1268

New Contributors

@gatl made their first contribution in https://github.com/EpistasisLab/tpot/pull/1335
@dependabot made their first contribution in https://github.com/EpistasisLab/tpot/pull/1336

Full Changelog: https://github.com/EpistasisLab/tpot/compare/v0.12.1...v0.12.2

tpot - v0.12.1

Published by jay-m-dev about 1 year ago

Fixes issue with runs terminating too early

What's Changed

Early crash fix by @perib in https://github.com/EpistasisLab/tpot/pull/1315
Full Changelog: https://github.com/EpistasisLab/tpot/compare/v0.12.0...v0.12.1

tpot - v0.12.0 release

Published by nickotto over 1 year ago

Fix numpy compatibility
Dask optimizations
Minor bug fixes

tpot - v0.11.7 minor release

Published by weixuanfu almost 4 years ago

Fix compatibility issue with scikit-learn 0.24 and xgboost 1.3.0
Fix a bug causing that TPOT does not work when classifying more than 50 classes
Add initial support Resampler from imblearn
Fix minor bugs

tpot - 0.11.6.post3

Published by weixuanfu almost 4 years ago

A patch to fix compatibility issues with the latest version of xgboost (v1.3.0)

tpot - v0.11.6.post2

Published by weixuanfu almost 4 years ago

make XGBoost as a required dependency

tpot - v0.11.6.post1

Published by weixuanfu almost 4 years ago

Refine the logic of checking the type of an operator.

tpot - Version 0.11.6

Published by weixuanfu almost 4 years ago

Fix a bug causing point mutation function does not work properly with using template option
Add a new built configuration called "TPOT cuML" which TPOT will search over a restricted configuration using the GPU-accelerated estimators in RAPIDS cuML and DMLC XGBoost. This configuration requires an NVIDIA Pascal architecture or better GPU with compute capability 6.0+, and that the library cuML is installed.
Add string path support for log/log_file parameter
Fix a bug in version 0.11.5 causing no update in stdout after each generation
Fix minor bugs

tpot - Covariate adjustments branch

Published by weixuanfu about 4 years ago

Development branch based on TPOT 0.11.1 for adjusting covariate without data leakage.

tpot - v0.11.5

Published by weixuanfu over 4 years ago

Make Pytorch as an optional dependency
Refine installation documentation

tpot - TPOT v0.11.4 minor release

Published by weixuanfu over 4 years ago

Add a new built configuration "TPOT NN" which includes all operators in "Default TPOT" plus additional neural network estimators written in PyTorch (currently tpot.builtins.PytorchLRClassifier and tpot.builtins.PytorchMLPClassifier for classification tasks only)
Refine log_file parameter's behavior

tpot - TPOT v0.11.3 minor release

Published by weixuanfu over 4 years ago

Fix a bug in TPOTRegressor in v0.11.2
Add -log option in command line interface to save process log to a file.

tpot - TPOT v0.11.2 Minor Release

Published by weixuanfu over 4 years ago

Fix early_stop parameter does not work properly
TPOT built-in OneHotEncoder can refit to different datasets
Fix the issue that the attribute evaluated_individuals_ cannot record correct generation info.
Add a new parameter log_file to output logs to a file instead of sys.stdout
Fix some code quality issues and mistakes in documentations
Fix minor bugs

tpot - TPOT v0.11.1 Minor Release

Published by weixuanfu almost 5 years ago

Fix compatibility issue with scikit-learn v0.22
warm_start now saves both Primitive Sets and evaluated_pipelines_ from previous runs;
Fix the error that TPOT assign wrong fitness scores to non-evaluated pipelines (interrupted by max_min_mins or KeyboardInterrupt) ;
Fix the bug that mutation operator cannot generate new pipeline when template is not default value and warm_start is True;
Fix the bug that max_time_mins cannot stop optimization process when search space is limited.
Fix a bug in exported codes when the exported pipeline is only 1 estimator
Fix spelling mistakes in documentations
Fix some code quality issues

tpot - Version 0.11.0

Published by weixuanfu almost 5 years ago

Support for Python 3.4 and below has been officially dropped. Also support for scikit-learn 0.20 or below has been dropped.
The support of a metric function with the signature score_func(y_true, y_pred) for scoring parameter has been dropped.
Refine StackingEstimator for not stacking NaN/Infinity predication probabilities.
Fix a bug that population doesn't persist even warm_start=True when max_time_mins is not default value.
Now the random_state parameter in TPOT is used for pipeline evaluation instead of using a fixed random seed of 42 before. The set_param_recursive function has been moved to export_utils.py and it can be used in exported codes for setting random_state recursively in scikit-learn Pipeline. It is used to set random_state in fitted_pipeline_ attribute and exported pipelines.
TPOT can independently use generations and max_time_mins to limit the optimization process through using one of the parameters or both.
.export() function will return string of exported pipeline if output filename is not specified.
Add SGDClassifier and SGDRegressor into TPOT default configs.
Documentation has been updated.
Fix minor bugs.

tpot - TPOT v0.10.2 minor release

Published by weixuanfu over 5 years ago

TPOT v0.10.2 is the last version to support Python 2.7 and Python 3.4.
Minor updates for fixing compatibility issues with the latest version of scikit-learn (version > 0.21) and xgboost (v0.90)
Default value of template parameter is changed to None instead.
Fix errors in documentation

tpot - TPOT v0.10.1 minor release

Published by weixuanfu over 5 years ago

Add data_file_path option into expert function for replacing 'PATH/TO/DATA/FILE' to customized dataset path in exported scripts. (Related issue #838)
Change python version in CI tests to 3.7
Add CI tests for macOS.

tpot - TPOT 0.10.0 Release

Published by weixuanfu over 5 years ago

Add a new template option to specify a desired structure for machine learning pipeline in TPOT. Check TPOT API (it will be updated once it is merge to master branch).
Add FeatureSetSelector operator into TPOT for feature selection based on priori export knowledge. Please check our preprint paper for more details (Note: it was named DatasetSelector in 1st version paper but we will rename to FeatureSetSelector in next version of the paper)
Refine n_jobs parameter to accept value below -1. For n_jobs below -1, (n_cpus + 1 + n_jobs) are used. Thus for n_jobs = -2, all CPUs but one are used. It is related to the issue #846.
Now memory parameter can create memory cache directory if it does not exist. It is related to the issue #837.
Fix minor bugs.

tpot - TPOT 0.9.6 Minor Release

Published by weixuanfu over 5 years ago

Fix a bug causing that max_time_mins parameter doesn't work when use_dask=True in TPOT 0.9.5
Now TPOT saves best pareto values best pareto pipeline s in checkpoint folder
TPOT raises ImportError if operators in the TPOT configuration are not available when verbosity>2
Thank @PGijsbers for the suggestions. Now TPOT can save scores of individuals already evaluated in any generation even the evaluation process of that generation is interrupted/stopped. But it is noted that, in this case, TPOT will raise this warning message: WARNING: TPOT may not provide a good pipeline if TPOT is stopped/interrupted in a early generation., because the pipelines in early generation, e.g. 1st generation, are evolved/modified very limited times via evolutionary algorithm.
Fix bugs in configuration of TPOTRegressor
Error fixes in documentation

tpot - TPOT now supports integration with Dask for parallelization

Published by weixuanfu about 6 years ago

TPOT now supports integration with Dask for parallelization + smart caching. Big thanks to the Dask dev team for making this happen!
TPOT now supports for imputation/sparse matrices into predict and predict_proba functions.
TPOTClassifier and TPOTRegressor now follows scikit-learn estimator API.
We refined scoring parameter in TPOT API for accepting Scorer object.
We refined parameters in VarianceThreshold and FeatureAgglomeration.
TPOT now supports using memory caching within a Pipeline via a optional memory parameter.
We improved documentation of TPOT.

Package Rankings

Top 4.06% on Proxy.golang.org

Top 7.29% on Conda-forge.org

Top 1.21% on Pypi.org

Badges

Extracted from project README

Master Build Status - Mac/Linux

Master Build Status - Windows

Master Coverage Status

Development Build Status - Mac/Linux

Development Build Status - Windows

Development Coverage Status

Related Projects

giotto-tda

A high-performance topological machine learning toolbox in Python

15 Oct 2019 836

sklearn-porter

Transpile trained scikit-learn estimators to C, Java, JavaScript and others.

22 Jun 2016 1,284

topometry

Systematically learn and evaluate manifolds from high-dimensional data

scikit-mdr

A sklearn-compatible Python implementation of Multifactor Dimensionality Reduction (MDR) for feat...

07 Jun 2016 126

robert

Automated machine learning protocols that start from CSV databases of descriptors or SMILES and p...

dtreeviz

A python library for decision tree visualization and model interpretation.

13 Aug 2018 2,851

DAT8

General Assembly's 2015 Data Science course in Washington, DC

07 Aug 2015 1,606

imbalanced-ensemble

🛠️ Class-imbalanced Ensemble Learning Toolbox. | 类别不平衡/长尾机器学习库

10 Mar 2021 321

data-science-ipython-notebooks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Ka...

23 Jan 2015 27,232

Sklearn-genetic-opt

ML hyperparameters tuning and features selection, using evolutionary algorithms.

18 Jan 2020 307

adapt

Awesome Domain Adaptation Python Toolbox

25 Jun 2020 302

Neuraxle

The world's cleanest AutoML library ✨ - Do hyperparameter tuning with the right pipeline abstract...

26 Mar 2019 606

EC-KitY

EC-KitY: A scikit-learn-compatible Python tool kit for doing evolutionary computation.

cblearn

Comparison-based Machine Learning in Python

imodels

Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-co...

04 Jul 2019 1,372