featuretools

An open source python library for automated feature engineering

BSD-3-CLAUSE License

Downloads
71.4K
Stars
7.1K
Committers
70

Bot releases are hidden (Show)

featuretools - v0.18.1

Published by thehomebrewnerd about 4 years ago

v0.18.1 Aug 12, 2020

  • Fixes
    • Fix EntitySet.plot() when given a dask entityset (#1086)
  • Changes
    • Use nlp-primitives[complete] install for nlp_primitives extra in setup.py (#1103)
  • Documentation Changes
    • Fix broken downloads badge in README.md (#1107)
  • Testing Changes
    • Use CircleCI matrix jobs in config to trigger multiple runs of same job with different parameters (#1105)

Thanks to the following people for contributing to this release:
@gsheni, @systemshift, @thehomebrewnerd

featuretools - v0.18.1.dev0

Published by thehomebrewnerd about 4 years ago

Development release for testing purposes

featuretools - v0.18.0

Published by rwedge about 4 years ago

v0.18.0 July 31, 2020

  • Enhancements
    • Warn user if supplied primitives are not used during dfs (#1073)
  • Fixes
    • Use more consistent and uniform warnings (#1040)
    • Fix issue with missing instance ids and categorical entity index (#1050)
    • Remove warnings.simplefilter in feature_set_calculator to un-silence warnings (#1053)
    • Fix feature visualization for features with '>' or '<' in name (#1055)
    • Fix boolean dtype mismatch between encode_features and dfs and calculate_feature_matrix (#1082)
    • Update primitive options to check reversed inputs if primitive is commutative (#1085)
    • Fix inconsistent ordering of features between kernel restarts (#1088)
  • Changes
    • Make DFS match TimeSince primitive with all Datetime types (#1048)
    • Change default branch to main (#1038)
    • Raise TypeError if improper input is supplied to Entity.delete_variables() (#1064)
    • Updates for compatibility with pandas 1.1.0 (#1079, #1089)
    • Set pandas version to pandas>=0.24.1,<2.0.0. Filter pandas deprecation warning in Week primitive. (#1094)
  • Documentation Changes
    • Remove benchmarks folder (#1049)
    • Add custom variables types section to variables page (#1066)
  • Testing Changes
    • Add fixture for ft.demo.load_mock_customer (#1036)
    • Refactor Dask test units (#1052)
    • Implement automated process for checking critical dependencies (#1045, #1054, #1081)
    • Don't run changelog check for release PRs or automated dependency PRs (#1057)
    • Fix non-deterministic behavior in Dask test causing codecov issues (#1070)

Thanks to the following people for contributing to this release: @frances-h, @gsheni, @monti-python, @rwedge, @systemshift, @tamargrey, @thehomebrewnerd, @wsankey

featuretools - v0.18.0.dev0

Published by rwedge about 4 years ago

Development release for testing purposes

featuretools - v0.17.0

Published by rwedge over 4 years ago

v0.17.0 June 30, 2020

  • Enhancements
    • Add list_variable_types and graph_variable_types for Variable Types (#1013)
    • Add graph_feature to generate a feature lineage graph for a given feature (#1032)
  • Fixes
    • Improve warnings when using a Dask dataframe for cutoff times (#1026)
    • Error if attempting to add entityset relationship where child variable is also child index (#1034)
  • Changes
    • Remove Feature.get_names (#1021)
    • Remove unnecessary pd.Series and pd.DatetimeIndex calls from primitives (#1020, #1024)
    • Improve cutoff time handling when a single value or no value is passed (#1028)
    • Moved find_variable_types to Variable utils (#1013)
  • Documentation Changes
    • Add page on Variable Types to describe some Variable Types, and util functions (#1013)
    • Remove featuretools enterprise from documentation (#1022)
    • Add development install instructions to contributing.md (#1030)
  • Testing Changes
    • Add required flag to CircleCI codecov upload command (#1035)

Thanks to the following people for contributing to this release:
@frances-h, @gsheni, @kmax12, @rwedge, @thehomebrewnerd, @tuethan1999

featuretools - v0.17.0.dev0

Published by rwedge over 4 years ago

Development release for testing purposes

featuretools - v0.16.0

Published by rwedge over 4 years ago

v0.16.0 June 5, 2020

  • Enhancements
    • Support use of Dask DataFrames in entitysets (#783)
    • Add make_index when initializing an EntitySet by passing in an entities dictionary (#1010)
    • Add ability to use primitive classes and instances as keys in primitive_options dictionary (#993)
  • Fixes
    • Cleanly close tqdm instance (#1018)
    • Resolve issue with NaN values in LatLong columns (#1007)
  • Testing Changes
    • Update tests for numpy v1.19.0 compatability (#1016)

Thanks to the following people for contributing to this release:
@Alex-Monahan, @frances-h, @gsheni, @rwedge, @thehomebrewnerd

featuretools - v0.15.0

Published by rwedge over 4 years ago

v0.15.0 May 29, 2020

  • Enhancements

    • Add get_default_aggregation_primitives and get_default_transform_primitives (#945)
    • Allow cutoff time dataframe columns to be in any order (#969, #995)
    • Add Age primitive, and make it a default transform primitive for DFS (#987)
    • Add include_cutoff_time arg - control whether data at cutoff times are included in feature calculations (#959)
    • Allow variables_types to be referenced by their type_string
      for the entity_from_dataframe function (#988)
  • Fixes

    • Fix errors with Equals and NotEquals primitives when comparing categoricals or different dtypes (#968)
    • Normalized type_strings of Variable classes so that the find_variable_types function produces a
      dictionary with a clear key to name transition (#982, #996)
    • Remove pandas.datetime in test_calculate_feature_matrix due to deprecation (#998)
  • Documentation Changes

    • Add python 3.8 support for docs (#983)
    • Adds consistent Entityset Docstrings (#986)
  • Testing Changes

    • Add automated tests for python 3.8 environment (#847)
    • Update testing dependencies (#976)

    Thanks to the following people for contributing to this release:
    @ctduffy, @frances-h, @gsheni, @jeff-hernandez, @rightx2, @rwedge, @sebrahimi1988, @thehomebrewnerd, @tuethan1999

Breaking Changes

  • Calls to featuretools.dfs or featuretools.calculate_feature_matrix that use a cutoff time
    dataframe, but do not label the time column with either the target entity time index variable name or
    as time, will now result in an AttributeError. Previously, the time column was selected to be the first
    column that was not the instance id column. With this update, the position of the column in the dataframe is
    no longer used to determine the time column. Now, both instance id columns and time columns in a cutoff time
    dataframe can be in any order as long as they are named properly.

  • The type_string attributes of all Variable subclasses are now a snake case conversion of their class names. This
    changes the type_string of the Unknown, IPAddress, EmailAddress, SubRegionCode, FilePath, LatLong, and ZIPcode classes.
    Old saved entitysets that used these variables may load incorrectly.

featuretools - v0.14.0

Published by frances-h over 4 years ago

v0.14.0 Apr 30, 2020

  • Enhancements
    • ft.encode_features - use less memory for one-hot encoded columns (#876)
  • Fixes
    • Use logger.warning to fix deprecated logger.warn (#871)
    • Add dtype to interesting_values to fix deprecated empty Series with no dtype (#933)
    • Remove overlap in training windows (#930)
    • Fix progress bar in notebook (#932)
  • Changes
    • Change premium primitives CI test to Python 3.6 (#916)
    • Remove Python 3.5 support (#917)
  • Documentation Changes
    • Fix README links to docs (#872)
    • Fix Github links with correct organizations (#908)
    • Fix hyperlinks in docs and docstrings with updated address (#910)
    • Remove unused script for uploading docs to AWS (#911)

Thanks to the following people for contributing to this release:
@frances-h, @gsheni, @jeff-hernandez, @rwedge

featuretools - v0.14.0.dev0

Published by frances-h over 4 years ago

Development release for testing purposes

featuretools - v0.13.4

Published by frances-h over 4 years ago

v0.13.4 Mar 27, 2020

  • Fixes
    • Fix ft.show_info() not displaying in Jupyter notebooks (#863)
  • Changes
    • Added Plugin Warnings at Entry Point (#850, #869)
  • Documentation Changes
    • Add links to primitives.featurelabs.com (#860)
    • Add source code links to API reference (#862)
    • Update links for testing Dask/Spark integrations (#867)
    • Update release documentation for featuretools (#868)
  • Testing Changes
    • Miscellaneous changes (#861)

Thanks to the following people for contributing to this release: @frances-h, @FreshLeaf8865, @jeff-hernandez, @rwedge, @thehomebrewnerd

featuretools - v0.13.4.dev0

Published by frances-h over 4 years ago

Development release for testing purposes

featuretools - v0.13.3

Published by rwedge over 4 years ago

v0.13.3 Feb 28, 2020

  • Fixes
    • Fix a connection closed error when using n_jobs (#853)
  • Changes
    • Pin msgpack dependency for Python 3.5; remove dataframe from Dask dependency (#851)
  • Documentation Changes
    • Update link to help documentation page in Github issue template (#855)

Thanks to the following people for contributing to this release:
@frances-h, @rwedge

featuretools - v0.13.3.dev0

Published by rwedge over 4 years ago

Development release for testing purposes

featuretools - v0.13.2

Published by rwedge over 4 years ago

v0.13.2 Jan 31, 2020

  • Enhancements
    • Support for Pandas 1.0.0 (#844)
  • Changes
    • Remove dependency on s3fs library for anonymous downloads from S3 (#825)
  • Testing Changes
    • Added GitHub Action to automatically run performance tests (#840)

Thanks to the following people for contributing to this release:
@frances-h, @rwedge

featuretools - v0.13.2.dev0

Published by rwedge over 4 years ago

Test release for development purposes

featuretools - v0.13.1

Published by rwedge almost 5 years ago

v0.13.1 Dec 28, 2019

  • Enhancements
  • Fixes
    • Raise error when given wrong input for ignore_variables (#826)
    • Fix multi-output features not created when there is no child data (#834)
    • Removing type casting in Equals and NotEquals primitives (#504)
  • Changes
    • Replace pd.timedelta time units that were deprecated (#822)
    • Move sklearn wrapper to separate library (#835, #837)
  • Documentation Changes
  • Testing Changes
    • Run unit tests in windows environment (#790)
    • Update boto3 version requirement for tests (#838)

Thanks to the following people for contributing to this release:
@jeffzi, @kmax12, @rwedge, @systemshift

featuretools - v0.13.1.dev0

Published by rwedge almost 5 years ago

Test release for development purposes

featuretools - v0.13.0

Published by rwedge almost 5 years ago

v0.13.0 Nov 30, 2019

  • Enhancements
    • Added GitHub Action to auto upload releases to PyPI (#816)
  • Fixes
    • Fix issue where some primitive options would not be applied (#807)
    • Fix issue with converting to pickle or parquet after adding interesting features (#798, #823)
    • Diff primitive now calculates using all available data (#824)
    • Prevent DFS from creating Identity Features of globally ignored variables (#819)
  • Changes
    • Remove python 2.7 support from serialize.py (#812)
    • Make smart_open, boto3, and s3fs optional dependencies (#827)
  • Documentation Changes
    • remove python 2.7 support and add 3.7 in install.rst (#805)
    • Fix import error in docs (#803)
    • Fix release title formatting in changelog (#806)
  • Testing Changes
    • Use multiple CPUS to run tests on CI (#811)
    • Refactor test entityset creation to avoid saving to disk (#813, #821)
    • Remove get_values() from test_es.py to remove warnings (#820)

Thanks to the following people for contributing to this release:
@frances-h, @jeff-hernandez, @rwedge, @systemshift

Breaking Changes

  • The libraries used for downloading or uploading from S3 or URLs are now
    optional and will no longer be installed by default. To use this
    functionality they will need to be installed separately.
  • The fix to how the Diff primitive is calculated may slow down the overall
    calculation time of feature lists that use this primitive.
featuretools - v0.13.0.dev0

Published by rwedge almost 5 years ago

Test release for development purposes

Package Rankings
Top 19.62% on Anaconda.org
Top 0.99% on Pypi.org
Top 5.19% on Proxy.golang.org
Top 8.11% on Conda-forge.org
Related Projects