featuretools

An open source python library for automated feature engineering

BSD-3-CLAUSE License

Downloads
71.4K
Stars
7.1K
Committers
70


featuretools - v1.31.0 Latest Release

Published by thehomebrewnerd 5 months ago

v1.31.0 May 14, 2024

  • Enhancements
    • Add support for Python 3.12 (#2713)
  • Fixes
    • Move flatten_list util function into feature_discovery module to fix import bug (#2702)
  • Changes
    • Temporarily restrict Dask version (#2694)
    • Remove support for creating EntitySets from Dask or Pyspark dataframes (#2705)
    • Bump minimum versions of tqdm and pip in requirements files (#2716)
    • Use filter arg in call to tarfile.extractall to safely deserialize EntitySets (#2722)
  • Testing Changes
    • Fix serialization test to work with pytest 8.1.1 (#2694)
    • Update to allow minimum dependency checker to run properly (#2709)
    • Update pull request check CI action (#2720)
    • Update release notes updated check CI action (#2726)

Thanks to the following people for contributing to this release:
@thehomebrewnerd
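
The `tarfile.extractall` change (#2722) builds on the standard library's extraction filters. The following is a minimal sketch of that idea, not Featuretools' actual deserialization code; the `extract_safely` and `build_demo_archive` helpers and the archive contents are hypothetical.

```python
import io
import os
import tarfile
import tempfile


def extract_safely(archive_path, dest_dir):
    """Extract a tar archive, applying the "data" filter when it is available."""
    with tarfile.open(archive_path) as tar:
        if hasattr(tarfile, "data_filter"):
            # The "data" filter rejects members that would land outside the
            # destination, links pointing outside it, and device files.
            tar.extractall(path=dest_dir, filter="data")
        else:
            # Python builds without extraction filters fall back to the
            # unfiltered behavior.
            tar.extractall(path=dest_dir)


def build_demo_archive(archive_path):
    """Write a small archive holding one harmless file (hypothetical content)."""
    payload = b'{"schema_version": "demo"}'
    info = tarfile.TarInfo(name="data_description.json")
    info.size = len(payload)
    with tarfile.open(archive_path, "w") as tar:
        tar.addfile(info, io.BytesIO(payload))


with tempfile.TemporaryDirectory() as tmp:
    archive = os.path.join(tmp, "entityset.tar")
    out_dir = os.path.join(tmp, "out")
    os.makedirs(out_dir)
    build_demo_archive(archive)
    extract_safely(archive, out_dir)
    extracted = sorted(os.listdir(out_dir))
```

The `hasattr` check keeps the sketch runnable on Python builds that predate extraction filters.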

featuretools - v1.30.0

Published by tamargrey 8 months ago

v1.30.0 Feb 26, 2024

  • Changes

    • Update min requirements for numpy, pandas and Woodwork (#2681)
    • Update release notes version for release (#2689)
  • Testing Changes

    • Update make_ecommerce_entityset to work without Dask (#2677)

Thanks to the following people for contributing to this release:
@tamargrey, @thehomebrewnerd

featuretools - v1.29.0

Published by tamargrey 8 months ago

v1.29.0 Feb 16, 2024

warning:

This release of Featuretools will not support Python 3.8

  • Fixes
    • Fix dependency issues (#2644, #2656)
    • Add workaround for pandas 2.2.0 bug with nunique and unpin pandas deps (#2657)
  • Changes
    • Fix deprecation warnings with is_categorical_dtype (#2641)
    • Remove woodwork, pyarrow, numpy, and pandas pins for spark installation (#2661)
  • Documentation Changes
    • Update Featuretools logo to display properly in dark mode (#2632)
    • Remove references to premium primitives while release isn't possible (#2674)
  • Testing Changes
    • Update tests for compatibility with new versions of holidays (#2636)
    • Update ruff to 0.1.6 and use ruff linter/formatter (#2639)
    • Update release.yaml to use trusted publisher for PyPI releases (#2646, #2653, #2654)
    • Update dependency checkers and tests to include Dask (#2658)
    • Fix the tests that run with Woodwork main so they can be triggered (#2657)
    • Fix minimum dependency checker action (#2664)
    • Fix Slack alert for tests with Woodwork main branch (#2668)

Thanks to the following people for contributing to this release:
@gsheni, @thehomebrewnerd, @tamargrey, @LakshmanKishore

featuretools - v1.28.0

Published by thehomebrewnerd 12 months ago

v1.28.0 Oct 26, 2023

  • Fixes
    • Fix bug with default value in PercentTrue primitive (#2627)
  • Changes
    • Refactor featuretools/tests/primitive_tests/utils.py to leverage list comprehensions for improved Pythonic quality (#2607)
    • Refactor can_stack_primitive_on_inputs (#2522)
    • Update s3 bucket for docs image (#2593)
    • Temporarily restrict pandas max version to <2.1.0 and pyarrow to <13.0.0 (#2609)
    • Update for compatibility with pandas version 2.1.0 and remove pandas upper version restriction (#2616)
  • Documentation Changes
    • Fix badge on README for tests (#2598)
    • Update readthedocs config to use build.os (#2601)
  • Testing Changes
    • Update airflow looking glass performance tests workflow (#2615)
    • Removed old performance testing workflow (#2620)

Thanks to the following people for contributing to this release:
@gsheni, @petejanuszewski1, @thehomebrewnerd, @tosemml

featuretools - v1.27.0

Published by thehomebrewnerd about 1 year ago

v1.27.0 Jul 24, 2023

  • Enhancements
    • Add support for Python 3.11 (#2583)
    • Add support for pandas v2.0 (#2585)
  • Changes
    • Remove natural language primitives add-on (#2570)
    • Updates to address various warnings (#2589)
  • Testing Changes
    • Run looking glass performance tests on merge via Airflow (#2575)

Thanks to the following people for contributing to this release:
@gsheni, @petejanuszewski1, @sbadithe, @thehomebrewnerd

featuretools - v1.26.0

Published by dvreed77 over 1 year ago

v1.26.0 Apr 27, 2023

  • Enhancements
    • Introduce New Single-Table DFS Algorithm (#2516). This includes experimental functionality and is not officially supported.
    • Add premium primitives install command (#2545)
  • Fixes
    • Fix Description of DaysInMonth (#2547)
  • Changes
    • Make Dask an optional dependency (#2560)

Thanks to the following people for contributing to this release:
@dvreed77, @gsheni, @thehomebrewnerd

featuretools - v1.25.0

Published by gsheni over 1 year ago

v1.25.0 Apr 13, 2023

  • Enhancements

    • Add MaxCount, MedianCount, MaxMinDelta, NUniqueDays, NMostCommonFrequency,
      NUniqueDaysOfCalendarYear, NUniqueDaysOfMonth, NUniqueMonths,
      NUniqueWeeks, IsFirstWeekOfMonth (#2533)
    • Add HasNoDuplicates, NthWeekOfMonth, IsMonotonicallyDecreasing, IsMonotonicallyIncreasing,
      IsUnique (#2537)
  • Changes

    • Restrict pandas to < 2.0.0 (#2533)
    • Upgrade minimum pandas to 1.5.0 (#2537)
    • Removed the Correlation and AutoCorrelation primitives as these could lead to data leakage (#2537)
    • Remove IntegerNullable support for Kurtosis primitive (#2537)

Thanks to the following people for contributing to this release:
@gsheni

featuretools - v1.24.0

Published by ozzieD over 1 year ago

v1.24.0 Mar 29, 2023

  • Enhancements
    • Add AverageCountPerUnique, CountryCodeToContinent, FileExtension, FirstLastTimeDelta, SavgolFilter, CumulativeTimeSinceLastFalse, CumulativeTimeSinceLastTrue, PercentChange, PercentUnique (#2485)
    • Add FullNameToFirstName, FullNameToLastName, FullNameToTitle, AutoCorrelation, Correlation, DateFirstEvent (#2526)
    • Add Kurtosis, MinCount, NumFalseSinceLastTrue, NumPeaks, NumTrueSinceLastFalse, NumZeroCrossings (#2514)
  • Fixes
    • Pin github-action-check-linked-issues to 1.4.5 (#2497)
    • Support Woodwork's updated numeric inference (integers as strings) (#2505)
    • Update SubtractNumeric Primitive with commutative class property (#2527)
  • Changes
    • Separate Makefile command for core requirements, test requirements and dev requirements (#2518)

Thanks to the following people for contributing to this release:
@dvreed77, @gsheni, @ozzieD

featuretools - v1.23.0

Published by dvreed77 over 1 year ago

v1.23.0 Feb 15, 2023

  • Changes
    • Change TotalWordLength and UpperCaseWordCount to return IntegerNullable (#2474)
  • Testing Changes
    • Add GitHub Actions cache to speed up workflows (#2475)
    • Fix latest dependency checker install command (#2476)
    • Add pull request check for linked issues to CI workflow (#2477, #2481)
    • Remove make package from lint workflow (#2479)

Thanks to the following people for contributing to this release:
@dvreed77, @gsheni, @sbadithe

featuretools - v1.22.0

Published by sbadithe over 1 year ago

  • Enhancements

    • Add AbsoluteDiff, SameAsPrevious, Variance, Season, UpperCaseWordCount transform primitives (#2460)
  • Fixes

    • Fix bug with consecutive spaces in NumWords (#2459)
    • Fix for compatibility with holidays v0.19.0 (#2471)
  • Changes

    • Specify black and ruff config arguments in pre-commit-config (#2456)
    • NumCharacters returns null given null input (#2463)
  • Documentation Changes

    • Update release.md with instructions for launching Looking Glass performance test runs (#2461)
    • Pin jupyter-client==7.4.9 to fix broken documentation build (#2463)
    • Unpin jupyter-client documentation requirement (#2468)
  • Testing Changes

    • Add test suites for NumWords and NumCharacters primitives (#2459, #2463)

Thanks to the following people for contributing to this release:
@gsheni, @rwedge, @sbadithe, @thehomebrewnerd
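
The NumWords fix (#2459) concerns inputs with consecutive spaces. The sketch below only illustrates how that kind of bug can arise from naive splitting. It is an assumption made for illustration, not the primitive's implementation, and both helper names are hypothetical.

```python
def count_words_naive(text):
    # Splitting on a single space yields empty strings between consecutive
    # spaces, and those empty strings get counted as words.
    return len(text.split(" "))


def count_words_whitespace(text):
    # str.split() with no argument collapses runs of whitespace.
    return len(text.split())


sample = "automated  feature   engineering"  # two spaces, then three
naive = count_words_naive(sample)
robust = count_words_whitespace(sample)
```

Here the naive count includes the empty strings produced by the extra spaces, while the whitespace-aware count reflects only the three real words.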

featuretools - v1.21.0

Published by ozzieD over 1 year ago

v1.21.0 Jan 18, 2023

  • Enhancements
    • Add get_recommended_primitives function to featuretools (#2398)
  • Changes
    • Update build_docs workflow to only run for Python 3.8 and Python 3.10 (#2447)
  • Documentation Changes
    • Minor fix to release notes (#2444)
  • Testing Changes
    • Add test that checks for Natural Language primitives timing out against edge-case input (#2429)
    • Fix test compatibility with composeml 0.10 (#2439)
    • Minimum dependency unit test jobs do not abort if one job fails (#2437)
    • Run Looking Glass performance tests on merge to main (#2440, #2441)
    • Add ruff for linting and replace isort/flake8 (#2448)

Thanks to the following people for contributing to this release:
@gsheni, @ozzieD, @rwedge, @sbadithe, @thehomebrewnerd

featuretools - v1.20.0

Published by thehomebrewnerd almost 2 years ago

v1.20.0 Jan 5, 2023

  • Enhancements
    • Add TimeSinceLastFalse, TimeSinceLastMax, TimeSinceLastMin, and TimeSinceLastTrue primitives (#2418)
    • Add MaxConsecutiveFalse, MaxConsecutiveNegatives, MaxConsecutivePositives, MaxConsecutiveTrue, MaxConsecutiveZeros, NumConsecutiveGreaterMean, NumConsecutiveLessMean (#2420)
  • Fixes
    • Fix typo in _handle_binary_comparison function name and update set_feature_names docstring (#2388)
    • Only allow Datetime time index as input to RateOfChange primitive (#2408)
    • Prevent catastrophic backtracking in regex for NumberOfWordsInQuotes (#2413)
    • Fix to eliminate fragmentation PerformanceWarning in feature_set_calculator.py (#2424)
    • Fix serialization of NumberOfCommonWords feature with custom word_set (#2432)
    • Improve edge case handling in NaturalLanguage primitives by standardizing delimiter regex (#2423)
    • Remove support for Datetime and Ordinal inputs in several primitives to prevent creation of Features that cannot be calculated (#2434)
  • Changes
    • Refactor _all_direct_and_same_path by deleting call to _features_have_same_path (#2400)
    • Refactor _build_transform_features by iterating over input_features once (#2400)
    • Iterate only once over ignore_columns in DeepFeatureSynthesis init (#2397)
    • Resolve empty Pandas series warnings (#2403)
    • Initialize Woodwork with init_with_partial_schema instead of init in EntitySet.add_last_time_indexes (#2409)
    • Updates for compatibility with numpy 1.24.0 (#2414)
    • The delimiter_regex parameter for TotalWordLength has been renamed to do_not_count (#2423)
  • Documentation Changes
    • Remove unused sections from 1.19.0 notes (#2396)

Thanks to the following people for contributing to this release:
@gsheni, @rwedge, @sbadithe, @thehomebrewnerd
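
The NumberOfWordsInQuotes fix (#2413) addresses catastrophic backtracking. The sketch below shows the general problem using Python's `re` module. Both patterns and the `words_in_quotes` helper are hypothetical illustrations, not the regexes used in Featuretools.

```python
import re

# Backtracking-prone: the nested quantifiers in (?:\w+\s*)+ let the engine split
# a run of word characters in exponentially many ways when no closing quote follows.
PRONE = re.compile(r'"(?:\w+\s*)+"')

# Safer: a negated character class consumes each character exactly once.
SAFER = re.compile(r'"[^"]*"')


def words_in_quotes(text, pattern=SAFER):
    """Count whitespace-separated words inside double-quoted spans."""
    return sum(len(m.group(0).strip('"').split()) for m in pattern.finditer(text))


text = 'She said "hello there" and then "bye"'
safe_count = words_in_quotes(text)
prone_count = words_in_quotes(text, PRONE)

# An unterminated quote is where the prone pattern degrades. The input is kept
# short here so the example still finishes quickly.
unterminated = '"' + "a" * 12
no_match = PRONE.search(unterminated) is None
```

On well-formed input both patterns agree. The difference only shows up in running time as the unterminated input grows.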

Breaking Changes

  • The delimiter_regex parameter for TotalWordLength has been renamed to do_not_count.
    Old saved features that had a non-default value for the parameter will no longer load.
  • Support for Datetime and Ordinal inputs has been removed from the LessThanScalar,
    GreaterThanScalar, LessThanEqualToScalar and GreaterThanEqualToScalar primitives.

featuretools - v1.19.0

Published by thehomebrewnerd almost 2 years ago

v1.19.0 Dec 9, 2022

  • Enhancements
    • Add OneDigitPostalCode and TwoDigitPostalCode primitives (#2365)
    • Add ExpandingCount, ExpandingMin, ExpandingMean, ExpandingMax, ExpandingSTD and ExpandingTrend primitives (#2343)
  • Fixes
    • Fix DeepFeatureSynthesis to consider the base_of_exclude family of attributes when creating transform features (#2380)
    • Fix bug with negative version numbers in test_version (#2389)
    • Fix bug in MultiplyNumericBoolean primitive that can cause an error with certain input dtype combinations (#2393)
  • Testing Changes
    • Fix version comparison in test_holiday_out_of_range (#2382)

Thanks to the following people for contributing to this release:
@sbadithe, @thehomebrewnerd

featuretools - v1.18.0

Published by gsheni almost 2 years ago

v1.18.0 Nov 15, 2022

  • Enhancements
    • Add RollingOutlierCount primitive (#2129)
    • Add RateOfChange primitive (#2359)
  • Fixes
    • Sets uses_full_dataframe for Rolling* and Exponential* primitives (#2354)
    • Updates for compatibility with upcoming Woodwork release 0.21.0 (#2363)
    • Updates demo dataset location to use new links (#2366)
    • Fix test_holiday_out_of_range after holidays release 0.17 (#2373)
  • Changes
    • Remove click and CLI functions (list-primitives, info) (#2353, #2358)
  • Documentation Changes
    • Build docs in parallel with Sphinx (#2351)
    • Use non-editable install to allow local docs build (#2367)
    • Remove primitives.featurelabs.com website from documentation (#2369)
  • Testing Changes
    • Replace use of pytest's tmpdir fixture with tmp_path (#2344)

Thanks to the following people for contributing to this release:
@gsheni, @rwedge, @sbadithe, @tamargrey, @thehomebrewnerd

featuretools - v1.17.0

Published by gsheni almost 2 years ago

v1.17.0 Oct 31, 2022

  • Enhancements

    • Add featuretools-sklearn-transformer as an extra installation option (#2335)
    • Add CountAboveMean, CountBelowMean, CountGreaterThan, CountInsideNthSTD, CountInsideRange, CountLessThan, CountOutsideNthSTD, CountOutsideRange (#2336)
  • Changes

    • Restructure primitives directory to use individual primitives files (#2331)
    • Restrict 2022.10.1 for dask and distributed (#2347)
  • Documentation Changes

    • Add Featuretools-SQL to Install page on documentation (#2337)
    • Fixes broken link in Featuretools documentation (#2339)

Thanks to the following people for contributing to this release:
@gsheni, @rwedge, @sbadithe, @thehomebrewnerd

featuretools - v1.16.0

Published by sbadithe almost 2 years ago

  • Enhancements
    • Add ExponentialWeighted primitives and DateToTimeZone primitive (#2318)
    • Add 14 natural language primitives from nlp_primitives library (#2328)
  • Documentation Changes
    • Fix typos in aggregation_primitive_base.py and features_deserializer.py (#2317, #2324)
    • Update SQL integration documentation to reflect Snowflake compatibility (#2313)
  • Testing Changes
    • Add Windows install test (#2330)

Thanks to the following people for contributing to this release:
@gsheni, @sbadithe, @thehomebrewnerd

featuretools - v1.15.0

Published by rwedge about 2 years ago

v1.15.0 Oct 6, 2022

  • Enhancements
    • Add series_library attribute to EntitySet dictionary (#2257)
    • Leverage Library Enum inheriting from str (#2275)
  • Changes
    • Change default gap for Rolling* primitives from 0 to 1 to prevent accidental leakage (#2282)
    • Updates for pandas 1.5.0 compatibility (#2290, #2291, #2308)
    • Exclude documentation files from release workflow (#2295)
    • Bump requirements for optional pyspark dependency (#2299)
    • Bump scipy and woodwork[spark] dependencies (#2306)
  • Documentation Changes
    • Add documentation describing how to use featuretools_sql with featuretools (#2262)
    • Remove featuretools_sql as a docs requirement (#2302)
    • Fix typo in DiffDatetime doctest (#2314)
    • Fix typo in EntitySet documentation (#2315)
  • Testing Changes
    • Remove graphviz version restrictions in Windows CI tests (#2285)
    • Run CI tests with pytest -n auto (#2298, #2310)

Thanks to the following people for contributing to this release:
@gsheni, @rwedge, @sbadithe, @thehomebrewnerd
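
The "Library Enum inheriting from str" change (#2275) follows a common Python pattern. A minimal sketch of that pattern is shown here; the member names, values, and the `series_library` dictionary key are assumptions for illustration and may not match Featuretools' definitions.

```python
import json
from enum import Enum


class Library(str, Enum):
    # Hypothetical members, chosen for illustration only.
    PANDAS = "pandas"
    DASK = "dask"
    SPARK = "spark"


# Members are also str instances, so they compare equal to plain strings.
same = Library.PANDAS == "pandas"
is_str = isinstance(Library.SPARK, str)

# Lookup by value returns the corresponding member.
looked_up = Library("dask")

# A str-based member serializes as its string value.
serialized = json.dumps({"series_library": Library.PANDAS})
```

Mixing in `str` means code that previously compared against raw strings can keep working while gaining the named members.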

Breaking Changes

  • The EntitySet schema has been updated to include a series_library attribute
  • The default behavior of the Rolling* primitives has changed in this release. If this primitive was used without
    defining the gap value, the feature values returned with this release will be different than feature values from
    prior releases.
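
To see why the gap default matters, here is a rough pure-Python sketch of a row-based rolling mean with a gap. It is not the Featuretools implementation, and `rolling_mean_with_gap` and its handling of partial windows are hypothetical simplifications.

```python
def rolling_mean_with_gap(values, window_length, gap):
    """Mean of up to `window_length` rows, ending `gap` rows before each row."""
    results = []
    for i in range(len(values)):
        end = i - gap + 1  # exclusive end of the window
        start = max(end - window_length, 0)
        window = values[start:end] if end > 0 else []
        results.append(sum(window) / len(window) if window else None)
    return results


target = [10, 20, 30, 40]

# gap=0: each window includes the current row, so the feature sees the value
# it may later be asked to predict.
with_gap_0 = rolling_mean_with_gap(target, window_length=2, gap=0)
# → [10.0, 15.0, 25.0, 35.0]

# gap=1: each window stops one row earlier and excludes the current row.
with_gap_1 = rolling_mean_with_gap(target, window_length=2, gap=1)
# → [None, 10.0, 15.0, 25.0]
```

The two outputs differ at every row, which is why features computed without an explicit gap change between releases.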

featuretools - v1.15.0.dev0

Published by rwedge about 2 years ago

Developmental release for testing purposes

featuretools - v1.14.0

Published by stefaniesmith about 2 years ago

v1.14.0 Sep 1, 2022

  • Enhancements
    • Replace NumericLag with Lag primitive (#2252)
    • Refactor build_features to speed up long running DFS calls by 50% (#2224)
  • Fixes
    • Fix compatibility issues with holidays 0.15 (#2254)
  • Changes
    • Update release notes to make clear conda release portion (#2249)
    • Use pyproject.toml only (move away from setup.cfg) (#2260, #2263, #2265)
    • Add entry point instructions for pyproject.toml project (#2272)
  • Documentation Changes
    • Fix to remove warning from Using Spark EntitySets Guide (#2258)
  • Testing Changes
    • Add tests/profiling/dfs_profile.py (#2224)
    • Add workflow to test featuretools without test dependencies (#2274)

Thanks to the following people for contributing to this release:
@cp2boston, @gsheni, @ozzieD, @stefaniesmith, @thehomebrewnerd

featuretools - v1.13.0

Published by ozzieD about 2 years ago

v1.13.0 Aug 18, 2022

  • Fixes
    • Allow boolean columns to be included in remove_highly_correlated_features (#2231)
  • Changes
    • Refactor schema version checking to use packaging method (#2230)
    • Extract duplicated logic for Rolling primitives into a general utility function (#2218)
    • Set pandas version to >=1.4.0 (#2246)
    • Remove workaround in roll_series_with_gap caused by pandas version < 1.4.0 (#2246)
  • Documentation Changes
    • Add line breaks between sections of IsFederalHoliday primitive docstring (#2235)
  • Testing Changes
    • Update create feedstock PR forked repo to use (#2223, #2237)
    • Update development requirements and use latest for documentation (#2225)

Thanks to the following people for contributing to this release:
@gsheni, @ozzieD, @sbadithe, @tamargrey

Package Rankings
Top 19.62% on Anaconda.org
Top 0.99% on Pypi.org
Top 5.19% on Proxy.golang.org
Top 8.11% on Conda-forge.org