Skip to content

Releases: alteryx/featuretools

v0.12.0

31 Oct 22:32
ace9181
Compare
Choose a tag to compare

v0.12.0 Oct 31, 2019

  • Enhancements
    • Added First primitive (#770)
    • Added Entropy aggregation primitive (#779)
    • Allow custom naming for multi-output primitives (#780)
  • Fixes
    • Prevents user from removing base entity time index using additional_variables (#768)
    • Fixes error when a multioutput primitive was supplied to dfs as a groupby trans primitive (#786)
  • Changes
    • Drop Python 2 support (#759)
    • Add unit parameter to AvgTimeBetween (#771)
    • Require Pandas 0.24.1 or higher (#787)
  • Documentation Changes
    • Update featuretools slack link (#765)
    • Set up repo to use Read the Docs (#776)
    • Add First primitive to API reference docs (#782)
  • Testing Changes
    • CircleCI fixes (#774)
    • Disable PIP progress bars (#775)

Thanks to the following people for contributing to this release:
@ablacke-ayx, @BoopBoopBeepBoop, @jeffzi, @kmax12, @rwedge, @thehomebrewnerd, @twdobson

v0.4.1

17 Oct 20:57
41689ce
Compare
Choose a tag to compare

v0.4.1 Nov 29, 2018

  • Resolve bug preventing using first column as index by default (#308)
  • Handle return type when creating features from Id variables (#318)
  • Make id an optional parameter of EntitySet constructor (#324)
  • Handle primitives with same function being applied to same column (#321)
  • Update requirements (#328)
  • Clean up DFS arguments (#319)
  • Clean up Pandas Backend (#302)
  • Update properties of cumulative transform primitives (#320)
  • Feature stability between versions documentation (#316)
  • Add download count to GitHub readme (#310)
  • Fixed #297 update tests to check error strings (#303)
  • Remove usage of fixtures in agg primitive tests (#325)

v0.11.0

30 Sep 22:13
b4586c3
Compare
Choose a tag to compare

v0.11.0 Sep 30, 2019

  • Enhancements
    • Improve how files are copied and written (#721)
    • Add number of rows to graph in entityset.plot (#727)
    • Added support for pandas DateOffsets in DFS and Timedelta (#732)
    • Enable feature-specific top_n value using a dictionary in encode_features (#735)
    • Added progress_callback parameter to dfs() and calculate_feature_matrix() (#739 #745)
    • Enable specifying primitives on a per column or per entity basis (#748)
  • Fixes
    • Fixed entity set deserialization (#720)
    • Added error message when DateTimeIndex is a variable but not set as the time_index (#723)
    • Fixed CumCount and other group-by transform primitives that take ID as input (#733, #754)
    • Fix progress bar undercounting (#743)
  • Updated training_window error assertion to only check against observations (#728)
    • Don't delete the whole destination folder while saving entityset (#717)
  • Changes
    • Raise warning and not error on schema version mismatch (#718)
    • Change feature calculation to return in order of instance ids provided (#676)
    • Removed time remaining from displayed progress bar in dfs() and calculate_feature_matrix() (#739)
    • Raise warning in normalize_entity() when time_index of base_entity has an invalid type (#749)
    • Remove toolz as a direct dependency (#755)
    • Allow boolean variable types to be used in the Multiply primitive (#756)
  • Documentation Changes
    • Updated URL for Compose (#716)
  • Testing Changes

Thanks to the following people for contributing to this release: @angela97lin, @chidauri, @christopherbunn, @frances-h, @jeff-hernandez, @kmax12, @MarcoGorelli, @rwedge, @thehomebrewnerd

v0.10.1

25 Aug 23:50
1fd1df0
Compare
Choose a tag to compare

v0.10.1 Aug 25, 2019

  • Fixes
    • Fix serialized LatLong data being loaded as strings (#712)
  • Documentation Changes
    • Fixed FAQ cell output (#710)

Thanks to the following people for contributing to this release:
@gsheni, @rwedge

v0.10.0

19 Aug 23:11
4bcb4ad
Compare
Choose a tag to compare

v0.10.0 Aug 19, 2019

The next non-bugfix release of Featuretools will not support Python 2

  • Enhancements
    • Give more frequent progress bar updates and update chunk size behavior (#631, #696)
    • Added drop_first as param in encode_features (#647)
    • Added support for stacking multi-output primitives (#679)
    • Generate transform features of direct features (#623)
    • Added serializing and deserializing from S3 and deserializing from URLs (#685)
    • Added nlp_primitives as an add-on library (#704)
    • Added AutoNormalize to Featuretools plugins (#699)
    • Added functionality for relative units (month/year) in Timedelta (#692)
    • Added categorical-encoding as an add-on library (#700)
  • Fixes
    • Fix performance regression in DFS (#637)
    • Fix deserialization of feature relationship path (#665)
    • Set index after adding ancestor relationship variables (#668)
    • Fix user-supplied variable_types modification in Entity init (#675)
    • Don't calculate dependencies of unnecessary features (#667)
    • Prevent normalize entity's new entity having same index as base entity (#681)
    • Update variable type inference to better check for string values (#683)
  • Changes
    • Moved dask, distributed imports (#634)
  • Documentation Changes
    • Miscellaneous changes (#641, #658)
    • Modified doc_string of top_n in encoding (#648)
    • Hyperlinked ComposeML (#653)
    • Added FAQ (#620, #677)
    • Fixed FAQ question with multiple question marks (#673)
  • Testing Changes
    • Add master, and release tests for premium primitives (#660, #669)
    • Miscellaneous changes (#672, #674)

Thanks to the following people for contributing to this release:
@alexjwang, @allisonportis, @ayushpatidar, @CJStadler, @ctduffy, @gsheni, @jeff-hernandez, @jeremyliweishih, @kmax12, @rwedge, @zhxt95

v0.9.1

03 Jul 21:18
f7adfde
Compare
Choose a tag to compare

v0.9.1 July 3, 2019

  • Enhancements
    • Speedup groupby transform calculations (#609)
    • Generate features along all paths when there are multiple paths between entities (#600, #608)
  • Fixes
    • Select columns of dataframe using a list (#615)
    • Change type of features calculated on Index features to Categorical (#602)
    • Filter dataframes through forward relationships (#625)
    • Specify Dask version in requirements for python 2 (#627)
    • Keep dataframe sorted by time during feature calculation (#626)
    • Fix bug in encode_features that created duplicate columns of
      features with multiple outputs (#622)
  • Changes
    • Remove unused variance_selection.py file (#613)
    • Remove Timedelta data param (#619)
    • Remove DaysSince primitive (#628)
  • Documentation Changes
    • Add installation instructions for add-on libraries (#617)
    • Miscellaneous changes (#632, #639)
  • Testing Changes

Thanks to the following people for contributing to this release: @CJStadler, @gsheni, @kkleidal, @kmax12, @rwedge

v0.9.0

19 Jun 19:14
d18dfaa
Compare
Choose a tag to compare

v0.9.0 June 19, 2019

  • Enhancements
    • Add unit parameter to timesince primitives (#558)
    • Add ability to install optional add on libraries (#551)
    • Load and save features from open files and strings (#566)
    • Support custom variable types (#571)
    • Support entitysets which have multiple paths between two entities (#572, #544)
    • Added show_info function, more output information added to CLI featuretools info (#525)
  • Fixes
    • Normalize_entity specifies error when 'make_time_index' is an invalid string (#550)
    • Schema version added for entityset serialization (#586)
    • Renamed features have names correctly serialized (#585)
    • Improved error message for index/time_index being the same column in normalize_entity and entity_from_dataframe (#583)
    • Removed all mentions of allow_where (#587, #588)
    • Removed unused variable in normalize entity (#589)
    • Change time since return type to numeric (#606)
  • Changes
    • Refactor get_pandas_data_slice to take single entity (#547)
    • Updates TimeSincePrevious and Diff Primitives (#561)
    • Remove unecessary time_last variable (#546)
  • Documentation Changes
    • Add Featuretools Enterprise to documentation (#563)
    • Miscellaneous changes (#552, #573, #577, #599)
  • Testing Changes

Thanks to the following people for contributing to this release:
@alexjwang, @allisonportis, @CJStadler, @ctduffy, @gsheni, @kmax12, @rwedge

v0.8.0

18 May 00:29
2546a68
Compare
Choose a tag to compare

v0.8.0 May 17, 2019

  • Rename NUnique to NumUnique (#510)
  • Serialize features as JSON (#532)
  • Drop all variables at once in normalize_entity (#533)
  • Remove unnecessary sorting from normalize_entity (#535)
  • Features cache their names (#536)
  • Only calculate features for instances before cutoff (#523)
  • Remove all relative imports (#530)
  • Added FullName Variable Type (#506)
  • Add error message when target entity does not exist (#520)
  • New demo links (#542)
  • Remove duplicate features check in DFS (#538)
  • featuretools_primitives entry point expects list of primitive classes (#529)
  • Update ALL_VARIABLE_TYPES list (#526)
  • More Informative N Jobs Prints and Warnings (#511)
  • Update sklearn version requirements (#541)
  • Update Makefile (#519)
  • Remove unused parameter in Entity._handle_time (#524)
  • Remove build_ext code from setup.py (#513)
  • Documentation updates (#512, #514, #515, #521, #522, #527, #545)
  • Testing updates (#509, #516, #517, #539)

Thanks to the following people for contributing to this release: @bphi, @CharlesBradshaw, @CJStadler, @glentennis, @gsheni, @kmax12, @rwedge

v0.7.1

24 Apr 15:51
6ed059c
Compare
Choose a tag to compare

v0.7.1 Apr 24, 2019

  • Automatically generate feature name for controllable primitives (#481)
  • Primitive docstring updates (#489, #492, #494, #495)
  • Change primitive functions that returned strings to return functions (#499)
  • CLI customizable via entrypoints (#493)
  • Improve calculation of aggregation features on grandchildren (#479)
  • Refactor entrypoints to use decorator (#483)
  • Include doctests in testing suite (#491)
  • Documentation updates (#490)
  • Update how standard primitives are imported internally (#482)

Thanks to the following people for contributing to this release: @bukosabino, @CharlesBradshaw, @glentennis, @gsheni, @jeff-hernandez, @kmax12, @minkvsky, @rwedge, @thehomebrewnerd

v0.7.0

29 Mar 17:19
c30d842
Compare
Choose a tag to compare

v0.7.0 Mar 29, 2019

  • Improve Entity Set Serialization (#361)
  • Support calling a primitive instance's function directly (#461, #468)
  • Support other libraries extending featuretools functionality via entrypoints (#452)
  • Remove featuretools install command (#475)
  • Add GroupByTransformFeature (#455, #472, #476)
  • Update Haversine Primitive (#435, #462)
  • Add commutative argument to SubtractNumeric and DivideNumeric primitives (#457)
  • Add FilePath variable_type (#470)
  • Add PhoneNumber, DateOfBirth, URL variable types (#447)
  • Generalize infer_variable_type, convert_variable_data and convert_all_variable_data methods (#423)
  • Documentation updates (#438, #446, #458, #469)
  • Testing updates (#440, #444, #445, #459)

Thanks to the following people for contributing to this release: @bukosabino, @CharlesBradshaw, @ColCarroll, @glentennis, @grayskripko, @gsheni, @jeff-hernandez, @jrkinley, @kmax12, @RogerTangos, @rwedge

Breaking Changes

  • ft.dfs now has a groupby_trans_primitives parameter that DFS uses to automatically construct features that group by an ID column and then apply a transform primitive to search group. This change applies to the following primitives: CumSum, CumCount, CumMean, CumMin, and CumMax.

    Previous behavior

    .. code-block:: python

      ft.dfs(entityset=es,
             target_entity='customers',
             trans_primitives=["cum_mean"])
    

    New behavior

    .. code-block:: python

      ft.dfs(entityset=es,
             target_entity='customers',
             groupby_trans_primitives=["cum_mean"])
    
  • Related to the above change, cumulative transform features are now defined using a new feature class, GroupByTransformFeature.

    Previous behavior

    .. code-block:: python

      ft.Feature([base_feature, groupby_feature], primitive=CumulativePrimitive)
    

    New behavior

    .. code-block:: python

      ft.Feature(base_feature, groupby=groupby_feature, primitive=CumulativePrimitive)
    

Summary: