Un-pandas `GroupedTimeSeriesSplit` #605

FBruzzesi · 2023-12-29T11:55:32Z

Description

In #604 there was a tiny hint at this 😂

In this PR I tried to remove the use of pandas from GroupedTimeSeriesSplit by moving on numpy backend. The only method that still needs a "dataframe"-like object is .summary() (which it's even a nice to have).

In a way this is an alternative way to achieve #597 when a dataframe is just a nice abstraction but not mandatory. The only function that has to work to proceed forward and run .split() method is indexable (from sklearn.utils.validation).

FBruzzesi · 2023-12-29T11:56:23Z

sklego/model_selection.py

        for i in range(self.n_splits):
-            yield np.where(groups == i)[0], np.where(groups == i + 1)[0]
+            yield group_indices[i], group_indices[i + 1]

    def _calc_first_and_last_split_index(self, X=None, y=None, groups=None):


In a way, the changes in _calc_first_and_last_split_index are the only hard core difference

FBruzzesi · 2024-06-18T11:10:15Z

Following all the effort put into adopting Narwhals, it would be nice to revamp this as well. I will close the PR and come back to the topic!

unpandas grouped ts split

83cfae7

FBruzzesi commented Dec 29, 2023

View reviewed changes

Merge branch 'main' into feature/unpandas-model-selection

0dff45c

FBruzzesi mentioned this pull request May 7, 2024

[FEATURE] Narwhals migration for dataframe-agnostic codebase #658

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Un-pandas `GroupedTimeSeriesSplit` #605

Un-pandas `GroupedTimeSeriesSplit` #605

FBruzzesi commented Dec 29, 2023

FBruzzesi Dec 29, 2023

FBruzzesi commented Jun 18, 2024

Un-pandas GroupedTimeSeriesSplit #605

Are you sure you want to change the base?

Un-pandas GroupedTimeSeriesSplit #605

Conversation

FBruzzesi commented Dec 29, 2023

Description

FBruzzesi Dec 29, 2023

Choose a reason for hiding this comment

FBruzzesi commented Jun 18, 2024

Un-pandas `GroupedTimeSeriesSplit` #605

Un-pandas `GroupedTimeSeriesSplit` #605