new cluster test API #12663

CarinaFo · 2024-06-14T12:23:46Z

set up new cluster_test api that sets up design matrix based on Wilkinson formula (using formulaic package)

currently unable to continue working on this project due to moving continent (soon based in Australia), estimated back to normal by end of November

for more information, see https://pre-commit.ci

…inaFo/mne-python into new_cluster_stats_api_GSOC24

for more information, see https://pre-commit.ci

…inaFo/mne-python into new_cluster_stats_api_GSOC24

for more information, see https://pre-commit.ci

drammock

just a few comments to get you rolling. Maybe ping us when the formulaic bit is added? (or sooner if you have any questions in the meantime, of course!)

tutorials/stats-sensor-space/76_new_cluster_test_api.py

hoechenberger · 2024-06-15T15:54:17Z

This might be related:

mne-tools/mne-incubator#31

cc @SophieHerbst

cbrnr · 2024-06-19T13:11:45Z

I think it is a great idea to revamp the current cluster permutation test API 🚀! Could you please share the reasoning behind choosing a pandas DataFrame as the container for evokeds and related meta info? If possible, I'd try to avoid using pandas (an optional dependency) when something similar could be achieved using, for example, a simple dictionary.

CarinaFo · 2024-06-19T16:17:41Z

@cbrnr I think the main reason for a dataframe instead of a dictionary is that formulaic only allows for dataframes as input and we want to include Wilkinson formula support in the new cluster_test API.

hoechenberger · 2024-06-19T16:26:59Z

@cbrnr I think the main reason for a dataframe instead of a dictionary is that formulaic only allows for dataframes as input and we want to include Wilkinson formula support in the new cluster_test API.

But this doesn't need to be user-facing, then, no? Just trying to understand. If a user passes in a list of TypedDicts or Dataclasses, you can internally create the DataFrames that need to be passed to formulaic. The user would then also get tab-completion assistance in their editor. But it's just a thought. Great work so far in any case!

drammock · 2024-06-19T16:31:39Z

you can internally create the DataFrames that need to be passed to formulaic

This would still require pandas to be available though.

hoechenberger · 2024-06-19T16:35:35Z

you can internally create the DataFrames that need to be passed to formulaic

This would still require pandas to be available though.

Sure
But it would provide a potentially more user-friendly API

for more information, see https://pre-commit.ci

drammock · 2024-06-19T19:08:01Z

you can internally create the DataFrames that need to be passed to formulaic

This would still require pandas to be available though.

Sure But it would provide a potentially more user-friendly API

For context, this is step 1 of a GSoC project. A later step involves (probably) creating helper functions that will create the necessary DataFrame for the user. That's not done here because:

there are probably complicated use cases / designs that we won't foresee, so we want it to be possible for the user to pass in their own custom dataframe instead of our internally-generated one.
we're actually not sure how helpful the helper function will be (at least in some cases): if what you need to pass in is a set of matched lists of subject IDs, evoked objects, and condition names, well then you might as well just call DataFrame(dict(subj=subj_list, cond=cond_list, data=evk_list)) yourself instead of calling the helper function. A helper function makes much more sense in other cases, like when dealing with Epochs objects where the conditions are intermixed within one object.

larsoner · 2024-06-25T16:46:49Z

@CarinaFo I pushed a commit to add the dataset and add formulaic to our full and doc dependencies so that eventually CIs can use them properly. I added the tags [skip azp] [skip actions] to skip running those CIs. Feel free to git pull the changes back to your local machine!

If you look at the CI runs, you can see that CircleCI, which builds modified examples/tutorials in PRs, hit an error up on 9c8ec90:

sphinx.errors.ExtensionError: Could not find docstring in file "/home/circleci/project/tutorials/stats-sensor-space/76_new_cluster_test_api.py". A docstring is required by sphinx-gallery unless the file is ignored by "ignore_pattern"

Then I pushed a little commit to fix that in 47363b5, and now it hits a different error (which replicates what I saw locally when I tried to run the example):

../tutorials/stats-sensor-space/76_new_cluster_test_api.py unexpectedly failed to execute correctly:

    Traceback (most recent call last):
      File "/home/circleci/project/tutorials/stats-sensor-space/76_new_cluster_test_api.py", line 449, in <module>
        df_long = convert_wide_to_long(df)
      File "/home/circleci/project/tutorials/stats-sensor-space/76_new_cluster_test_api.py", line 431, in convert_wide_to_long
        data_2d = row["data"]
      File "/home/circleci/python_env/lib/python3.10/site-packages/pandas/core/series.py", line 1121, in __getitem__
        return self._get_value(key)
      File "/home/circleci/python_env/lib/python3.10/site-packages/pandas/core/series.py", line 1237, in _get_value
        loc = self.index.get_loc(label)
      File "/home/circleci/python_env/lib/python3.10/site-packages/pandas/core/indexes/base.py", line 3812, in get_loc
        raise KeyError(key) from err
    KeyError: 'data'

It would be good to keep CircleCI green at least so we can see renderings of the tutorial with each push. If you make sure python -i tutorials/stats-sensor-space/76_new_cluster_test_api.py runs cleanly locally each time before you push then it should work on CircleCI now as well!

drammock · 2024-08-22T20:22:28Z

@CarinaFo FYI I needed to do a rebase and force-push in order to get the CIs to run again, as there was a merge conflict. You'll want to recreate your local branch. Ping me if you have questions / challenges (e.g. if you had local work you don't want to lose) and we can work through it together.

…inaFo/mne-python into new_cluster_stats_api_GSOC24

CarinaFo and others added 8 commits June 14, 2024 14:22

added cluster test api, first commit

62daaf0

[pre-commit.ci] auto fixes from pre-commit.com hooks

d59978f

for more information, see https://pre-commit.ci

tested dataframe function and results, cleaned up

2843905

Merge branch 'new_cluster_stats_api_GSOC24' of https://github.com/Car…

1985da3

…inaFo/mne-python into new_cluster_stats_api_GSOC24

added ToDos

fa5b215

[pre-commit.ci] auto fixes from pre-commit.com hooks

1a1511d

for more information, see https://pre-commit.ci

Merge branch 'new_cluster_stats_api_GSOC24' of https://github.com/Car…

0ea220c

…inaFo/mne-python into new_cluster_stats_api_GSOC24

[pre-commit.ci] auto fixes from pre-commit.com hooks

a12cf95

for more information, see https://pre-commit.ci

drammock reviewed Jun 14, 2024

View reviewed changes

Merge branch 'mne-tools:main' into new_cluster_stats_api_GSOC24

3c5d4f1

CarinaFo and others added 2 commits June 19, 2024 19:28

added formula support and implemented suggestions

45ce63a

[pre-commit.ci] auto fixes from pre-commit.com hooks

2b7bae8

for more information, see https://pre-commit.ci

CarinaFo and others added 4 commits June 22, 2024 11:10

fixed linting errors

38834ba

ENH: Add dataset [skip azp] [skip actions]

c00859f

FIX: One more [skip azp] [skip actions]

9c8ec90

FIX: Title [skip azp] [skip actions]

47363b5

CarinaFo added 6 commits June 30, 2024 20:11

first draft of formulaic paired t-test

1f6221d

first draft without cluster plotting class implemented

37616e5

cleaned up plotting function

6aaef9a

implemented cluser results class

0f99c70

added contribution

4083691

Merge branch 'mne-tools:main' into new_cluster_stats_api_GSOC24

42d70f9

drammock and others added 18 commits August 22, 2024 15:20

remove unused test helper func

cac0559

vulture allowlist update

47ac838

included BaseTFR in validate_cluster_df

033c158

comments on cluster_test function

2c2f341

updated clusterResult class and plot function

e9b5fa2

updated function call for plotting

2fd17d3

changed color

150c530

docstring/docdict cleanups and fixes

3cc9e2c

implemented Dan's comments

2c27a69

implemented Dan's comments

2664ee2

test for handling different MNE objects - test is failing

4927544

adjusted test to account for multiple subjects

006acdf

refactor df validation to return bools

f0f4cba

unrelated typing fix

346e3ce

rework test

a49d2cd

minor cleanup

a01182b

fix imports

0984b61

use MRO in test too

6322499

drammock force-pushed the new_cluster_stats_api_GSOC24 branch from 81ce0d0 to 6322499 Compare August 22, 2024 20:21

drammock and others added 5 commits August 22, 2024 15:23

fix vulture allowlist

a04b8a3

fix nesting and type hints

f1d39bf

strict=False

987ea43

nest import in test file too

78829b4

Merge branch 'new_cluster_stats_api_GSOC24' of https://github.com/Car…

eb98849

…inaFo/mne-python into new_cluster_stats_api_GSOC24

CarinaFo changed the title ~~added cluster test api, first commit~~ added cluster test api Sep 17, 2024

CarinaFo changed the title ~~added cluster test api~~ new cluster test API Sep 17, 2024

CarinaFo added 3 commits September 17, 2024 11:10

Merge branch 'main' into new_cluster_stats_api_GSOC24

ac943e3

clean up pyproject mess

372bcca

add n_permutations, plotting, added min_cluster_p_value

4da8463

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

new cluster test API #12663

new cluster test API #12663

CarinaFo commented Jun 14, 2024 •

edited

Loading

drammock left a comment

hoechenberger commented Jun 15, 2024

cbrnr commented Jun 19, 2024

CarinaFo commented Jun 19, 2024

hoechenberger commented Jun 19, 2024 •

edited

Loading

drammock commented Jun 19, 2024

hoechenberger commented Jun 19, 2024

drammock commented Jun 19, 2024 •

edited

Loading

larsoner commented Jun 25, 2024

drammock commented Aug 22, 2024

new cluster test API #12663

Are you sure you want to change the base?

new cluster test API #12663

Conversation

CarinaFo commented Jun 14, 2024 • edited Loading

drammock left a comment

Choose a reason for hiding this comment

hoechenberger commented Jun 15, 2024

cbrnr commented Jun 19, 2024

CarinaFo commented Jun 19, 2024

hoechenberger commented Jun 19, 2024 • edited Loading

drammock commented Jun 19, 2024

hoechenberger commented Jun 19, 2024

drammock commented Jun 19, 2024 • edited Loading

larsoner commented Jun 25, 2024

drammock commented Aug 22, 2024

CarinaFo commented Jun 14, 2024 •

edited

Loading

hoechenberger commented Jun 19, 2024 •

edited

Loading

drammock commented Jun 19, 2024 •

edited

Loading