More robust `delete_downstream_merge` #806

CBroz1 · 2024-01-26T23:02:38Z

Description

Now leveraging the underlying datajoint dependency on networkx to find the shortest pipeline paths
between a given table and various merges, or Session and a given table, for delete_downstream_merge and check_permissions in SpyglassMixin. By running a join on all tables across two points, each of these funcs should be more robust to edge cases.

Initial drafts proved to be very slow, building these chains across arbitrary pipeline points by hand. Even with networkx method, I opted to cache connections to improve speed when folks assign tables (e.g., from spy import Table; t=Table(); t.delete_downstream)

Fixes Cautious delete restriction #791 : Join on all tables between target and merges.
- common/common_usage.py : proposed new usage tracking function to monitor how long the cautious delete process takes, and where is it is used from
- dj_merge_tables: Migrate delete_downstream_merge out into the mixin to cache calculated links better
- dj_mixin.py: add cache of merge tables found and pipeline connections I dubbed 'chains' from self to merge tables. In practice, the number of merge tables I'm able to find depends on whether or not the user has imported them. The reload_cache flag allows a user to see a blocking merge part, import the relevant table, and then reload the cache
- Future versions could intercept the datajoint delete error and load these tables.
- Edited docs and notebooks to reflect new functionality
Ran jupytext on notebook edits from previous PRs
New black version 24.0 means a lot of minor changes elsewhere.

Checklist:

This PR should be accompanied by a release: Maybe not this one, but soon?
(If release) I have updated the CITATION.cff
I have updated the CHANGELOG.md
I have added/edited docs/notebooks to reflect the changes

review-notebook-app · 2024-01-29T22:38:33Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

edeno

There are still some linter issues but otherwise looks good.

src/spyglass/utils/dj_mixin.py

edeno · 2024-01-30T16:42:24Z

Join on all tables between target and merges

Is this potentially expensive with a large database?

CBroz1 · 2024-01-30T17:30:09Z

Join on all tables between target and merges

Is this potentially expensive with a large database?

It is, yes. Using networkx is a huge speed improvement over doing the search myself to find the chain, but the join process could be expensive. With narrow restrictions on the parent, I found it pretty quick to run the join, especially with the cached chain. My new usage table is designed to monitor how this gets used, and how long it takes. If still cumbersome, I can look into replacing TableChain.join's python join process with something that would be more SQL-native

samuelbray32

Tested in the lab database for conditions that were problems before and they looked good. Thanks @CBroz1 !

Cases:

downstream table doesn't contain original restrictions
projection of key name between tables

CBroz1 added 3 commits January 25, 2024 16:38

WIP: fix for LorenFrankLab#791

50443f1

WIP: LorenFrankLab#791, pt 2

4bf8d79

WIP: LorenFrankLab#791, needs testing

8875699

edeno linked an issue Jan 27, 2024 that may be closed by this pull request

Cautious delete restriction #791

Closed

CBroz1 added 3 commits January 29, 2024 13:08

Faster tree search with networkx

7c0dd4d

Blackify

8385eb3

Blackify 2

24fa6ba

edeno added the infrastructure Unix, MySQL, etc. settings/issues impacting users label Jan 29, 2024

CBroz1 and others added 2 commits January 29, 2024 14:11

Update changelog/docs

170fb1e

Update notebooks

e706c7d

CBroz1 marked this pull request as ready for review January 29, 2024 22:41

CBroz1 mentioned this pull request Jan 30, 2024

Blackify 24.1.1 #808

Merged

Merge branch 'master' of https://github.com/LorenFrankLab/spyglass

013d085

edeno requested changes Jan 30, 2024

View reviewed changes

src/spyglass/utils/dj_mixin.py Outdated Show resolved Hide resolved

src/spyglass/utils/dj_mixin.py Outdated Show resolved Hide resolved

edeno requested a review from samuelbray32 January 30, 2024 16:37

samuelbray32 reviewed Jan 30, 2024

View reviewed changes

Overwrite . Mixin add cached_property decorator

895e656

CBroz1 requested a review from edeno January 30, 2024 22:02

Cleanup docstrings and type annotations

ca9e9d2

edeno approved these changes Jan 31, 2024

View reviewed changes

edeno merged commit b42432f into LorenFrankLab:master Jan 31, 2024
7 checks passed

CBroz1 mentioned this pull request Jan 31, 2024

Address join-compatibility issue for long chains #811

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More robust `delete_downstream_merge` #806

More robust `delete_downstream_merge` #806

CBroz1 commented Jan 26, 2024 •

edited

Loading

review-notebook-app bot commented Jan 29, 2024

edeno left a comment

edeno commented Jan 30, 2024 •

edited by CBroz1

Loading

CBroz1 commented Jan 30, 2024

samuelbray32 left a comment

More robust delete_downstream_merge #806

More robust delete_downstream_merge #806

Conversation

CBroz1 commented Jan 26, 2024 • edited Loading

Description

Checklist:

review-notebook-app bot commented Jan 29, 2024

edeno left a comment

Choose a reason for hiding this comment

edeno commented Jan 30, 2024 • edited by CBroz1 Loading

CBroz1 commented Jan 30, 2024

samuelbray32 left a comment

Choose a reason for hiding this comment

More robust `delete_downstream_merge` #806

More robust `delete_downstream_merge` #806

CBroz1 commented Jan 26, 2024 •

edited

Loading

edeno commented Jan 30, 2024 •

edited by CBroz1

Loading