Add hot fix for copy_to_buffer #785

WenzDaniel · 2023-12-13T09:21:42Z

What is the problem / what does the code in this PR do
In this PR we add a hotfix for the issue described in #781. Currently, we raise an error in copy_to_buffer if the dtypes of the input and buffer change but the same function name is used. This also leads to failing tests in XENONnT/straxen#1299, which is needed for XENONnT/straxen#1300.

Can you briefly describe how it works?
The PR is only a hotfix as it only adds a random string to the copy function name if the typing error is raised. However, this also means that the copy function is not cached after a dtype change unless the cache is cleared.

For the future:

A better approach would be to evaluate the cached function first and define the copy to buffer function name based on the dtype difference.


JoranAngevaare · 2023-12-13T09:50:52Z

Can you not just change the name of the cache key when you call the function? strax.copy_to_buffer(peaks, result, "bla") -> strax.copy_to_buffer(peaks, result, f"bla_{self.some_option}").

Of course, it is probably still good to keep the check that the dtype matches 😉.

WenzDaniel · 2023-12-13T10:03:09Z

Hej @JoranAngevaare, nice to hear from you. Yes, I have the same concern, and I also thought about adding an extra option to every plugin that calls this function, but that is a bit too cumbersome in my opinion. Right now I have no idea for a good solution. I do not know how we can easily get the dtype in the cached function. I am open to suggestions.

WenzDaniel · 2023-12-13T10:04:41Z

Looks like this hotfix does not work anyhow. It was working for me locally, though.

JoranAngevaare · 2023-12-13T10:36:19Z

Hi Daniel, likewise! Hope all is well. I just stumbled on this PR as I'm still subscribed to strax.

Something simple
The caching system is not very sophisticated, so you can just dump whatever string. I doubt that people are ever looking into the actual cached names.

Something blunt, which would also remove the need for the func_name, is just adding the repr of the dtype:

>>> import numpy as np
>>> a=np.ones([1,2], dtype=[('a', np.int64), ('b', np.int64)])
>>> repr(a.dtype)
"dtype([('a', '<i8'), ('b', '<i8')])"

You can just add that to the func_name, or even just make the func_name optional and just cache the repr of the dtype. You can make it fancier by throwing the repr into strax.deterministic_hash.
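A minimal sketch of that idea, not the code that was merged: the helper name copy_function_name and the key prefix are hypothetical, and hashlib stands in for strax.deterministic_hash so that the example has no dependencies. The dtype is passed as its repr string, as in the snippet above.

```python
import hashlib


def copy_function_name(dtype_repr: str, func_name: str = "") -> str:
    """Build a cache key from an optional name and a hash of the dtype repr.

    Hypothetical helper: hashlib is used here as a stand-in for
    strax.deterministic_hash.
    """
    digest = hashlib.sha1(dtype_repr.encode()).hexdigest()[:10]
    # The leading underscore keeps the key a valid Python identifier,
    # even when func_name is empty.
    return f"_copy_to_buffer_{func_name}_{digest}"


# Same field names, but the type of field 'b' differs in the second repr.
key_a = copy_function_name("dtype([('a', '<i8'), ('b', '<i8')])", "bla")
key_b = copy_function_name("dtype([('a', '<i8'), ('b', '<i2')])", "bla")
key_no_name = copy_function_name("dtype([('a', '<i8'), ('b', '<i8')])")
```

Because the whole repr is hashed, a change in either a field name or a field type gives a different key, and the func_name can be left out.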

Something a bit more advanced
Some other things worth considering would be making the cache explicit, e.g. following the URLConfig _CACHE. I don't remember why I added it to globals at the time rather than some global dictionary.

Another, additional option might be using an LRU cache to make sure that the number of cached functions doesn't blow up, maybe something like straxen.CacheDict; a minimal implementation is also used in CMT1.

Both are probably overkill, I don't really believe that you can end up in a case where somehow the dtype is changed every time someone calls this function. But we have to keep in mind that poor PhD who will one day face the memory leak that my limited imagination had discarded 😉.

The minimal amount of work is probably adding a _COPY_CACHE global dictionary instead of globals() and raising an error if more than 1e6 keys are cached.
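As a rough sketch of that last option (the helper get_or_build, the error message, and the toy copier are assumptions for illustration, not strax code), an explicit module-level dictionary with a size guard could look like this:

```python
# Explicit module-level cache, used instead of writing into globals().
_COPY_CACHE = {}
_MAX_CACHED_KEYS = 1_000_000  # the 1e6 limit mentioned above


def get_or_build(key, build, max_keys=_MAX_CACHED_KEYS):
    """Return the cached object for key, calling build() only on first use."""
    if key in _COPY_CACHE:
        return _COPY_CACHE[key]
    if len(_COPY_CACHE) >= max_keys:
        # Fail loudly rather than leak memory silently.
        raise RuntimeError(f"More than {max_keys} copy functions cached")
    _COPY_CACHE[key] = build()
    return _COPY_CACHE[key]


build_calls = []


def build_copier():
    """Toy stand-in for generating a copy function; records each build."""
    build_calls.append(1)
    return lambda source, buffer: buffer.update(source)


first = get_or_build("copy_ab", build_copier)
second = get_or_build("copy_ab", build_copier)  # served from the cache
```

The second call returns the same function object without rebuilding it, and a new key is refused once the limit is reached.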

WenzDaniel · 2023-12-13T11:26:11Z

The caching system is not very sophisticated, so you can just dump whatever string. I doubt that people are ever looking into the actual cached names

Yes, I have now also decided to make it very simple. I additionally cache the dtype when calling the function. This way, the number of cached functions should be well below 10.

WenzDaniel · 2023-12-13T11:41:17Z

I do not get why it is failing here... locally it works...

JoranAngevaare · 2023-12-13T12:50:03Z

It simply fails because coveralls is testing without numba. But why would you want to use a try-except block when you can just always add the dtype fields to the cached function name by default? That is also much better performance-wise.

WenzDaniel · 2023-12-13T15:28:23Z

Haha, thanks @JoranAngevaare for pointing it out. I also noticed now that it is a different raise condition. I had just the same thought about making it simpler :D. You should come back to us, you are missed :-) I only use the first two letters of each field in the dtype so that the name does not explode.
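A small sketch of the two-letter abbreviation, under the assumption that the abbreviations are simply joined with underscores (the helper short_field_key and the field lists are illustrative, not taken from the PR):

```python
def short_field_key(field_names, n_chars=2):
    """Join the first n_chars characters of every field name."""
    return "_".join(name[:n_chars] for name in field_names)


key_one = short_field_key(["time", "length"])             # 'ti_le'
key_two = short_field_key(["tight_coincidence", "left"])  # 'ti_le'
```

The key stays short, but different sets of fields can map to the same abbreviation, and field types are not part of the key at all.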

Being ACs comes with very little time for these things.... this sucks a bit...

coveralls · 2023-12-13T15:34:22Z

coverage: 91.276% (-0.3%) from 91.581%
when pulling 61d0e75 on fix_copy_to_buffer
into b0ca3cb on master.

JoranAngevaare

Hi Daniel, I'm sure there is still loads of fun to be had in XENON 😉.

Changes look good! Seems simple and effective. My only suggestion, which would also cache the dtype of each field, is something like the below. The field_names_hash will be unreadable anyway, so you might consider including the np.int64, np.int16, etc. as well.

Additionally, you can now make the func_name argument optional. I don't really think there is a use case where you'd use the same buffer fields but for some reason need a new func_name.

Good to have the test in place 👍 !

strax/dtypes.py

Co-authored-by: Joran R. Angevaare <[email protected]>

WenzDaniel · 2023-12-18T07:06:29Z

I will merge, since Joran basically approved.

WenzDaniel and others added 4 commits December 13, 2023 03:16

Add hot fix for copy_to_buffer

7a8681b

Merge branch 'master' into fix_copy_to_buffer

bc2d348

Fix

302a0fb

Merge branch 'fix_copy_to_buffer' of github.com:AxFoundation/strax in…

3d1fe49

…to fix_copy_to_buffer

WenzDaniel added 2 commits December 13, 2023 05:23

Make simple caching

f8968a6

Make code factor happy

7d51410

WenzDaniel added 3 commits December 13, 2023 05:27

Now happy?

19dcac8

...

94b6f0e

Me being stupid

c54b0e6

Make it simple...

54aa030

WenzDaniel mentioned this pull request Dec 13, 2023

Make peaklets dtype flexiable XENONnT/straxen#1299

Merged

FaroutYLq self-requested a review December 14, 2023 16:11

Merge branch 'master' into fix_copy_to_buffer

d97ed7d

JoranAngevaare previously approved these changes Dec 15, 2023


WenzDaniel dismissed JoranAngevaare’s stale review via 18db0ff December 15, 2023 10:45

WenzDaniel and others added 2 commits December 15, 2023 11:45

Update strax/dtypes.py

18db0ff

Co-authored-by: Joran R. Angevaare <[email protected]>

Fix import update comment

61d0e75

WenzDaniel merged commit 70f69c8 into master Dec 18, 2023
11 checks passed

WenzDaniel deleted the fix_copy_to_buffer branch December 18, 2023 07:06

dachengx mentioned this pull request Aug 25, 2024

Copy to buffer breaks if dtype changes. #781

Closed
