More large-catalog speedups #397

Merged · 7 commits · Sep 25, 2024

Conversation

@jvansanten (Contributor) commented Sep 9, 2024

This PR introduces a bundle of changes that, together with #396, reduce the time to initialize an injector and run a single trial with 10k sources and the 10-year NT dataset (gamma_precision reduced to 0.25, so only 8 gamma points) from 6.2 hours to 3 minutes. Of that, a solid 2 minutes are spent in load_data()*. The speedups arise as follows:

  • Add a new TableInjector that keeps its MC data in an astropy.table.Table, sorted by declination, rather than in a gigantic structured array. This helps in two ways. First, operations on individual columns of the table make better use of caches, since the values are stored contiguously in memory rather than 176 bytes apart as in the structured array. Second, sorting the table by declination means that the expensive band-masking step can be replaced by a cheap binary search (see the sketch after this list). This is also more cache-efficient than indexing with a boolean array, since applying a boolean mask requires reading every single value in the column (or, worse, the structured array). This injector is in all ways better than LowMemoryInjector (and perhaps the base MCInjector).
  • Optimize StdMatrixKDEEnabledLLH's coincident event selection in much the same way, using a Table to store the input data sorted by declination, and processing sources in declination order.
  • Build up SoB_only_matrix row-by-row in canonical CSR format, and skip constructing an explicit coincidence matrix entirely. Updating a scipy.sparse matrix turns out to be surprisingly expensive, and largely unnecessary.

*Most of the remaining fat is in flarestack.icecube_utils.dataset_loader.load_dataset() and its use of append_records(). This could be obviated by using astropy Table everywhere, but that would make the most sense if the data were stored that way in the first place, which is probably too large a change at this point.
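For illustration, here is a minimal sketch of the declination-sorted selection idea described above. The column names and the band_slice helper are hypothetical, not the actual TableInjector API:

```python
import numpy as np
from astropy.table import Table

# Hypothetical toy table of MC events, pre-sorted by declination, as the
# new TableInjector keeps them. Column names are illustrative.
rng = np.random.default_rng(0)
mc = Table({
    "dec": np.sort(rng.uniform(-np.pi / 2, np.pi / 2, 100_000)),
    "logE": rng.uniform(2.0, 7.0, 100_000),
})

def band_slice(table, dec_min, dec_max):
    """Select events in a declination band via binary search.

    Because the table is sorted by declination, np.searchsorted finds the
    band edges in O(log N), and the result is a contiguous slice rather
    than a boolean mask that has to read every row.
    """
    lo = np.searchsorted(table["dec"], dec_min, side="left")
    hi = np.searchsorted(table["dec"], dec_max, side="right")
    return table[lo:hi]

band = band_slice(mc, -0.1, 0.1)

# The boolean-mask equivalent touches every value in the column:
# band = mc[(mc["dec"] >= -0.1) & (mc["dec"] <= 0.1)]
```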

Slicing with a boolean array already returns a copy
Band masks are completely unnecessary when data are ordered in declination;
binary search is cheap. Speeds up injector initialization by a factor of
~60 with 1000 sources.
csr_matrix.getrow() is surprisingly slow (lots of layers of indirection
and argument checking), but also completely unnecessary, since
coincidence_matrix and SoB_only_matrix have the same sparsity structure
by construction.
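For illustration, a minimal sketch of what assembling a sparse matrix directly in canonical CSR form looks like, rather than updating a scipy.sparse matrix row by row. The build_csr helper and the toy input are hypothetical, not the flarestack code:

```python
import numpy as np
from scipy import sparse

def build_csr(rows, n_cols):
    """Assemble a matrix from per-row (column_indices, values) pairs,
    directly in canonical CSR form.

    Collecting data, indices, and indptr in plain lists and constructing the
    matrix once at the end avoids repeatedly updating a scipy.sparse matrix
    or pulling rows back out with getrow().
    """
    data, indices, indptr = [], [], [0]
    for cols, vals in rows:
        order = np.argsort(cols)  # canonical CSR: sorted column indices per row
        indices.extend(np.asarray(cols)[order])
        data.extend(np.asarray(vals)[order])
        indptr.append(len(indices))
    return sparse.csr_matrix(
        (data, indices, indptr), shape=(len(indptr) - 1, n_cols)
    )

# Toy example: two sources, five events; each row lists the events
# coincident with that source and their signal-over-background values.
m = build_csr([([3, 0], [1.2, 0.5]), ([1, 4, 2], [0.1, 0.3, 0.7])], n_cols=5)
```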
@sathanas31 (Contributor) commented:
Finally tested! For the same (single) scale & seed and 1 trial with 1 src, the scrambled dataset from table_injector is the same as the one from low_memory_injector. Using this table_injector dataset, the llh kwargs were compared and found to be the same as what you get from main (merged #396). Unless you want me to check low-level params (e.g. coincident data), we're good to go :D

@JannisNe (Collaborator) commented:

If the datasets are the same, then the coincident data will be the same as well.

@jvansanten merged commit 50ca20e into icecube:master on Sep 25, 2024
9 of 13 checks passed
@jvansanten deleted the oneshot-sob branch on September 25, 2024 at 07:37