Storage of (large) test data #277

athowes · 2024-09-03T12:21:57Z

For some data sets used in tests, we currently create them with inst/generate_examples.R and then store them in inst/extdata. At the moment these are fit.rds and fit_gamma.rds. These files are above the 50.00 MB that GitHub recommends as the maximum file size.
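
A quick way to see which stored files are over the limit is to list their sizes. This is only a sketch: `oversized_files()`, its arguments, and the temporary directory standing in for inst/extdata are illustrative names, not something that exists in the package.

```r
# List files in a directory and flag those above a size limit (in MB).
# `dir` and `limit_mb` are hypothetical arguments for this sketch.
oversized_files <- function(dir, limit_mb = 50) {
  paths <- list.files(dir, full.names = TRUE)
  size_mb <- file.size(paths) / 1024^2
  data.frame(
    file = basename(paths),
    size_mb = round(size_mb, 2),
    over_limit = size_mb > limit_mb
  )
}

# Example against a temporary directory standing in for inst/extdata
demo_dir <- tempfile("extdata_demo")
dir.create(demo_dir)
saveRDS(rnorm(1e5), file.path(demo_dir, "fit.rds"))

# A small limit is used here so the demo file is flagged
size_report <- oversized_files(demo_dir, limit_mb = 0.1)
size_report
```

In the real repository the call would presumably be pointed at inst/extdata with the default 50 MB limit.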

One option is to thin down these fits to make them smaller.

Another option is to accept that this is the wrong approach and rethink where we store data for tests and how we generate it.

seabbs · 2024-09-03T13:09:31Z

> One option is to thin down these fits to make them smaller.

In the first instance, thin them down. Yes, it's definitely a "wrong" approach, but I think it works for now. We should have an issue to explore alternatives. IMO I can't see a reason these would need to be this big.
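
As a rough illustration of what thinning buys, the sketch below keeps every 10th draw of a simulated draws matrix and compares file sizes on disk. The matrix is a stand-in: the real fit objects are likely not plain matrices, so how to thin them depends on their class, and `thin_draws()` is a hypothetical helper.

```r
# Keep every `by`-th draw (rows are draws, columns are parameters).
thin_draws <- function(draws, by = 10) {
  keep <- seq(1, nrow(draws), by = by)
  draws[keep, , drop = FALSE]
}

set.seed(1)
draws <- matrix(rnorm(4000 * 50), nrow = 4000)  # simulated stand-in for a fit
thinned <- thin_draws(draws, by = 10)

# Compare on-disk sizes; xz compression is slower but usually smaller
full_path <- tempfile(fileext = ".rds")
thin_path <- tempfile(fileext = ".rds")
saveRDS(draws, full_path)
saveRDS(thinned, thin_path, compress = "xz")
file.size(c(full_path, thin_path)) / 1024^2
```

Thinning reduces the effective sample size, so any test that relies on posterior summaries may need looser tolerances afterwards.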

athowes · 2024-09-03T13:25:05Z

I'm happy to close this issue once the fits are thinned and then add another issue for alternatives.

seabbs · 2024-09-03T18:02:39Z

I just looked and it seems like we could largely replace this approach by running a model fit in setup.R?

seabbs · 2024-09-04T09:41:26Z

Note that setup.R always runs before the test suite runs and so anything in it is always available.
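
A minimal sketch of what that could look like, assuming the file lives at tests/testthat/setup.R. The `lm()` fit on simulated data is only a stand-in for the package's real model fit, and `sim_data` and `prefit` are hypothetical names.

```r
# tests/testthat/setup.R (sketch)
# A quick fit created once, before the test files run.
# `lm()` on simulated data stands in for the real model fit.
set.seed(101)
sim_data <- data.frame(x = rnorm(200))
sim_data$y <- 1 + 2 * sim_data$x + rnorm(200, sd = 0.5)

prefit <- lm(y ~ x, data = sim_data)

# In a test file, the object could then be used directly, e.g.
# testthat::expect_s3_class(prefit, "lm")
```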

athowes · 2024-09-04T09:54:35Z

The downside is that when I am working locally I run setup.R to generate the objects needed, so ideally there wouldn't be long-running things in there (like model fitting).
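
One way to soften this would be to cache the fit locally so setup.R only refits when no cached copy exists. This is a sketch under assumptions: `cached_fit()`, `path`, `fit_fn`, and `refit` are hypothetical names, and `lm()` stands in for the slow model fit.

```r
# Return a cached object if it exists, otherwise create and cache it.
# `path`, `fit_fn`, and `refit` are hypothetical names for this sketch.
cached_fit <- function(path, fit_fn, refit = FALSE) {
  if (!refit && file.exists(path)) {
    return(readRDS(path))
  }
  fit <- fit_fn()
  saveRDS(fit, path)
  fit
}

# Stand-in for a slow model fit; counts how often it is actually run
n_fits <- 0
slow_fit <- function() {
  n_fits <<- n_fits + 1
  lm(dist ~ speed, data = cars)
}

cache_path <- tempfile(fileext = ".rds")
fit_a <- cached_fit(cache_path, slow_fit)  # fits and writes the cache
fit_b <- cached_fit(cache_path, slow_fit)  # reads the cache, no refit
n_fits
```

The cache file would need to stay out of version control, otherwise the large-file problem comes back.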

seabbs · 2024-09-04T10:00:27Z

But surely any fit we need for a test is going to be <30 seconds? There really doesn't seem to be a need for more.

athowes · 2024-09-04T10:03:01Z

The Gamma one is not <30 seconds currently.

And if we are intending to do "parameter recovery" integration tests then they can't be bad fits.
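
To make "parameter recovery" concrete, here is a minimal sketch: simulate from a gamma distribution with known parameters, estimate them, and check the estimates land near the truth. Method-of-moments estimates are swapped in for the actual model fit to keep this fast, and the tolerance and the `recovered()` helper are assumptions.

```r
set.seed(2024)
true_shape <- 2
true_rate <- 0.5
x <- rgamma(1e5, shape = true_shape, rate = true_rate)

# Method-of-moments estimates (a stand-in for the real model fit):
# for a gamma, mean = shape / rate and variance = shape / rate^2
est_rate <- mean(x) / var(x)
est_shape <- mean(x)^2 / var(x)

# TRUE if the estimate is within a relative tolerance of the truth
recovered <- function(estimate, truth, rel_tol = 0.05) {
  abs(estimate - truth) / abs(truth) < rel_tol
}

stopifnot(recovered(est_shape, true_shape), recovered(est_rate, true_rate))
```

With a poor fit (too few draws, or one that has not converged) a check like this fails for reasons unrelated to the code under test, which is why such tests need fits of reasonable quality.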

seabbs · 2024-09-04T10:08:05Z

That doesn't seem ideal. It feels like the model should be workable with 30 seconds a core, so I find this surprising.

athowes · 2024-09-04T10:11:03Z

I might not have been setting cores as an argument. I can post the actual runtimes here.
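
A small sketch for collecting those runtimes and comparing them with the 30 second budget mentioned above. `time_fit()` and `budget_secs` are hypothetical names, and `lm()` again stands in for the real fitting call.

```r
# Time a fitting function and compare it with a budget in seconds.
time_fit <- function(fit_fn, budget_secs = 30) {
  elapsed <- system.time(fit <- fit_fn())[["elapsed"]]
  list(fit = fit, elapsed = elapsed, within_budget = elapsed < budget_secs)
}

# Number of cores available on this machine (may be NA on some platforms)
parallel::detectCores()

res <- time_fit(function() lm(dist ~ speed, data = cars))
res$elapsed
res$within_budget
```

Running this once with and once without the cores argument set in the real fitting call would show whether the Gamma fit is slow for that reason.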

athowes added the low For a future release label Sep 3, 2024

athowes mentioned this issue Sep 4, 2024, in "Issue 248: Add prediction and log-likelihood methods for the latent_gamma family" #273 (merged)
