Make dataset loaders on-the-fly #58

cthoyt · 2022-01-21T12:54:30Z

I think it would be better to have the dataset download and processing happen client-side, then use pystow to store the results in a reliable place. This would also allow the TWOSIDES and DrugBank datasets, which require random negative sampling, to be used with multiple random seeds, e.g., to investigate the robustness of results. Further, it would allow for a more idiomatic dataset loader that's extensible to new datasets.
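To make the multiple-seeds idea concrete, here is a minimal sketch of seeded negative sampling done client-side. It is an illustration under assumptions, not the project's implementation: the function name `sample_negatives`, the `Pair` alias, and the toy drug identifiers are all hypothetical, and the one-negative-per-positive ratio is an arbitrary choice. It enumerates all candidate pairs, which is fine for a toy example; a real loader would likely need something more scalable, and would cache the processed output (e.g. in a pystow-managed directory) keyed by the seed.

```python
import random
from itertools import combinations
from typing import List, Set, Tuple

Pair = Tuple[str, str]  # hypothetical alias for an unordered drug pair


def sample_negatives(positives: List[Pair], seed: int) -> List[Pair]:
    """Return up to len(positives) unobserved pairs, reproducibly for a seed."""
    rng = random.Random(seed)  # local generator, so global state is untouched
    drugs = sorted({drug for pair in positives for drug in pair})
    # Normalize observed pairs so that (a, b) and (b, a) count as the same pair.
    known: Set[Pair] = {(min(a, b), max(a, b)) for a, b in positives}
    # combinations() over a sorted list yields pairs already in sorted order.
    candidates = [pair for pair in combinations(drugs, 2) if pair not in known]
    rng.shuffle(candidates)
    return candidates[: len(positives)]


if __name__ == "__main__":
    toy_positives = [("A", "B"), ("B", "C"), ("C", "D")]
    for seed in (0, 1, 2):
        print(seed, sample_negatives(toy_positives, seed))
```

Running the same loader with several seeds and comparing downstream metrics would then give a rough picture of how sensitive results are to the sampled negatives.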

Depends on:


cthoyt self-assigned this Jan 21, 2022

cthoyt added the dataset label Feb 1, 2022

cthoyt mentioned this issue Feb 15, 2022

Provide base class for dataset loaders #59

Merged

3 tasks
