Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

download_dataset doesn't work as expected on my CHPC system. #38

Open
leannmlindsey opened this issue Mar 25, 2024 · 1 comment
Open

Comments

@leannmlindsey
Copy link

I was able to get download_dataset to work as expected on my macbook pro but when I try to use it on my university's CHPC system I get this error.

Do you have any idea what could be causing this? I am trying to repeat the experiments on the HyenaDNA paper and their code depends upon this function working properly.

(p100_hyena-dna) [u1323098@kp359:test_dir]$ python
Python 3.8.18 | packaged by conda-forge | (default, Oct 10 2023, 15:44:36)
[GCC 12.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.

from genomic_benchmarks.data_check import list_datasets
list_datasets()
['drosophila_enhancers_stark', 'dummy_mouse_enhancers_ensembl', 'human_ensembl_regulatory', 'demo_coding_vs_intergenomic_seqs', 'demo_human_or_worm', 'human_nontata_promoters', 'human_enhancers_ensembl', 'human_enhancers_cohn', 'human_ocr_ensembl']
from genomic_benchmarks.loc2seq import download_dataset
download_dataset("human_nontata_promoters", version=0)
Traceback (most recent call last):
File "", line 1, in
File "/uufs/chpc.utah.edu/common/home/u1323098/software/pkg/miniconda3/envs/p100_hyena-dna/lib/python3.8/site-packages/genomic_benchmarks/loc2seq/loc2seq.py", line 55, in download_dataset
return download_from_cloud_cache((dataset_name, version), Path(dest_path) / dataset_name)
File "/uufs/chpc.utah.edu/common/home/u1323098/software/pkg/miniconda3/envs/p100_hyena-dna/lib/python3.8/site-packages/genomic_benchmarks/loc2seq/cloud_caching.py", line 32, in download_from_cloud_cache
gdown.download(
File "/uufs/chpc.utah.edu/common/home/u1323098/software/pkg/miniconda3/envs/p100_hyena-dna/lib/python3.8/site-packages/gdown/download.py", line 259, in download
filename_from_url = m.groups()[0]
AttributeError: 'NoneType' object has no attribute 'groups'

@davidboening
Copy link

I had the same problem today, updating gdown fixed the issue:
pip install --upgrade gdown
I'm guessing this is due to some changes in the Google Drive API.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants