Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add an --include all option to datasets download genome #375

Open
dtdoering opened this issue Jun 10, 2024 · 2 comments
Open

Add an --include all option to datasets download genome #375

dtdoering opened this issue Jun 10, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@dtdoering
Copy link

Is your feature request related to a problem? Please describe.

In my workflow, I frequently want to get the latest genome/annotation files for a number of RefSeq (GCF_*) and GenBank (GCA_*) genomes to do some further analyses. However, it can be hard to remember the exact spelling/terms used for each of the options, particularly when each desired file has to be listed.

Describe the solution you'd like

As a QoL feature, I'd like to be able to save some keystrokes by typing e.g.:

datasets download genome accession GCA_005981935.1 --include all

instead of:

datasets download genome accession GCA_005981935.1 --include genome,protein,cds,gff3,gbff,seq-report

So that the only thing I need to remember or copy/paste is the accession, instead of the accession and then the files listing.

Thanks!

@dtdoering dtdoering added the enhancement New feature or request label Jun 10, 2024
@ericcox1
Copy link
Collaborator

Hi @dtdoering,

Thanks for opening this issue. We will consider adding this feature in a future release.

Best,
Eric

Eric Cox, PhD [Contractor] (he/him/his)
NCBI Datasets
NIH/NLM/NCBI
[email protected]

@dtdoering
Copy link
Author

Adding another reason -- since many GenBank bacterial genomes only have annotations in GenBank format (and no GFF), the --include all option would be very useful when used with the --preview option, so that one can see which files are even available for a given genome before deciding whether to download it or choose a different one.

That said, thanks for the info! Would love to see this added in a future release (or take a stab at a PR for it myself, pending #229)!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants