Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature]: Add workflow for downloading reference genomes #47

Open
MillironX opened this issue Jun 20, 2022 · 0 comments
Open

[Feature]: Add workflow for downloading reference genomes #47

MillironX opened this issue Jun 20, 2022 · 0 comments
Labels
enhancement New feature or request

Comments

@MillironX
Copy link
Member

Summary

There should be a way to pre-download all of the present genomes from NCBI so that YAVSAP can be run offline.

Added Features

Additional parameters

Propose any new parameters that would be needed to implement this feature,
including its name, type, and default value, and explain how it would be
consumed by the new/existing processes.

  • --download_genomes
    • Switches the pipeline to "genome download" mode, where it will not run any analysis, but will instead simply download and reformat the reference genome files from NCBI.

Additional output

  1. Reference genome fasta/GenBank file corresponding to --genome
  2. Strain reference genome fasta/GenBank file containing all of the sequences from --genome_list
  3. Strain reference genome TSV file that points to 2 in a format YAVSAP can recognize (i.e., from [Feature]: Add ability to use files as a reference genome #44, /data/yavsap/results/genomes.fasta:NC_01234.1).

These files should probably be output even if downloading the files directly in the pipeline (the way it's done now).

@MillironX MillironX added the enhancement New feature or request label Jun 20, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant