Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

database --default parameters #106

Open
Rundon-svg opened this issue Oct 7, 2022 · 0 comments
Open

database --default parameters #106

Rundon-svg opened this issue Oct 7, 2022 · 0 comments

Comments

@Rundon-svg
Copy link

Hi Qiyun,

In database.doc,
--default
is equivalent to
--cats microbe --sample 1 --rank species_latin --above --reference --represent --typemater --compile diamond

While the "--manual" used to compare the two command:

  1. --default
...
Done.
  Total number of genomes in categories: 263640.
Filtering genomes...
Done.
Filtering genomes by taxonomy...
  Dropped 43 genomes without capitalized organism name.
  Dropped 8 genomes without valid taxId.
Done.
Sampling genomes based on taxonomy...
Up to 1 genome(s) will be sampled per species.
  Sampled 39046 genomes from 39046 species.
Included 16 reference genomes.
Included 17182 representative genomes.
Total number of sampled genomes: 39048.
  1. equivalent command
...
Done.
  Total number of genomes in categories: 263640.
Filtering genomes...
Done.
Filtering genomes by taxonomy...
  Dropped 43 genomes without capitalized organism name.
  Dropped 8 genomes without valid taxId.
Done.
Sampling genomes based on taxonomy...
Up to 1 genome(s) will be sampled per species **(Latinate names only)** .
  Sampled 16980 genomes from 16980 species.
Sampling will also be performed on: genus, family, order, class, phylum.
  Sampled 270 more genomes from 270 genera.
  Sampled 0 more genomes from 0 families.
  Sampled 0 more genomes from 0 orders.
  Sampled 0 more genomes from 0 classes.
  Sampled 0 more genomes from 0 phyla.
  Sampled a total of 17250 genomes at species and above.
Included 16 reference genomes.
Included 17182 representative genomes.
Included 19724 type material genomes.
Total number of sampled genomes:   24042.

May the equivalent command add sampled steps? Or something I did wrong in installation and command line?

best,

@Rundon-svg Rundon-svg reopened this Oct 7, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant