Combining databases to predict is much less than separately #111

1835033964 · 2023-02-15T08:44:01Z

Thanks to the hgtector, it helps me a lot. However I encountered a question when using hgtector, and I'd like to ask you about it.

The following are my steps：
First，I have download the microbe database and finished steps hgtector search and hgtector analysis. 1140 HGT-derived genes were predicted. (outputfile is "analysis_dir/hgts/result.txt")
Then，I have download the plant database and finished steps hgtector search and hgtector analysis. 776 HGT-derived genes were predicted.
Finally, I concatenated the microbe database and plant database (I merged the database that are fasta format, and make database with diamond) and finished the same steps. However, only 6 HGT-derived genes were predicted.
When I was running the hgtector, all the parameters are default. When building the database, The taxdump file used to create the database is the file downloaded from the "hgtector database" command （taxdump.tar.gz 57.43Mb）.And the taxonmap file is "prot.accession2taxid" file provided by Nr.

Is it reasonable to combine databases to predict much less than to predict separately, and what might be the cause? Thank you.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Combining databases to predict is much less than separately #111

Combining databases to predict is much less than separately #111

1835033964 commented Feb 15, 2023

Combining databases to predict is much less than separately #111

Combining databases to predict is much less than separately #111

Comments

1835033964 commented Feb 15, 2023