Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

For unbinned contigs, run CAT in "contig mode" (CAT_pack contigs) instead of "bin mode" (CAT_pack bins) #735

Open
amizeranschi opened this issue Dec 17, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@amizeranschi
Copy link
Contributor

Hi !
I've been testing CAT for taxonomic classification of bins, which were created with metabat2 + maxbin2 and refined with das tool. I noticed that the pipeline runs CAT both on the bins and on the unbinned contigs. However, the latter are treated as one huge bin, and thus a single taxonomic label gets assigned to the whole lot:

For example:

MEGAHIT-DASToolUnbinned-PC2.fa	taxid assigned	based on 261612/286893 ORFs	1;131567;2	1.00;1.00;0.98	root (no rank): 1.00	cellular organisms (no rank): 1.00	Bacteria (superkingdom): 0.98

and:

MEGAHIT-DASToolUnbinned-PM2.fa	taxid assigned	based on 453818/502870 ORFs	1;131567;2;1224	1.00;1.00;0.97;0.35	root (no rank): 1.00	cellular organisms (no rank): 1.00	Bacteria (superkingdom): 0.97	Proteobacteria (phylum): 0.35

In both cases, there are plenty of ORFs that could be viral or eukaryotic and it would be interesting to have the ability to zoom in and see what is actually in there at a contig level.

For the unbinned contigs, it would make more sense to run CAT in "contig mode" (CAT_pack contigs) instead of "bin mode" (CAT_pack bins).
See: https://github.com/MGXlab/CAT_pack?tab=readme-ov-file#running-catbatrat

@amizeranschi amizeranschi added the enhancement New feature or request label Dec 17, 2024
@amizeranschi
Copy link
Contributor Author

Relevant thread on #Slack: https://nfcore.slack.com/archives/CE9MS66BS/p1734430094593639

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant