Clusters with different parent number but the same MAD, seed, length, n_taxa, n_seqs #55

Open
Bunholi opened this issue May 20, 2021 · 0 comments

Hi @DomBennett

I ran the whole pipeline and everything went well, but when examining the output of summary(phylotaR object) I noticed repeated clusters that share the same MAD, seed, sequence length, n_taxa and n_seqs but have a different Parent number. I checked the taxon IDs, and those clusters contain the same sequences, which means they are identical; one has the genus as its parent and the other the tribe.

I noticed that these can be filtered (keeping only one of each) using the MAD variable. Is there a function such as "get_MAD", or something similar, to extract this measure so that I can drop the "repeated" clusters?
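
The filtering described above could be sketched in base R roughly as follows. This is only an illustration on a toy data frame: it assumes the summary is (or can be coerced to) a data frame, and the column names (ID, Parent, Seed, MAD, Med_sql, N_taxa, N_seqs) and every value are made up for the example, not taken from real output. Matching on MAD alone could merge unrelated clusters that happen to share a score, so the sketch matches on all of the repeated columns.

```r
# Toy stand-in for the summary table. Column names and values are
# assumptions for illustration only, not real phylotaR output.
smry <- data.frame(
  ID      = c("0", "1", "2", "3"),
  Parent  = c("1001", "2002", "1001", "2002"),  # differs between the repeats
  Seed    = c("AB000001.1", "AB000001.1", "AB000002.1", "AB000003.1"),
  MAD     = c(0.95, 0.95, 0.80, 0.99),
  Med_sql = c(650, 650, 1140, 420),
  N_taxa  = c(12, 12, 8, 5),
  N_seqs  = c(30, 30, 15, 9),
  stringsAsFactors = FALSE
)

# Columns that are identical between the repeated clusters (Parent excluded).
key_cols <- c("Seed", "MAD", "Med_sql", "N_taxa", "N_seqs")

# duplicated() flags every row whose key columns match an earlier row,
# so the first cluster of each repeated group is kept.
is_dup <- duplicated(smry[, key_cols])

dup_ids     <- smry$ID[is_dup]   # cluster IDs that could be dropped
unique_smry <- smry[!is_dup, ]   # summary with one row per distinct cluster
```

Identical summary statistics do not by themselves prove that two clusters hold the same sequences, so the taxon ID check described above is still worth doing before dropping anything. The IDs in `dup_ids` could then be passed to a cluster-dropping function; phylotaR appears to provide `drop_clstrs(phylota, cid)` and `calc_mad(phylota, cid)` for this kind of work, but neither call is shown in this post and both should be checked against the package documentation.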

Thank you,

Ingrid Bunholi
