-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Questions about the principle of computation #78
Comments
Greetings, @YangJingqii ! Thanks for the questions.
Thanks! |
@YangJingqii thank you, it is possible to calculate πN/πS for individual codons, but there can be substantial uncertainty, and in the extreme many codons will have πS=0 or πS undefined. In the instance you show, πS is undefined, and πN/πS is undefined as well. Depending on whether you're just offering descriptive statistics or trying to test a specific hypothesis, other approaches with πN/πS (i.e. identifying a priori meaningful groups of codons which can be bootstrapped) or even dN/dS may be necessary. |
Hello @YangJingqii ! Regarding the possibility of ignoring certain variant frequencies for calculating π, this is no longer π. In other words, π is a property of a site, not a property of particular variants. (The one exception is excluding certain variants or frequencies which you suspect to be sequencing errors, e.g., AF<1%.) What you are suggesting is to calculate the mean number of pairwise differences among only a subset of variants, which is not customary and would probably be difficult to interpret. It's always possible to invent a new metric, but I'm not sure of your goal or what the meaning of the resultant metric would be. Regarding the paper you cite, I am not a coauthor so I do not know what the authors mean by only calculating variation among variants lower than 50%. Looking at the paper, I don't see a statement like that in the main text. Could you point out which part you are referring to? Chase |
Thank you for your response very much!This part of the article is mainly about these two sentences:
|
Interesting! Not sure about that. If you want to know what they did you may have to ask the authors. But the correct thing is to include all iSNVs which pass QC, regardless of frequency, in the π calculation. For example, there is technically no difference between a C>T iSNV that has AF=5% and a T>C iSNV that has AF=95%; the only difference is which allele is considered REF, and which is considered ALT. |
Hi,
I have some questions when using these scripts.
./SNPGenie_sliding_windows.R G_codon.txt N S 10 1 1000 100 NONE 6 > sw_G.out
, I get an error3.Designed for use with sequence data that outsizes what can be handled by other software platforms, I was wondering when does it not apply? Does calculating dn/ds without using phylogenetic tree create inaccuracies?
I realize I have raised quite a number of questions, and I truly appreciate your patience and expertise in helping me understand these important aspects of the analysis.
Thank you very much!
The text was updated successfully, but these errors were encountered: