-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Also encountered "The estimated prior variance is unreasonably large" using ieugwasr ld_matrix_local function to calculate the ld matrix #223
Comments
FYI my session info: Matrix products: default locale: time zone: Asia/Shanghai attached base packages: other attached packages: loaded via a namespace (and not attached): |
@leoarrow1 If you haven't already done so, I would recommend checking for inconsistencies between the LD matrix and the matrix z-scores following this vignette. |
Thanks for your prompt response. Yes I have check this vignette as well as the former similar post on this issue. The solution was to add the "Adding the flag --keep-allele-order" in plink. But I have checked the ieugwasr code, it does have this flag in it. So this problem might be cause due to something else? Could you perhaps spot anything unusual in my code?
Any thought is highly appreciated. |
@leoarrow1 To better pinpoint the cause of the error, I would try to run |
Thanks! I will try so as soon as possible and let you know the result. |
FYI, the source code of ieugwasr::ld_matrix_local is as below for ld matrix calculation. It use plink for calculation but it does have the
It looks all right to me. Or is there any other option to do LD matrix calculation? |
@leoarrow1 I'm not familiar with the |
Is there a recommended method to compute LD matrix in ref panel for summary statistics? Thanks
----------
…--------------原始邮件--------------
发件人:"Peter Carbonetto ***@***.***>;
发送时间:2024年4月16日(星期二) 凌晨3:46
收件人:"stephenslab/susieR" ***@***.***>;
抄送:"leoarrow1 ***@***.***>;"Mention ***@***.***>;
主题:Re: [stephenslab/susieR] Also encountered "The estimated prior variance is unreasonably large" using ieugwasr ld_matrix_local function to calculate the ld matrix (Issue #223)
-----------------------------------
@leoarrow1 I'm not familiar with the ieugwasr R package (I have not used it before), but if you compute the LD matrix from the same genotype data used in computing the association statistics (e.g., z-scores), there should be no inconsistencies in your kriging_rss plot. On the other hand, if the LD matrix is computed using different genotype data, then it is quite possible to obtain large inconsistencies. Removing the largest inconsistencies may involve removing some SNPs, but we have not (yet) developed a systematic procedure for removing large inconsistencies. Hope this helps (a bit).
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
@leoarrow1 It would depend whether you have access to the same data that was used to generate the association statistics. If the answer is no, then we do not have precise recommendations except that you should work with genotype data that most closely resembles the data used to generate the association statistics. Perhaps @gaow can provide more detailed recommendations; I believe he is working on bioinformatics pipelines to help with this. |
Thanks for reply we do not have Original genome type individual data, but I guess we can use the reference individual ref panel from 1 kg, which is frequently used as reference data in post gwas analysis?
Message ID: ***@***.***>
|
To debug your pipeline i would suggest hand checking some example snp
pairs. Eg find some snp pairs that have high ld in your computer matrix but
very different z scores. Such pairs are obviously problematic.
Maybe finding them will help you identify your problem.
Sorry, but i hope you understand we don't have time to help debug all user
pipelines.
Matthew
…On Fri, Apr 19, 2024, 12:59 AkhilP ***@***.***> wrote:
Hello, I am also facing a similar issue with ld matrix inconsistency. I
have used PLINK to generate LD matrices and kept --keep-allele-order. I am
using the meta effect of gwas analysis and using 1000g as a reference for
LD. My correlation plot is off. Can you help me out with this issue? Here
is the plot for the reference. I have used a 1000g complete dataset as the
gwas results are from multi-ancestry datasets.
image.png (view on web)
<https://github.com/stephenslab/susieR/assets/19439352/261d6b70-ef29-4e76-9bbc-af1123afade1>
I have also noticed a thing like variant which is shown to have
significant tophit for the loci is not available in the credible set. Is it
the opposite that we should have that top variant in the credible set?
Regards
Akhil
—
Reply to this email directly, view it on GitHub
<#223 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AANXRRLAFQN7QVY3MJIMKZTY6FLRXAVCNFSM6AAAAABGBPRATGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDANRXGA2DSMBSGA>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
Thank you so much for the response. I understand its very difficult to focus on all pipelines given the tool usage and appreciate your kind response. I just want to know if the significant variant in the loci shouldl be present in the credible set or not. Regards |
The top snp should be in the strongest cs from my perspective. |
@leoarrow1 @akhilpampana these problems typically has to do with problematic bioinformatics steps to unify the alleles between summary statistics and LD. The discrepancy between z-scores and LD reference is also a secondary concern. We do have some pipelines that are still work in progress, to try cope with these issues. It contains two parts:
the 2nd item is still work in progress not ready for public use, although we have a separate branch of susieR to implement the prototype. The entire procedure is outlined here althought at this point it might be the most helpful to compare notes of your procedure with our basic QC for summary statistics. |
I think I have figured out my issue regarding the discrepancy of the plot but I will checkthrough the tutorial mentioned above just to be sure and test the analysis in the updated version of the ld matrix. Thank you so much @gaow for the pipeline and @leoarrow1 for the suggestion Regards |
Thank you for sharing @akhilpampana. If you could share in more detail what was the issue and how you solved it, this might be helpful for us (and others) to know. |
This will often be true, but sometimes not. See Fig. 1 of the susie paper for an illustration of this. |
Also encountered "The estimated prior variance is unreasonably large" using ieugwasr ld_matrix_local function to calculate the ld matrix. As I have checked the original code of ld_matrix_local function in ieugwasr package in GitHub, when calculating the ld matrix ld_matrix_local does have "--keep-allele-order" argument in the plink command. But such error still exists. Please advice.
The text was updated successfully, but these errors were encountered: