Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multiple records with the same position in the golden VCF file #81

Open
mohammedkhalfan opened this issue Sep 2, 2020 · 1 comment
Open

Comments

@mohammedkhalfan
Copy link

Describe the bug
Depending on the mutation rate, NEAT will produce a golden VCF with multiple variants found per position. If you then give this VCF back to NEAT with the -v option, NEAT will skip inserting these variants and tell you "X variants skipped due to multiple variants found per position". Should NEAT be producing a golden VCF with multiple variants found per position? This doesn't happen when the mutation rate is very low, but happens when M = 4.5% for example. Ex:

SARS-CoV2       1443    .       T       A       .       PASS    WP=1
SARS-CoV2       1443    .       T       C       .       PASS    WP=1

To Reproduce
Run NEAT genReads.py with a mutation model (-m) and set the mutation rate (-M) to 0.045.

Expected behavior
I don't expect there to be multiple variants at the same position in the golden VCF.

Screenshots
Here is an the error from NEAT if you provide the VCF as input:

reading input VCF...
found 1278 valid variants in input vcf.
 * 0 variants skipped: (qual filtered / ref genotypes / invalid syntax)
 * 12 variants skipped due to multiple variants found per position

Desktop (please complete the following information):
NEAT-genReads V2.0

@jdonzallaz
Copy link

Hi, I have the same problem using default mutation rate/model. Any update ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants