Skip to content

Why each tree contain a very small number of SNPs #690

Answered by hyanwong
silvewheat asked this question in Q&A
Discussion options

You must be logged in to vote

That seems about right, no? If recombination and mutation rates are roughly equal, then we would expect 1 site per tree? He's an example from simulated data:

import msprime
import numpy as np

np.mean([
    ts.num_trees / ts.num_sites
     for ts in msprime.simulate(
         100, length=1e8, recombination_rate=1e-8, mutation_rate=1e-8, num_replicates=100
     )]
)  # should be about 1

Because edges span across many trees, it's fine to have few mutations on each specific tree. What you really want (but might not get) is one mutation per edge.

Replies: 2 comments 3 replies

Comment options

You must be logged in to vote
1 reply
@hyanwong
Comment options

Answer selected by silvewheat
Comment options

You must be logged in to vote
2 replies
@hyanwong
Comment options

@silvewheat
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants