How to interpret the number of anomaly counts? #123
-
Hi :^) I really like the ‘gimble query’ function, which gives a very clean summary of the ‘gimble optimize’ outputs. However, I was wondering how I shall interpret the number of anomaly counts (e.g., does a large number of anomaly counts mean that a model fits badly, despite it shows a high lnCL value compared to the other models?) Detailed descriptions: For all IM models (regardless of which dataset I used), they tend to give high anomaly counts (around 5000 anomalies out of 10000) – I feel that cannot be a good thing (?) Is there any way that I could improve this abnormal behaviour? Very looking forward to your insightful advice, and many thanks in advance! Gratefully, |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Hi @MengLu-flw, The reported anomalies have to do with numerical stability issues that are sometimes encountered when computing the probabilities of seeing particular mutation configurations given a certain set of parameter combinations. These probabilities are computed using the |
Beta Was this translation helpful? Give feedback.
Hi @MengLu-flw,
Thanks for flagging this. We should indeed provide a bit more explanation on what the reported number of anomalies mean.
The reported anomalies have to do with numerical stability issues that are sometimes encountered when computing the probabilities of seeing particular mutation configurations given a certain set of parameter combinations. These probabilities are computed using the
agemo
library. A little bit of work is required on that front to improve the numerical stability (see this issue). There is nothing you can do really to improve this, apart perhaps from using a different set of random seeds to explore parameter space. However, given the large number of anomalie…