-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
causal sites #53
Comments
Thanks for your comment, and I think it sounds like a great idea. I will try modifying the codes and post a pull request about it really soon. |
Just a suggestion to make sure the API is future proof, even when we have not implemented all possibilities just yet. |
Instead of changing |
That would make sense. The main goal here is I think is to add a layer of realism to the simulations by specifying regions of the genome on which we might have prior knowledge. The obvious thing being coding regions. The simplest way would probably be the way you propose using a list of |
I think providing the option to specify the actual causal sites is the most flexible thing, that way users can do arbitrarily complex things to choose them. There's no point in getting into choosing randomly from some other set, since that's trivial to do using numpy anyway. |
I just made a modification to the |
The current API assumes we provide
num_causal_sites
. It might make more sense to turn this intocausal_sites
and allow both integer values as well as numpy arrays. The case of an integer is the currently implemented behaviour. In case of an array, these should all be sites contained in the sites table. We would then no longer randomly pick sites.Alternatively, it might make more sense to provide an example in the documentation on how to mask certain regions of the genome with tree sequences, in case we have knowledge of coding vs non-coding regions for example.
The text was updated successfully, but these errors were encountered: