-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] T4 Lysozyme input files #66
base: master
Are you sure you want to change the base?
Conversation
I plan to upload the solvated systems soon. A few questions:
|
Thanks, @andrrizzi .
I think it'd be good to also have the PDBs since these are very helpful for some codes and also could assist if it is necessary to prepare other formats.
The short answer is: Please deposit them, and update the supplemental info to also list details of these. (See also work done on cyclodextrins and some of the other sets). We can update the paper to note this is available. The main reason the current set has a low number of ligands is that we were, in this work, primarily focused on issues roughly like those addressed in the SAMPLing challenge -- assessing methods to ensure whether and how well they converge to known gold standard answers (when available) or at least how well they deal with the issues which are known to be important -- and less on accuracy relative to experiment. If you're interested in method efficiency separated from force field issues, and thus not in comparing to experiment, the number of ligands is less of a factor (as in the case of the SAMPLing challenge). Eventually I need to update the paper to make this point and the terminology slightly more clear on this as John was confused on this too, but I haven't had time. |
Oh, and also @andrrizzi
Good point. Any suggestions? One approach would be for me to make a separate data package in eScholarship which would get its own DOI and we could link to that from here. That's not under automatic version control (though I can upload versioned .tar.gz files), but it's free and the UC libraries maintain it so it's safe for the long haul. There are probably also other options but presumably they would be paid. |
Sounds good, thanks!
Nothing specific comes to mind right now. I agree that we want to have versioned input files. If we want to keep things free, an alternative to manual versioning could be to split this into multiple github repos (e.g. |
@andrrizzi - we are good to go now, I've added the 50 GB so you should be able to add those files. |
Awesome! Thank you! |
Still missing the README/paper update, and the layout may need a little polishing but the files are here. I'll try to wrap up this PR next week. |
@andrrizzi - any updates? |
Sorry, I got caught up in other work. I'll try to find some time next week to finish this up. |
@mkgilson I've uploaded in this branch the receptors in PDB format and ligands in mol2 format.
The receptors are clean (no crystal waters) and they went through MCCE to obtain the most relevant protonation states at the experimental pH. The ligands have been generated from the SMILES and docked into the receptors with OpenEye. The protonation state is the one Epik predicted. Each folder has a JSON file describing each ligand with associated reference and experimental conditions/measurements.
I haven't had the chance to put together a README and to organize the prmtop/inpcrd yet, but as far as I understood, the PDB files are the priority at this stage. I'll try to get the rest in by the end of the week.