Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The prediction results of Chai-1 protein-ligand were significantly different from those of RoseTTA-Fold-All-Atom #42

Closed
biochristmas opened this issue Sep 13, 2024 · 13 comments · Fixed by #109
Labels
question Further information is requested

Comments

@biochristmas
Copy link

image
output of Chai-1
image
output alignment
image
output of RFAA

Hi, in order to test the Chai-1, I used the protein-ligand example of RoseTTAFold-All-Atom for testing, but the result seemed to be quite different from the predicted result of RoseTTAFold-All-Atom, and the RMSD of the comparison of the two structures was 21.452 angstroms. Is this a normal situation? The following is the content of the fasta file I predict to use:

protein|7QXR_1
TATGDEWWAKCKQVDVLDSEMSYYDSDPGKHKNTVIFLHGNPTSSYLWRNVIPHVEPLARCLAPDLIGMGKSGKLPNHSYRFVDHYRYLSAWFDSVNLPEKVTIVCHDWGSGLGFHWCNEHRDRVKGIVHMESVVDVIESWDEWPDIEEDIALIKSEAGEEMVLKKNFFIERLLPSSIMRKLSEEEMDAYREPFVEPGESRRPTLTWPREIPIKGDGPEDVIEIVKSYNKWLSTSKDIPKLFINADPGFFSNAIKKVTKNWPNQKTVTVKGLHFLQEDSPEEIGEAIADFLNELT
ligand|NSW
c1c(ccc(Cn2[nH]c3c(Cc4ccccc4)nc(c4ccc(cc4)O)c[n+]3c2=O)c1)O

@biochristmas
Copy link
Author

aggregate_score:
[0.15987574]

ptm:
[0.27837044]

iptm:
[0.13025206]

per_chain_ptm:
[[0.27259728, 0.7072335 ]]

per_chain_pair_iptm:
[[[0.27259728, 0.03284407],
[0.13025206, 0.7072335 ]]]

has_clashes:
[0.]

per_chain_intra_clashes:
[[0., 0.]]

per_chain_pair_inter_clashes:
[[[0., 0.],
[0., 0.]]]

This is the score information of the predicted result, it seems that ipTM is very low, is it because I manually updated several scripts that have been updated in the github repository recently?

@kimdn
Copy link

kimdn commented Sep 13, 2024

If Chai-1 results in more accurate prediction than RFAA, then no wonder they differ?
https://www.chaidiscovery.com/blog/introducing-chai-1

@navvye
Copy link

navvye commented Sep 13, 2024

Interesting. Do you have a ground truth so that you can compare the two predictions?

@biochristmas
Copy link
Author

I performed a comparison, and the results show that the structure provided by RFAA is closer to the experimental structure in the PDB. This discrepancy might be due to my not using MSA information during the Chai-1 run. I am currently exploring how to incorporate MSA information and would greatly appreciate any suggestions you could offer.

@arogozhnikov
Copy link
Contributor

arogozhnikov commented Sep 14, 2024

@biochristmas we'll add some examples how to pass MSAs, but the simplest way right now is to use web server - search for MSAs is automated there.

Here is your example vs PDB when I run it on server:

screenshot_2024-09-12_at_9 55 11___pm_720

@arogozhnikov
Copy link
Contributor

also @biochristmas from you description it isn't clear how many samples your generated and how you selected best sample (that's what server does by default, but in code we return all samples)

@arogozhnikov arogozhnikov added the question Further information is requested label Sep 14, 2024
@biochristmas
Copy link
Author

Thank you very much for your prompt response. After modifying the input and output paths in example/predict_structure.py, I ran predictions on the case mentioned earlier and generated a total of 5 samples. However, the ipTM scores in the resulting npz files appear to be quite low. I am very much looking forward to the example on how to pass the MSA.

@biochristmas
Copy link
Author

image
scores.model_idx_0.txt
scores.model_idx_1.txt
scores.model_idx_2.txt
scores.model_idx_3.txt
scores.model_idx_4.txt

@biochristmas
Copy link
Author

output_pdb.zip
This is the PDB file for the five output results I obtained from the run.

@GXcells
Copy link

GXcells commented Sep 16, 2024

arogozhnikov

Was there MSA here ? Is it ON by default on the web server? Would be interesting to see with/without MSA on web server to compare to @biochristmas local results

@YangPH0624
Copy link

also @biochristmas from you description it isn't clear how many samples your generated and how you selected best sample (that's what server does by default, but in code we return all samples)

You have done an amazing job, but the server has recently not supported MSA. When will it be possible to perform MSA locally?

@jackdent
Copy link
Contributor

jackdent commented Oct 4, 2024

@YangPH0624 please see this issue: #73

@arogozhnikov
Copy link
Contributor

arogozhnikov commented Oct 19, 2024

Closing, MSA/MSAContext discussion should be held in #73, and track corresponding PR in #109

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

Successfully merging a pull request may close this issue.

7 participants