The prediction results of Chai-1 protein-ligand were significantly different from those of RoseTTA-Fold-All-Atom #42

biochristmas · 2024-09-13T03:24:31Z

[Image: output of Chai-1]

[Image: output alignment]

[Image: output of RFAA]

Hi, to test Chai-1, I ran the protein-ligand example from RoseTTAFold-All-Atom, but the result is quite different from the RoseTTAFold-All-Atom prediction: the RMSD between the two structures is 21.452 Å. Is this normal? Here is the content of the FASTA file I used for the prediction:

protein|7QXR_1
TATGDEWWAKCKQVDVLDSEMSYYDSDPGKHKNTVIFLHGNPTSSYLWRNVIPHVEPLARCLAPDLIGMGKSGKLPNHSYRFVDHYRYLSAWFDSVNLPEKVTIVCHDWGSGLGFHWCNEHRDRVKGIVHMESVVDVIESWDEWPDIEEDIALIKSEAGEEMVLKKNFFIERLLPSSIMRKLSEEEMDAYREPFVEPGESRRPTLTWPREIPIKGDGPEDVIEIVKSYNKWLSTSKDIPKLFINADPGFFSNAIKKVTKNWPNQKTVTVKGLHFLQEDSPEEIGEAIADFLNELT
ligand|NSW
c1c(ccc(Cn2[nH]c3c(Cc4ccccc4)nc(c4ccc(cc4)O)c[n+]3c2=O)c1)O
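A note on reading the RMSD above: a figure such as 21.452 Å depends on which atoms are compared and on how the two structures were superposed, and the post does not specify either. The following is a minimal sketch of the RMSD formula itself, assuming the two coordinate lists are already superposed and matched atom by atom. The function name `rmsd` and the toy coordinates are illustrative only and are not taken from Chai-1 or RFAA.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two lists of (x, y, z) tuples.

    Assumes both lists are already superposed and list the same atoms
    in the same order; no alignment is performed here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equally long")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


# Toy coordinates (hypothetical): every atom is shifted by 3 Å along z.
toy_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
toy_b = [(0.0, 0.0, 3.0), (1.0, 0.0, 3.0)]
print(rmsd(toy_a, toy_b))
```

When comparing ligand poses, a common choice is to superpose on the protein backbone first and then compute the RMSD over the ligand heavy atoms only, so that a misplaced ligand is not hidden by a global fit.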

biochristmas · 2024-09-13T03:40:35Z

aggregate_score:
[0.15987574]

ptm:
[0.27837044]

iptm:
[0.13025206]

per_chain_ptm:
[[0.27259728, 0.7072335 ]]

per_chain_pair_iptm:
[[[0.27259728, 0.03284407],
[0.13025206, 0.7072335 ]]]

has_clashes:
[0.]

per_chain_intra_clashes:
[[0., 0.]]

per_chain_pair_inter_clashes:
[[[0., 0.],
[0., 0.]]]

These are the scores for the predicted result. The ipTM seems very low. Could this be because I manually updated several scripts that were recently changed in the GitHub repository?
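As a sanity check on the numbers above, the posted values are consistent with the aggregate score being a weighted sum of 0.2 × pTM + 0.8 × ipTM (the weighting commonly used for ranking multimer predictions). That weighting is an assumption inferred from the printed values, not something stated in this thread. The sketch below recomputes it and also checks two patterns visible in this output: the diagonal of `per_chain_pair_iptm` matches `per_chain_ptm`, and the reported `iptm` equals the largest off-diagonal entry.

```python
# Values copied from the score printout above.
ptm = 0.27837044
iptm = 0.13025206
aggregate_score = 0.15987574
per_chain_ptm = [0.27259728, 0.7072335]
per_chain_pair_iptm = [
    [0.27259728, 0.03284407],
    [0.13025206, 0.7072335],
]

# Assumed weighting: 0.2 * pTM + 0.8 * ipTM.
recomputed = 0.2 * ptm + 0.8 * iptm
print(f"recomputed aggregate: {recomputed:.8f}")

# Diagonal entries of the pair matrix versus the per-chain pTM values.
diagonal = [per_chain_pair_iptm[i][i] for i in range(len(per_chain_ptm))]
print("diagonal matches per_chain_ptm:", diagonal == per_chain_ptm)

# Off-diagonal entries are the protein-ligand interface scores.
off_diagonal = [
    per_chain_pair_iptm[i][j]
    for i in range(2)
    for j in range(2)
    if i != j
]
print("max off-diagonal:", max(off_diagonal))
```

Read this way, the low aggregate score is driven almost entirely by the weak protein-ligand interface confidence rather than by clashes, which are all zero in this sample.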

kimdn · 2024-09-13T16:34:01Z

If Chai-1 predicts more accurately than RFAA, isn't it expected that they differ?
https://www.chaidiscovery.com/blog/introducing-chai-1

navvye · 2024-09-13T16:53:24Z

Interesting. Do you have a ground truth so that you can compare the two predictions?

biochristmas · 2024-09-13T16:56:02Z

I performed a comparison, and the results show that the structure provided by RFAA is closer to the experimental structure in the PDB. This discrepancy might be due to my not using MSA information during the Chai-1 run. I am currently exploring how to incorporate MSA information and would greatly appreciate any suggestions you could offer.

arogozhnikov · 2024-09-14T01:12:38Z

@biochristmas we'll add some examples of how to pass MSAs, but the simplest way right now is to use the web server: the MSA search is automated there.

Here is your example vs PDB when I run it on server:

arogozhnikov · 2024-09-14T01:14:48Z

Also, @biochristmas, from your description it isn't clear how many samples you generated and how you selected the best sample (the server selects the best one by default, but in code we return all samples).

biochristmas · 2024-09-14T01:29:49Z

Thank you very much for your prompt response. After modifying the input and output paths in example/predict_structure.py, I ran predictions on the case mentioned earlier and generated a total of 5 samples. However, the ipTM scores in the resulting npz files appear to be quite low. I am very much looking forward to the example on how to pass the MSA.
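Regarding the earlier question about how the best sample is selected when the code returns all of them: a minimal sketch, assuming each sample's `aggregate_score` has already been read from its score file into a dictionary keyed by sample index. The helper name `pick_best_sample` and the score values below are placeholders made up for illustration; they are not the actual values from this run and not part of the Chai-1 code.

```python
def pick_best_sample(scores_by_sample):
    """Return (sample_index, aggregate_score) for the highest-scoring sample.

    `scores_by_sample` maps a sample index to its aggregate score.
    """
    if not scores_by_sample:
        raise ValueError("no samples to choose from")
    best_index = max(scores_by_sample, key=scores_by_sample.get)
    return best_index, scores_by_sample[best_index]


# Placeholder scores for five samples (hypothetical, for illustration only).
example_scores = {0: 0.16, 1: 0.21, 2: 0.12, 3: 0.19, 4: 0.18}
best_index, best_score = pick_best_sample(example_scores)
print(best_index, best_score)
```

Ranking by the aggregate score is only one possible criterion; when the ligand pose is the main interest, ranking by the protein-ligand interface score alone may be worth comparing.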

biochristmas · 2024-09-14T01:33:46Z

scores.model_idx_0.txt
scores.model_idx_1.txt
scores.model_idx_2.txt
scores.model_idx_3.txt
scores.model_idx_4.txt

biochristmas · 2024-09-14T01:43:52Z

output_pdb.zip
This is the PDB file for the five output results I obtained from the run.

GXcells · 2024-09-16T21:01:28Z

@arogozhnikov

Was an MSA used here? Is it on by default on the web server? It would be interesting to see results with and without MSA on the web server, to compare with @biochristmas's local results.

YangPH0624 · 2024-10-03T09:28:46Z

Also, @biochristmas, from your description it isn't clear how many samples you generated and how you selected the best sample (the server selects the best one by default, but in code we return all samples).

You have done an amazing job, but the server has not supported MSAs recently. When will it be possible to run with MSAs locally?

jackdent · 2024-10-04T01:20:31Z

@YangPH0624 please see this issue: #73

arogozhnikov · 2024-10-19T06:52:53Z

Closing. The MSA/MSAContext discussion should continue in #73; the corresponding PR is tracked in #109.

arogozhnikov added the "question" label (Further information is requested) on Sep 14, 2024

wukevin mentioned this issue Oct 15, 2024

Support for MSA contexts and .aligned.pqt format #109

Merged

arogozhnikov closed this as completed Oct 19, 2024
