Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

I ran into a problem that the deep-cpf1 score generated from the command line is different from the results of the web version #61

Open
guomaoping opened this issue May 6, 2023 · 8 comments

Comments

@guomaoping
Copy link

guomaoping commented May 6, 2023

Hello, I ran into a problem that the deep-cpf1 score generated from the command line is different from the results of the web version( http://crispor.tefor.net/ ). Have you ever encountered this problem, what is the possible reason?
Thanks

@guomaoping guomaoping changed the title I ran into a problem that the deep-cpf1 score generated from the command line is different from the results of the web version, why? I ran into a problem that the deep-cpf1 score generated from the command line is different from the results of the web version May 6, 2023
@maximilianh
Copy link
Owner

maximilianh commented May 6, 2023 via email

@guomaoping
Copy link
Author

Thank you for your reply!
I use a 1000bp sequence as input, the command line is as follows:
"python3 $path/crisporWebsite/crispor.py --pam=ATTN --effScores=cpf1 --tempDir=./tmpdir hg38 1000bp.fa ATTN_1000bp_cli_scoreGuides.tsv"

The input sequence is as follows:
"ATTAATACTTTTAACAATTGTAGTATATAAAAAAGGGAGTAACCGAAAACGGTCGGGACCGAAAACGGTGTATATAAAAGATGTGAGAAACACACCACAATACTATGGCGCGCTTTGAGGATCCAACACGGCGACCCTACAAGCTACCTGATCTGTGCACGGAACTGAACACTTCACTGCAAGACATAGAAATAACCTGTGTATATTGCAAGACAGTATTGGAACTTACAGAGGTATTTGAATTTGCATTTAAAGATTTATTTGTGGTGTATAGAGACAGTATACCGCATGCTGCATGCCATAAATGTATAGATTTTTATTCTAGAATTAGAGAATTAAGACATTATTCAGACTCTGTGTATGGAGACACATTGGAAAAACTAACTAACACTGGGTTATACAATTTATTAATAAGGTGCCTGCGGTGCCAGAAACCGTTGAATCCAGCAGAAAAACTTAGACACCTTAATGAAAAACGACGATTTCACAACATAGCTGGGCACTATAGAGGCCAGTGCCATTCGTGCTGCAACCGAGCACGACAGGAACGACTCCAACGACGCAGAGAAACACAAGTATAATATTAAGTATGCATGGACCTAAGGCAACATTGCAAGACATTGTATTGCATTTAGAGCCCCAAAATGAAATTCCGGTTGACCTTCTATGTCACGAGCAATTAAGCGACTCAGAGGAAGAAAACGATGAAATAGATGGAGTTAATCATCAACATTTACCAGCCCGACGAGCCGAACCACAACGTCACACAATGTTGTGTATGTGTTGTAAGTGTGAAGCCAGAATTGAGCTAGTAGTAGAAAGCTCAGCAGACGACCTTCGAGCATTCCAGCAGCTGTTTCTGAACACCCTGTCCTTTGTGTGTCCGTGGTGTGCATCCCAGCAGTAAGCAACAATGGCTGATCCAGAAGGTACAGACGGGGAGGGCACGGGTTGTAACGGCTGGTTTTATGTACAAGCTATTGTAGACAAAAAAACAGGA"

The top 5 results of the command line are as follows:
1000bp_test 1forw ATTAATACTTTTAACAATTGTAGTATA -1 -1 30 NotEnoughFlankSeq GrafOK
1000bp_test 17forw ATTGTAGTATATAAAAAAGGGAGTAAC -1 -1 14 NotEnoughFlankSeq GrafOK
1000bp_test 98rev ATTGTGGTGTGTTTCTCACATCTTTTA -1 -1 15 62.107296 tt
1000bp_test 190rev ATTTCTATGTCTTGCAGTGAAGTGTTC -1 -1 7 38.360176 tt
1000bp_test 205forw ATTGCAAGACAGTATTGGAACTTACAG -1 -1 2 67.37283 GrafOK

The top 5 results of the web version are as follows:
#guideId targetSeq mitSpecScore cfdSpecScore offtargetCount targetGenomeGeneLocus DeepCpf1-Score grafType
1forw ATTAATACTTTTAACAATTGTAGTATA -1 -1 30 NotEnoughFlankSeq GrafOK
17forw ATTGTAGTATATAAAAAAGGGAGTAAC -1 -1 14 NotEnoughFlankSeq GrafOK
98rev ATTGTGGTGTGTTTCTCACATCTTTTA -1 -1 15 44.11671 tt
190rev ATTTCTATGTCTTGCAGTGAAGTGTTC -1 -1 7 25.683983 tt
205forw ATTGCAAGACAGTATTGGAACTTACAG -1 -1 2 51.02048 GrafOK

environment.txt
ATTN_1000bp_cli_scoreGuides.csv
ATTN_1000bp_web_scoreGuides.csv

@maximilianh
Copy link
Owner

maximilianh commented May 12, 2023 via email

@guomaoping
Copy link
Author

Thank you very much for your reply.
This sequence is a sequence intercepted from HPV18.
I have compared the versions of the deep learning packages in my installation environment, and they are the same as those specified in the "requirements.txt" file. The package versions are: keras==2.11.0, scikit-learn==1.2.2, scipy==1.10.1, tensorflow==2.11.0

However, I noticed that line 54 of the "INSTALL.md" file reads "I am using keras/tensorflow 2.1.1. I hope that the exact version is not important," which is inconsistent with "tensorflow==2.11.0, keras==2.11.0" in the "requirements.txt" file. Which version should I choose?

In addition, I compared the DeepCpf1 scores of the crispor.tefor.net version and the crispor.gi.ucsc.edu version, and found that they are indeed different.

The result from the command-line version that I calculated is consistent with the crispor.gi.ucsc.edu version, but different from the crispor.tefor.net version.
Here are the links to the results:
Result of the crispor.tefor.net version: http://crispor.tefor.net/crispor.py?batchId=ASazfpnUeH14sdYvIC02

Results of the crispor.gi.ucsc.edu version (the one you provided): http://crispor.gi.ucsc.edu/crispor.py?batchId=ASazfpnUeH14sdYvIC02

Thank you for your help and guidance on this matter.

@maximilianh
Copy link
Owner

maximilianh commented May 15, 2023 via email

@maximilianh
Copy link
Owner

maximilianh commented May 15, 2023 via email

@maximilianh
Copy link
Owner

Hi, any news from this? Were you able to confirm if the crispor.tefor.net scores are the correct ones?

I have another score, the saCas9 score, that has the same problem. I think I'll have to require a python2. The authors of that algorithm also use the Microsoft code and the Microsoft people are not available anymore to update the pickle files.

@maximilianh
Copy link
Owner

maximilianh commented Jun 22, 2023 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants