Update README.md
PN269 authored Jun 6, 2024

1 parent 07c74b4 commit f55b102
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
@@ -1,7 +1,7 @@
# Leveraging a large language model to predict protein phase transition: a physical, multiscale and interpretable approach
We apply a unified modeling framework to predict protein phase transition (PPT). In classification task A, proteins with experimental evidence of undergoing a phase transition, forming either droplets or amyloids, are consolidated into a single dataset (+Droplet drivers and +Amyloids), and phase transition propensity is predicted against the preference to maintain the native soluble state (-PT). In classification task B, the unified dataset is used to predict the propensity to form droplets versus amyloid aggregates. To accomplish this, we fine-tune the ESM-2 model to predict PPT and compare its performance to biophysical knowledge-based models (e.g., random forest).

-![Alt text](./Files/Schematic.png?raw=true "Title")
+![Alt text](./Files/schematic.png?raw=true "Title")
This repository provides code for predicting protein phase transition (PPT) propensity, including two examples of AD-related proteins, their associated genes, and transcription factors.
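
The two classification tasks described above differ only in how the three protein categories map to binary labels. The following is a minimal sketch of that mapping, not the repository's actual code: the category strings (`"droplet"`, `"amyloid"`, `"soluble"`), the function names, and the 0/1 encoding are all assumptions made for illustration, and the toy sequences are placeholders rather than real proteins.

```python
from typing import Iterable, List, Optional, Tuple

# Hypothetical category names, chosen for illustration only;
# the repository's own label encoding may differ.
DROPLET = "droplet"
AMYLOID = "amyloid"
SOLUBLE = "soluble"


def task_a_label(category: str) -> int:
    """Task A: phase transition (droplet or amyloid) = 1, native soluble state (-PT) = 0."""
    if category in (DROPLET, AMYLOID):
        return 1
    if category == SOLUBLE:
        return 0
    raise ValueError(f"unknown category: {category!r}")


def task_b_label(category: str) -> Optional[int]:
    """Task B: droplet = 1, amyloid = 0; soluble proteins are not part of this task (None)."""
    if category == DROPLET:
        return 1
    if category == AMYLOID:
        return 0
    if category == SOLUBLE:
        return None
    raise ValueError(f"unknown category: {category!r}")


def build_task_datasets(
    records: Iterable[Tuple[str, str]],
) -> Tuple[List[Tuple[str, int]], List[Tuple[str, int]]]:
    """Split (sequence, category) records into labeled datasets for tasks A and B."""
    task_a: List[Tuple[str, int]] = []
    task_b: List[Tuple[str, int]] = []
    for sequence, category in records:
        task_a.append((sequence, task_a_label(category)))
        b_label = task_b_label(category)
        if b_label is not None:
            task_b.append((sequence, b_label))
    return task_a, task_b


# Toy placeholder records, not real protein sequences.
records = [("AAAA", DROPLET), ("GGGG", AMYLOID), ("KKKK", SOLUBLE)]
task_a, task_b = build_task_datasets(records)
```

Under these assumptions, task A keeps every record, while task B keeps only the proteins that undergo a phase transition. Either labeled list could then be passed to a fine-tuned sequence classifier or to a feature-based baseline such as a random forest.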

## Folder and related notebooks:
