LORE

A Literature Semantics Framework for Evidenced Disease-Gene Pathogenicity Prediction at Scale

Source code authors:

Li Peng-Hsuan (李朋軒) @ ailabs.tw (jacobvsdanniel [at] gmail.com)

Introduction

This repo hosts the source codes for LORE (LLM-based Open Relation Extraction and Embedding). We applied LORE to PubMed abstracts for large-scale understanding of disease-gene relationships and created the PMKB-CV knowledge graph. PMKB-CV contains 2K diseases, 600K disease-gene pairs, 11M disease-gene relations, embeddings, and predicted pathogenicity scores. This resource covers 200x more disease-gene pairs than ClinVar, and the predicted pathogenicity scores achieve an 80% Mean Average Precision (MAP) in ranking pathogenic genes for diseases.

For more details, see our paper:

Peng-Hsuan Li, Yih-Yun Sun, Hsueh-Fen Juan, Chien-Yu Chen, Huai-Kuang Tsai, and Jia-Hsin Huang. 2024. LORE: A Literature Semantics Framework for Evidenced Disease-Gene Pathogenicity Prediction at Scale.

The PMKB-CV knowledge graph is publicly available at:

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
gpt_direct_ask		gpt_direct_ask
key_semantics_curation		key_semantics_curation
llm_emb		llm_emb
llm_ore		llm_ore
ml_ranker		ml_ranker
LICENSE		LICENSE
README.md		README.md
gpt_direct_ask.py		gpt_direct_ask.py
key_semantics_curation.py		key_semantics_curation.py
llm_emb.py		llm_emb.py
llm_ore.py		llm_ore.py
ml_ranker.py		ml_ranker.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LORE

Introduction

About

Releases

Packages

Languages

License

ailabstw/LORE

Folders and files

Latest commit

History

Repository files navigation

LORE

Introduction

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages