GasketRAG: Systematic Alignment of Large Language Models with Retrievers

Install

Make sure pixi is already installed, then the environment can be conveniently installed with:

pixi install

Download the generator model Llama3-8B-baseline, ColBERT index and datasets from https://github.com/fate-ubw/RAGLAB.

Preference Data Collection

Set your OpenAI API key in api_keys.txt.

Start ColBERT server:

sh run/colbert_server_wiki2018.sh

Raise the generator LLM with vLLM server:

sh run/generator_vllm.sh

Run preference data collection:

pixi run python labeller.py
cat data/labelled_training_data/triviaqa-labelled.jsonl > data/labelled_training_data/train_all.jsonl
cat data/labelled_training_data/hotpot-labelled.jsonl >> data/labelled_training_data/train_all.jsonl
pixi run python process_all_train_jsonl.py

KTO Train

The base model is meta-llama/Llama-3.1-8B-Instruct.

sh run/run_kto_llama.sh

Evaluation

Start up the gasket model:

sh run/gasket_vllm.sh

Also start the ColBERT server and the generator LLM.

Run the evaluation:

sh run/run_exp.sh

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
config/gasketrag		config/gasketrag
rag		rag
run		run
.gitignore		.gitignore
README.md		README.md
api_keys.txt		api_keys.txt
kto_llama.py		kto_llama.py
kto_trainer.py		kto_trainer.py
labeller.py		labeller.py
main-evaluation.py		main-evaluation.py
pixi.lock		pixi.lock
pixi.toml		pixi.toml
process_all_train_jsonl.py		process_all_train_jsonl.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GasketRAG: Systematic Alignment of Large Language Models with Retrievers

Install

Preference Data Collection

KTO Train

Evaluation

About

Releases

Packages

Languages

LiinXemmon/GasketRAG

Folders and files

Latest commit

History

Repository files navigation

GasketRAG: Systematic Alignment of Large Language Models with Retrievers

Install

Preference Data Collection

KTO Train

Evaluation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages