COS30018-Mitigate-Hallucination

COS30018 - Intelligent Systems. Learning and applying hallucination mitigation techniques on open LLMs.

Requirements

Developers are advised to use a virtual environment to install the required packages. Avoid using Hugging Face with Keras or TensorFlow 3+ as it may cause compatibility issues. You can create a virtual environment with conda or venv.

This project run on Python 3.12.3.

1. PyTorch for backend framework

Using pip

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu124

Or using conda

conda install pytorch torchvision torchaudio pytorch-cuda=12.4 -c pytorch -c nvidia

Verify that you install PyTorch correctly with CUDA support by running the following line and receive True:

python -c "import torch; print(torch.cuda.is_available())"

2. 🤗 Transformers for high-level APIs and other dependencies

For Hugging Face: transformers, datasets, evaluate, accelerate, huggingface_hub[cli], bitsandbytes
For evaluation: scikit-learn, sacrebleu, rouge-score, git+https://github.com/google-research/bleurt.git, pandas, numpy
For fine-tuning: peft, trl
For hallucination detection: selfcheckgpt nltk rouge spacy tensorflow
For visualization: gradio

pip install transformers==4.45.1 datasets evaluate accelerate huggingface_hub[cli] bitsandbytes peft trl scikit-learn sacrebleu rouge-score git+https://github.com/google-research/bleurt.git selfcheckgpt nltk rouge spacy tensorflow pandas numpy gradio

3. IPython for Jupyter Notebook in VSCode

pip install ipython ipywidgets

Usage

This repository is structured as a Python package. You should run any file from the parent directory by using the -m flag.

For example, to run the COS30018-Mitigate-Hallucination/hallucination_detection/evaluation/halueval.py file, you should run the following command:

python -m COS30018-Mitigate-Hallucination.hallucination_detection.evaluation.halueval

Hugging Face Access Token

Llama 3 is a private model, and you need to have access to it.

Go to the Hugging Face website and create an account.
Go to the Llama 3 model page and request access.
Once you have access, go to your user settings and create a new access token (read token is enough).
Copy the access token, open terminal in your python environment and run the following command:

huggingface-cli login –token <your-access-token> --add-to-git-credential”

Name		Name	Last commit message	Last commit date
Latest commit History 282 Commits
Evaluation		Evaluation
Final Evaluation		Final Evaluation
Finetuning		Finetuning
__pycache__/projectC		__pycache__/projectC
hallucination_detection		hallucination_detection
self_refine		self_refine
utilities		utilities
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
app.py		app.py
example.ipynb		example.ipynb
example.py		example.py
test_fewshot_template.py		test_fewshot_template.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

COS30018-Mitigate-Hallucination

Requirements

1. PyTorch for backend framework

2. 🤗 Transformers for high-level APIs and other dependencies

3. IPython for Jupyter Notebook in VSCode

Usage

Hugging Face Access Token

About

Releases

Packages

Contributors 4

Languages

khangdzox/COS30018-Mitigate-Hallucination

Folders and files

Latest commit

History

Repository files navigation

COS30018-Mitigate-Hallucination

Requirements

1. PyTorch for backend framework

2. 🤗 Transformers for high-level APIs and other dependencies

3. IPython for Jupyter Notebook in VSCode

Usage

Hugging Face Access Token

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages