This repository demonstrates how to use the Pegasus Workflow Management System (WMS) to compare the accuracy of supervised fine-tuned (SFT) and pretrained models available in the HuggingFace model repository.
Pegasus WMS is a powerful tool for managing and executing complex workflows on distributed computing resources. In this example, we use its ability to run independent computations in parallel, making it easy to compare several models efficiently.
- prepare.py: fetches and prepares the dataset
- evaluate.py: evaluates the performance of a single model
- aggregate.py: aggregates the results of the evaluation steps
- workflow.py: builds and submits the workflow
The example included in this repository utilizes the Yelp review dataset. This dataset contains reviews along with their associated ratings, making it suitable for training and evaluating various natural language processing models.
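The Yelp review dataset labels each review with one of five star-rating classes, so each evaluation step ultimately reduces to comparing a model's predicted classes against the gold labels. A minimal sketch of that scoring logic (the helper names here are illustrative, not the actual evaluate.py API):

```python
from collections import Counter


def accuracy(predictions, labels):
    """Fraction of predicted star-rating classes that match the gold labels."""
    if not predictions or len(predictions) != len(labels):
        raise ValueError("predictions and labels must be non-empty and of equal length")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def label_distribution(labels):
    """Count how many reviews fall into each star-rating class."""
    return Counter(labels)
```

In the workflow, a score like this would be computed once per model and written out for the aggregation step to collect.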
To run the example:
- Clone this repository to your Linux machine.
- Create a virtual environment using Python:
python3 -m venv env
source env/bin/activate
- Install the requirements:
pip install -r requirements.txt
- Run the workflow:
./workflow.py --models bert-base-cased albert-base-v2 --batch-size 8
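The README shows three options: --models, --batch-size, and (below) --image. A sketch of the kind of argument parsing workflow.py might perform; the defaults and help strings are illustrative assumptions, not the script's actual interface:

```python
import argparse


def build_parser():
    # --models, --batch-size, and --image are the flags shown in this README;
    # everything else here is an assumed, illustrative choice.
    parser = argparse.ArgumentParser(
        description="Build and submit the model-comparison workflow"
    )
    parser.add_argument("--models", nargs="+", required=True,
                        help="HuggingFace model identifiers to evaluate")
    parser.add_argument("--batch-size", type=int, default=8,
                        help="Batch size used during evaluation")
    parser.add_argument("--image", default=None,
                        help="Path to a locally built Singularity image (.sif)")
    return parser
```

With `nargs="+"`, the invocation above yields `args.models == ["bert-base-cased", "albert-base-v2"]`, which the script can then fan out into one evaluation job per model.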
The workflow uses Singularity containers to execute each step. In the default setup, a container is created using the prebuilt Docker image from DockerHub.
To build the Singularity image locally, execute the following commands:
docker build -t compare-llms-workflow .
singularity build base.sif docker-daemon://compare-llms-workflow:latest
Next, specify the path to the built image using the --image option.
./workflow.py --image $PWD/base.sif ...
Once the computations are finished, the results will be aggregated into agg.csv and rendered as plots for easy interpretation (agg.pdf).
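Since each evaluation step produces its own result file, the aggregation step mainly concatenates and ranks them. A minimal sketch of that idea, assuming each per-model result is a CSV with model and accuracy columns (an assumption; the repository's actual file layout is not shown here):

```python
import csv


def aggregate(result_files, out_path="agg.csv"):
    """Merge per-model result CSVs (assumed columns: model, accuracy)
    into a single CSV sorted by accuracy, best model first."""
    rows = []
    for path in result_files:
        with open(path, newline="") as f:
            rows.extend(csv.DictReader(f))
    # Rank models from most to least accurate.
    rows.sort(key=lambda row: float(row["accuracy"]), reverse=True)
    with open(out_path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=["model", "accuracy"])
        writer.writeheader()
        writer.writerows(rows)
    return rows
```

Plot rendering (agg.pdf) would then read the merged table and chart accuracy per model.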