This folder contains the implementation of LoRA in RoBERTa and DeBERTa V2 using the Python package `lora`. LoRA is described in the following pre-print:
LoRA: Low-Rank Adaptation of Large Language Models
Edward J. Hu*, Yelong Shen*, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen
Paper: https://arxiv.org/abs/2106.09685
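For background on what actually gets trained: LoRA freezes the pretrained weight W0 and learns a low-rank update ΔW = BA scaled by alpha/r, which is why only a few hundred thousand to a few million parameters are trainable in the experiments below. A minimal PyTorch sketch of the idea (illustrative only, not the code used by the scripts in this folder):

```python
import torch
import torch.nn as nn

class LoRALinearSketch(nn.Module):
    """A frozen pretrained weight W0 plus a trainable low-rank update (alpha / r) * B @ A."""
    def __init__(self, in_features: int, out_features: int, r: int = 16, alpha: int = 32):
        super().__init__()
        # Pretrained weight: kept frozen during adaptation.
        self.weight = nn.Parameter(torch.randn(out_features, in_features), requires_grad=False)
        # Low-rank factors: the only trainable parameters.
        self.lora_A = nn.Parameter(torch.randn(r, in_features) * 0.01)  # small random init
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))        # zero init, so the update is 0 at start
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        frozen = x @ self.weight.T
        update = (x @ self.lora_A.T) @ self.lora_B.T * self.scaling
        return frozen + update

layer = LoRALinearSketch(768, 768, r=16, alpha=32)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)  # 2 * 16 * 768 = 24576 trainable parameters vs. 589824 frozen ones
```

Only the low-rank factors are updated, which is where the 0.3M / 0.8M / 4.7M trainable-parameter counts in the table below come from.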
Our experiments on the GLUE benchmark are run on 4 NVIDIA Tesla V100 GPU cards out of a DGX-1. The results may vary due to different GPU models, drivers, CUDA SDK versions, floating-point precisions, and random seeds. We report below the dev set results, taking the median over 5 runs.
Here are the GLUE benchmark test set results for DeBERTa XXL 1.5B (no ensemble). The LoRA checkpoints for each task can be downloaded; each is only a few megabytes:

| Dataset | RoBERTa base 125M (LoRA, 0.3M params) | RoBERTa large 355M (LoRA, 0.8M params) | DeBERTa XXL 1.5B (LoRA, 4.7M params) |
| --- | --- | --- | --- |
| MNLI | 3.4 MB | 7.1 MB | 27.1 MB |
| SST2 | 3.4 MB | 7.1 MB | 27.1 MB |
| MRPC | 3.4 MB | 7.1 MB | 27.1 MB |
| CoLA | 3.4 MB | 7.1 MB | 27.1 MB |
| QNLI | 3.4 MB | 7.1 MB | 27.1 MB |
| QQP | 3.4 MB | 7.1 MB | 27.1 MB |
| RTE | 3.4 MB | 7.1 MB | 27.1 MB |
| STSB | 3.4 MB | 7.1 MB | 27.1 MB |
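The files stay this small because only the LoRA matrices are serialized, never the frozen backbone. A rough sketch of how that looks with the `lora` package installed below (assuming it is importable as `loralib`, its published name; the toy model and output filename are made up for illustration):

```python
import torch
import torch.nn as nn
import loralib as lora

# Toy model: one ordinary layer plus one LoRA-adapted projection.
model = nn.Sequential(
    nn.Linear(768, 768),
    lora.Linear(768, 768, r=16, lora_alpha=32),
)

# Freeze everything except the lora_A / lora_B factors.
lora.mark_only_lora_as_trainable(model)

# ... fine-tune on a GLUE task here ...

# Save only the LoRA parameters: a few MB instead of the full multi-GB backbone.
torch.save(lora.lora_state_dict(model), "toy_lora_only.bin")
```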
Create the conda environment:

```bash
conda env create -f environment.yml
```

Install `lora`:

```bash
pip install -e ..
```

Install the NLU example package:

```bash
pip install -e .
```
Start adapting DeBERTa V2 XXL to a GLUE task by running the corresponding script:

- `deberta_v2_xxlarge_mnli.sh`
- `deberta_v2_xxlarge_sst2.sh`
- `deberta_v2_xxlarge_mrpc.sh`
- `deberta_v2_xxlarge_cola.sh`
- `deberta_v2_xxlarge_qnli.sh`
- `deberta_v2_xxlarge_qqp.sh`
- `deberta_v2_xxlarge_rte.sh`
- `deberta_v2_xxlarge_stsb.sh`
For MRPC, RTE, and STSB, you need to download and start from the LoRA-adapted MNLI checkpoint and change the path accordingly in the shell script.
Note: `xxlarge-mnli` here refers to the LoRA-adapted model from our MNLI experiments above, not https://huggingface.co/microsoft/deberta-v2-xxlarge-mnli.
We also provide the shell scripts for roberta-base and roberta-large (`{roberta_base|roberta_large}_{task name}.sh`).
To evaluate a LoRA-adapted checkpoint, run `run_glue.py` with `--do_eval` and point `--lora_path` at the LoRA weights, e.g.:

```bash
python -m torch.distributed.launch --nproc_per_node=1 examples/text-classification/run_glue.py \
  --model_name_or_path microsoft/deberta-v2-xxlarge \
  --lora_path ./deberta_v2_xxlarge_lora_mnli.bin \
  --task_name mnli \
  --do_eval \
  --output_dir ./output \
  --apply_lora \
  --lora_r 16 \
  --lora_alpha 32
```
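Roughly speaking, `--apply_lora` builds the attention projections as `lora.Linear` layers instead of plain `nn.Linear`, `--lora_r`/`--lora_alpha` set the rank and scaling of the low-rank update, and `--lora_path` supplies a state dict that contains only the LoRA matrices. The sketch below is a guess at the mechanics, not the actual `run_glue.py` code; the hidden size 1536 is that of DeBERTa V2 XXL:

```python
import torch
import loralib as lora

# --apply_lora --lora_r 16 --lora_alpha 32: projections like this replace nn.Linear.
# The low-rank update B @ A is scaled by lora_alpha / r (= 2.0 here).
proj = lora.Linear(1536, 1536, r=16, lora_alpha=32)

# --lora_path: a state dict holding only lora_A / lora_B tensors. It is applied on top
# of the pretrained weights with strict=False, since all non-LoRA keys are absent.
lora_weights = torch.load("./deberta_v2_xxlarge_lora_mnli.bin", map_location="cpu")
# model.load_state_dict(lora_weights, strict=False)  # done for the full model inside run_glue.py
```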
We also provide scripts for MNLI with additional training techniques:

- `mnli.cutoff.sh`
- `mnli.rdrop.sh`
If you use LoRA in your work, please cite the paper:

```bibtex
@misc{hu2021lora,
    title={LoRA: Low-Rank Adaptation of Large Language Models},
    author={Hu, Edward J. and Shen, Yelong and Wallis, Phillip and Allen-Zhu, Zeyuan and Li, Yuanzhi and Wang, Shean and Wang, Lu and Chen, Weizhu},
    year={2021},
    eprint={2106.09685},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}
```