Support finetuning with LoRA #431

katalinic-gc · 2023-06-26T09:08:31Z

What does this PR do?

To enable LoRA, apply the snippet below, which is essentially identical to the upstream usage.

from peft import LoraConfig, get_peft_model

config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    bias="none",
)
# `model` is the already-loaded model to finetune
model = get_peft_model(model, config)
model.print_trainable_parameters()
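
As a rough illustration of what `print_trainable_parameters()` reports, the sketch below counts the parameters one LoRA adapter adds to a single linear layer, using `r=16` and `lora_alpha=32` from the config above. The layer width `D_MODEL = 1024` and the helper names are hypothetical, chosen only for the arithmetic; the scaling factor assumes the standard `lora_alpha / r` convention.

```python
def lora_extra_params(d_in, d_out, r):
    """Parameters added by one LoRA adapter on a (d_out x d_in) linear layer.

    The adapter consists of A with shape (r, d_in) and B with shape (d_out, r).
    """
    return r * (d_in + d_out)


def lora_scaling(r, lora_alpha):
    """Factor applied to the low-rank update (standard LoRA convention)."""
    return lora_alpha / r


R, LORA_ALPHA = 16, 32   # values from the config above
D_MODEL = 1024           # hypothetical projection width, for illustration only

per_layer = lora_extra_params(D_MODEL, D_MODEL, R)
full_layer = D_MODEL * D_MODEL
fraction = per_layer / full_layer
scaling = lora_scaling(R, LORA_ALPHA)

print(f"adapter params per projection: {per_layer}")
print(f"fraction of the frozen weight: {fraction:.4f}")
print(f"scaling: {scaling}")
```

Only the adapter matrices are trained; the original `q_proj` and `v_proj` weights stay frozen, which is why the trainable fraction is small.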

Some online finetuning walkthroughs also add

    remove_unused_columns=False,  # required as the PeftModel forward doesn't have the signature of the wrapped model's forward
    label_names=["labels"],  # same reason as above

to the training args, e.g. (IPU)Seq2SeqTrainingArguments.
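
The signature issue mentioned in the comments above can be sketched with the standard library alone. This is a simplified illustration, not the actual trainer code: `WrappedModel`, `AdapterWrapper`, and `keep_columns` are hypothetical stand-ins, and the sketch assumes that unused columns are identified by inspecting the parameter names of the model's `forward`.

```python
import inspect


class WrappedModel:
    """Hypothetical base model whose forward names its inputs explicitly."""

    def forward(self, input_features, labels=None):
        return {"input_features": input_features, "labels": labels}


class AdapterWrapper:
    """Hypothetical wrapper whose forward only accepts *args and **kwargs."""

    def __init__(self, model):
        self.model = model

    def forward(self, *args, **kwargs):
        return self.model.forward(*args, **kwargs)


def signature_columns(model):
    # Parameter names visible on the bound forward method (self is excluded).
    return list(inspect.signature(model.forward).parameters)


def keep_columns(batch, model):
    # Drop every column whose name is not a parameter of model.forward.
    allowed = set(signature_columns(model))
    return {name: value for name, value in batch.items() if name in allowed}


batch = {"input_features": [0.1, 0.2], "labels": [1, 2]}
base = WrappedModel()
wrapped = AdapterWrapper(base)

print(signature_columns(base))       # named inputs of the base model
print(signature_columns(wrapped))    # only the generic args/kwargs
print(keep_columns(batch, base))     # all columns survive
print(keep_columns(batch, wrapped))  # every column is dropped
```

Under this assumption, the wrapper hides the real input names, so column filtering and label detection have nothing to match against; setting `remove_unused_columns=False` and `label_names=["labels"]` sidesteps both lookups.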

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2023-06-26T09:18:16Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

rrva · 2023-08-20T15:12:36Z

@katalinic-gc I would like to use this to finetune whisper-large-v2 on a fairly large dataset (700k examples). Is this usable as is (if I fix the merge conflicts)? What steps from https://www.graphcore.ai/posts/fine-tune-openais-whisper-automatic-speech-recognition-asr-model would need to be different with this approach?

katalinic-gc · 2023-08-21T08:51:12Z

> @katalinic-gc I would like to use this to finetune whisper-large-v2 on a fairly large dataset (700k examples). Is this usable as is (if I fix the merge conflicts)? What steps from https://www.graphcore.ai/posts/fine-tune-openais-whisper-automatic-speech-recognition-asr-model would need to be different with this approach?

It won't be usable for whisper-large-v2 on IPUs because it runs out of memory (OOM). We are working internally on supporting that; if and when it becomes available, we'll announce it.

katalinic-gc force-pushed the lora branch from ab51252 to fa3dba7 Compare July 5, 2023 17:41

katalinic-gc commented Jul 5, 2023

optimum/graphcore/pipelines/__init__.py (resolved)

katalinic-gc commented Jul 5, 2023

notebooks/whisper_lora.ipynb (outdated, resolved)

katalinic-gc changed the title ~~Minimal support for Whisper finetuning with LoRA~~ Whisper finetuning with LoRA Jul 5, 2023

katalinic-gc added 7 commits August 17, 2023 11:13

Add support for LoRA models

acf2ed3

Workaround random bug

a5c9ef1

Pass only trainable params to the optimizer in trainer

326984b

Add peft dependency

f4b0362

Fix edge case in peft + pipelines

6a443b0

Fix dropout check

98cb02a

Add notebook for Whisper LoRA

60fa56c

katalinic-gc marked this pull request as draft August 22, 2023 15:03

katalinic-gc force-pushed the lora branch from fa3dba7 to 60fa56c Compare September 1, 2023 06:59

katalinic-gc added 2 commits September 1, 2023 08:48

Simplify pipelines logic, requiring adapter weights to be merged in

7427191

Remove notebook for now

8f2065b
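
For context on the "requiring adapter weights to be merged in" commit above: merging folds the low-rank update into the frozen weight, so the merged model needs no adapter-specific code at inference time. The sketch below shows the standard LoRA merge arithmetic in plain Python; the helper names and the tiny matrices are hypothetical, and this is not the code from this PR. In PEFT, the corresponding operation is exposed as `merge_and_unload()`.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora(weight, lora_a, lora_b, r, lora_alpha):
    """Return weight + (lora_alpha / r) * (lora_b @ lora_a).

    weight: (d_out, d_in), lora_a: (r, d_in), lora_b: (d_out, r).
    """
    scaling = lora_alpha / r
    delta = matmul(lora_b, lora_a)
    return [
        [w + scaling * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


# Tiny hypothetical example: d_out = d_in = 2, r = 1, lora_alpha = 2.
weight = [[1.0, 0.0], [0.0, 1.0]]
lora_a = [[1.0, 2.0]]
lora_b = [[0.5], [0.0]]

merged = merge_lora(weight, lora_a, lora_b, r=1, lora_alpha=2)
print(merged)
```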

katalinic-gc marked this pull request as ready for review September 6, 2023 17:15

katalinic-gc changed the title ~~Whisper finetuning with LoRA~~ Support finetuning with LoRA Sep 6, 2023

katalinic-gc merged commit 8c4a1dd into huggingface:main Sep 6, 2023
2 of 3 checks passed

katalinic-gc deleted the lora branch September 6, 2023 18:40
