Distributed fine tuning of LLMs #49

Open
Tracked by #38
Shreyanand opened this issue May 25, 2023 · 0 comments
Shreyanand commented May 25, 2023

The fine-tuning notebook uses a single GPU and the LoRA technique to fine-tune a T5 model with 3B parameters. The task for this issue is to fine-tune the same model (or the 7B version of the model) on multiple GPU nodes. Use InstaScale and CodeFlare to schedule the training job and retrieve the fine-tuned model, and create a notebook that demonstrates this.
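
For context, a minimal sketch of the single-GPU LoRA baseline described above, using Hugging Face `transformers` + `peft`. The checkpoint name and LoRA hyperparameters here are assumptions for illustration, not values taken from the notebook:

```python
# Hypothetical single-GPU LoRA baseline (not the notebook's exact code).
# Assumes the ~3B flan-t5-xl checkpoint; the notebook may use a different one.
from transformers import AutoModelForSeq2SeqLM
from peft import LoraConfig, TaskType, get_peft_model

model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-xl")  # ~3B parameters

lora_config = LoraConfig(
    task_type=TaskType.SEQ_2_SEQ_LM,
    r=8,                        # low-rank adapter dimension (assumed value)
    lora_alpha=32,              # adapter scaling factor (assumed value)
    lora_dropout=0.05,
    target_modules=["q", "v"],  # T5's query/value attention projections
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the small adapter weights are trainable
```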

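And a sketch of how the multi-node cluster for the distributed job might be requested with the CodeFlare SDK and InstaScale. Cluster sizes and machine types are assumptions, and the parameter names follow the 2023-era `codeflare-sdk` API, which may differ in newer releases:

```python
# Hypothetical CodeFlare/InstaScale cluster request (2023-era SDK parameter
# names; later releases rename some fields, e.g. num_workers / num_gpus).
from codeflare_sdk.cluster.cluster import Cluster, ClusterConfiguration

cluster = Cluster(ClusterConfiguration(
    name="t5-finetune",            # assumed names and sizes throughout
    namespace="default",
    min_worker=2,
    max_worker=2,
    min_cpus=8,
    max_cpus=8,
    min_memory=64,                 # GiB per worker
    max_memory=64,
    gpu=1,                         # GPUs per worker
    instascale=True,               # let InstaScale provision the machine_types
    machine_types=["g4dn.xlarge"],
))

cluster.up()          # submit the Ray cluster request
cluster.wait_ready()  # block until InstaScale has scaled up the nodes
cluster.details()

# ... submit the fine-tuning script to the Ray cluster, retrieve the model ...

cluster.down()        # release the nodes when training finishes
```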
Shreyanand changed the title from "How can we fine tune small models with limited resources, and can we use ray?" to "Distributed fine tuning of LLMs" on Jul 13, 2023
Shreyanand self-assigned this on Jul 18, 2023
Shreyanand added the help wanted (Extra attention is needed) label on Mar 22, 2024