Spectrum training support #2504

ggbetz · 2024-12-19T15:42:09Z

Support Spectrum training in the SFT script / CLI, e.g. by passing unfrozen_parameters via a YAML file produced by Spectrum's SNR analysis:

trl sft \
--model_name_or_path meta-llama/Llama-3.1-8B-Instruct \
--unfrozen_parameters snr_results_Llama-3.1-8B_50percent.yaml \
...
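For illustration, here is a minimal sketch of what such an option might do under the hood. It assumes the YAML file holds a list of regex patterns under an `unfrozen_parameters` key (loadable with e.g. `yaml.safe_load`), and that a parameter stays trainable when its name matches any pattern from the start of the name. The function name `apply_unfrozen_parameters` is hypothetical and not part of TRL; the stand-in objects at the bottom replace a real model so the snippet runs without torch.

```python
import re
from types import SimpleNamespace


def apply_unfrozen_parameters(named_parameters, patterns):
    """Freeze every parameter, then unfreeze those whose name matches a pattern.

    named_parameters: iterable of (name, param) pairs, e.g. model.named_parameters()
    patterns: regex strings (assumed to come from the YAML's `unfrozen_parameters` list)
    Returns the names of the parameters left trainable.
    """
    compiled = [re.compile(p) for p in patterns]
    trainable = []
    for name, param in named_parameters:
        # re.match anchors at the start of the name (an assumption of this sketch)
        param.requires_grad = any(rx.match(name) for rx in compiled)
        if param.requires_grad:
            trainable.append(name)
    return trainable


# Stand-in parameters (hypothetical names) so the sketch runs without a real model
params = {
    name: SimpleNamespace(requires_grad=True)
    for name in [
        "model.embed_tokens.weight",
        "model.layers.0.mlp.down_proj.weight",
        "model.layers.1.mlp.down_proj.weight",
        "lm_head.weight",
    ]
}
patterns = ["^lm_head.weight$", "model.layers.1.mlp.down_proj"]
trainable = apply_unfrozen_parameters(params.items(), patterns)
print(trainable)
```

With a real model, the first argument would be `model.named_parameters()`, called before the trainer is constructed so that the optimizer only sees the unfrozen weights.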

I'd love to use the huggingface-pytorch-training docker container out of the box with spectrum for

Axolotl appears to support Spectrum natively.

TBD


ggbetz · 2024-12-20T15:19:13Z

anakin87 · 2024-12-21T16:25:29Z

This would be great and would prevent users from making mistakes in the manual implementation of this method: for example, the code for integration with other libraries reported in the official repo has some problems. In contrast, the simple implementation in my tutorial and Philipp's code should be correct.

BTW, Spectrum is quite agnostic with respect to training method (SFT, DPO...): the models by VAGO solutions show that it works well for DPO too.

LMK what's the best way to proceed and how I can help with this integration.

qgallouedec added ✨ enhancement New feature or request 🏋 SFT Related to SFT labels Dec 20, 2024
