Support Spectrum training in the SFT script / CLI, e.g. by passing unfrozen_parameters via a YAML file obtained from Spectrum's SNR analysis:
trl sft \
  --model_name_or_path meta-llama/Llama-3.1-8B-Instruct \
  --unfrozen_parameters snr_results_Llama-3.1-8B_50percent.yaml \
  ...
I'd love to use the huggingface-pytorch-training docker container out of the box with spectrum for
Axolotl does seem to natively support spectrum.
Your contribution: TBD
It seems @philschmid has an implementation here: https://github.com/philschmid/deep-learning-pytorch-huggingface/blob/391f19ba06c128a2a290b3bdcb717ad6ff794fd7/training/scripts/run_sft.py#L54-L77. The question is maybe just what the cleanest way is to integrate this natively in TRL?
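For anyone following along, here is a minimal sketch of the general idea: freeze every parameter, then unfreeze those whose names match the patterns from the Spectrum YAML. This is not the code from the linked script and not an existing TRL API. The function name apply_unfrozen_parameters, the assumption that the YAML boils down to a list of regex patterns, and the DemoModel stand-in are all hypothetical. Loading the YAML (for instance with PyYAML) is left out so the snippet has no dependencies.

```python
import re


def apply_unfrozen_parameters(model, patterns):
    """Freeze all parameters, then unfreeze those whose name matches a pattern.

    `model` only needs a `named_parameters()` method yielding (name, param)
    pairs with an assignable `param.requires_grad`, as torch.nn.Module provides.
    `patterns` is assumed to be a list of regex strings taken from the YAML.
    """
    compiled = [re.compile(p) for p in patterns]
    unfrozen = []
    for name, param in model.named_parameters():
        # re.match anchors at the start of the name; this is an assumption
        # about how the patterns are meant to be applied.
        param.requires_grad = any(rx.match(name) for rx in compiled)
        if param.requires_grad:
            unfrozen.append(name)
    return unfrozen


class DemoParam:
    """Hypothetical stand-in for a torch parameter."""

    def __init__(self):
        self.requires_grad = True


class DemoModel:
    """Hypothetical stand-in exposing named_parameters() like a torch module."""

    def __init__(self, names):
        self.params = {name: DemoParam() for name in names}

    def named_parameters(self):
        return self.params.items()
```

With a real model the same function could be called right after loading it and before building the trainer. Whether a native integration should take a file path, a list of patterns, or both is exactly the open design question here.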
This would be great, and it would keep users from making mistakes when implementing this method by hand. For example, the code for integration with other libraries given in the official repo has some problems, whereas the simple implementation in my tutorial and Philipp's code should be correct.
BTW, Spectrum is quite agnostic with respect to training method (SFT, DPO...): the models by VAGO solutions show that it works well for DPO too.
LMK what's the best way to proceed and how I can help with this integration.