-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Why is lora training so slow? #90
Comments
I had trained lora of 768x768 model with 144 frames on A100, using bf16. |
thanks for the info! ya. that's something I expect to see. but sadly. anyway I will check and update here on anything I will get. |
I met the same problem too. I fine-tune the model with lora on V100 machines. Its speed is about 40s/it. When I don't use the lora, the speed is 26s/it. |
do you find any solution to speed up the lora-training? |
@gulucaptain Not yet. Did you use |
My situation: 8 A100 80G, batchsize 1, 19.35s/it, I feel it very slow. Is that normal? |
help me and help each other, please |
Dear author @yunkchen
Thanks for your awesome work!
I tried to run the lora training using my data, but the speed is very slow --- ~40s/it.
Training details:
512x512 model
2 GPUs - batch size: 1 for each
Is there anything I missed? Please give some hints on this. Thanks!
The text was updated successfully, but these errors were encountered: