Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about GPU requirements and usage #12

Open
SlenderMongoose opened this issue Jun 26, 2024 · 2 comments
Open

Questions about GPU requirements and usage #12

SlenderMongoose opened this issue Jun 26, 2024 · 2 comments

Comments

@SlenderMongoose
Copy link

Thank you for sharing your code and detailed documentation. I am currently using your code and have encountered a few questions regarding the GPU usage.

  1. What is the approximate GPU memory requirement for fine-tuning?
  2. Is it possible to train the model using a single GPU? If so, what settings should I be aware of?
  3. I am currently using multiple RTX 3090 GPUs but encountering out-of-memory (OOM) errors. Is there any way to resolve this issue? If so, how should I set it up?
  4. Interestingly, I can successfully run test.py using a single RTX 3090 GPU, but it fails with multiple RTX 3090 GPUs. Do you have any suggestions on how to resolve this?

Thank you very much for your time and assistance!

@zhengbw0324
Copy link
Collaborator

zhengbw0324 commented Jun 26, 2024

@SlenderMongoose
Hello, it should be possible to train our model on 3090. You can try the following strategies:

  1. Adjust the batch size.
  2. Try to enable deepspeed offload, you can refer to here.
  3. Try to use LoRA for fine-tuning.

However, we do not recommend that you use 3090 for experiments, as it will result in unacceptable time consumption.

@zhengbw0324
Copy link
Collaborator

As for the failure of running the test with multiple GPUs, I think it may be because the communication between multiple GPUs takes up extra memory. You can reduce the batch size or num_beams appropriately.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants