-
Notifications
You must be signed in to change notification settings - Fork 56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training Results #39
Comments
Hi~ Can you first try to check whether the image code is correct? You can use this script here |
Did you change the batch size, learning rate or other training parameters ? |
Dear Peize, I didn't change anything in .py files. I assumed that the command for 256 resolution is the same as that for 384 and used the script above. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Dear authors,
Thanks for your excellent work in autoregressive image generation!
I try to reproduce the training of GPT-B-256 following the instruction provided here. The specific command I used is:
However, after training ~150 epoch on ImageNet1k, it seems that the generated results are still meaningless:
My environment is 8xA5000 GPUs, which is different from yours (8xA100). I wonder whether the results are sensitive to such a difference, and whether the problem would be alleviated after full training (300 epochs).
Thanks for your help in advance :)
The text was updated successfully, but these errors were encountered: