FID results of GPT-L and GPT-1B on 256*256 images #46

Open
LutingWang opened this issue Jul 19, 2024 · 3 comments
LutingWang commented Jul 19, 2024

Hi, thanks for the excellent work. I'm trying to reproduce the results on 256x256 images. The VQGAN model is reproduced successfully, achieving $2.10$ rFID. However, the AR part shows a significant performance gap. More specifically, I use 8 A100-80GB GPUs to run the following scripts:

bash scripts/autoregressive/train_c2i.sh --cloud-save-path xxx --code-path xxx --gpt-model GPT-L --epochs 50
bash scripts/autoregressive/train_c2i.sh --cloud-save-path xxx --code-path xxx --gpt-model GPT-1B --epochs 50

The training results are as follows:

Model  | Final Loss | FID  | Expected FID
-------|------------|------|-------------
GPT-L  | 7.86       | 4.62 | 4.22
GPT-1B | 7.33       | 4.13 | 3.09
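For context on how I compare the FID column against the expected numbers: FID measures the distance between Gaussian fits of Inception-v3 pool features from generated and reference images. Below is a minimal NumPy/SciPy sketch of the standard formula (my own reference implementation for sanity-checking, not this repo's evaluation script; mu/sigma are assumed to be pre-computed feature statistics):

import numpy as np
from scipy import linalg

def compute_fid(mu1, sigma1, mu2, sigma2, eps=1e-6):
    """FID = ||mu1 - mu2||^2 + Tr(S1 + S2 - 2 (S1 S2)^(1/2))."""
    diff = mu1 - mu2
    covmean, _ = linalg.sqrtm(sigma1 @ sigma2, disp=False)
    if not np.isfinite(covmean).all():
        # Regularize near-singular covariances before the matrix square root.
        offset = np.eye(sigma1.shape[0]) * eps
        covmean, _ = linalg.sqrtm((sigma1 + offset) @ (sigma2 + offset), disp=False)
    if np.iscomplexobj(covmean):
        covmean = covmean.real  # tiny imaginary parts are numerical noise
    return float(diff @ diff + np.trace(sigma1) + np.trace(sigma2) - 2 * np.trace(covmean))

# mu, sigma are fitted to (N, 2048) Inception-v3 pool features:
#   mu, sigma = feats.mean(axis=0), np.cov(feats, rowvar=False)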

Is the final loss reasonable? Do you have any idea what the reason for the gap might be?
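As a rough sanity check on the loss scale (my own back-of-the-envelope, assuming the default 16384-entry VQGAN codebook; adjust if your config differs): per-token cross-entropy maps to a perplexity of exp(loss), and a uniform predictor over the codebook would sit at ln(16384) ≈ 9.70, so the losses above are at least well below random:

import math

CODEBOOK_SIZE = 16384  # assumed codebook size; check your VQGAN config

for name, loss in [
    ("GPT-L", 7.86),
    ("GPT-1B", 7.33),
    ("uniform baseline", math.log(CODEBOOK_SIZE)),
]:
    print(f"{name}: loss={loss:.2f} -> perplexity ~ {math.exp(loss):,.0f} of {CODEBOOK_SIZE}")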

Thanks!

PeizeSun (Collaborator) commented

Hi~
I don't understand what "reproducing the result on 224x224" means. The expected FID is for 256x256.

LutingWang changed the title from "FID results of GPT-L and GPT-1B on 224*224 images" to "FID results of GPT-L and GPT-1B on 256*256 images" on Jul 23, 2024.
LutingWang (Author) commented

> Hi~ I don't understand what "reproducing the result on 224x224" means. The expected FID is for 256x256.

Sorry for the mistake. I was trying to emphasize that the image resolution is not 384x384, but I mistakenly wrote 224.

msed-Ebrahimi commented

> Hi~ I don't understand what "reproducing the result on 224x224" means. The expected FID is for 256x256.

Hi. Thank you for this awesome repo. I have the same issue with the original code: the loss ends up around 7.3 after 300 epochs.

[attached image: IMG_0379]
