confusion on epochs #44
Hi!
I am wondering about the number of epochs in the experiments. The paper states that models are trained for 200 epochs on EC, but in the config the epoch count is set to 50. Should I change it to 200 to reproduce the experiment?
Thanks for your help!

Hi, thanks for the question. In the original experiments, we trained all models for 200 epochs. While organizing the code, I found that training for 50 epochs yields performance similar to training for 200 epochs, so training for just 50 epochs should be fine.

Thank you for your reply! I have another question: when I try to reproduce downstream tasks such as EC, you mentioned in other issues that the batch size should be set to 8 on a single GPU. However, I found that the fine-tuning process has low GPU utilization. If there is enough GPU memory, can I enlarge the batch size without affecting GearNet's performance?

Actually, I think the downstream performance is very sensitive to the batch size. For the current config, I find
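For reference, the epoch change discussed above amounts to a one-line edit of the training config. The sketch below illustrates the idea in plain Python; the key names (`train`, `num_epoch`, `batch_size`) follow torchdrug-style configs but are assumptions, so check the actual YAML files in the repo's config directory.

```python
# Minimal sketch of the config override discussed above. The key names are
# assumptions modeled on torchdrug-style configs, not the repo's exact schema.
config = {
    "train": {"num_epoch": 50},   # default shipped with the code
    "engine": {"batch_size": 8},  # maintainer-recommended single-GPU setting
}

def with_epochs(cfg, n):
    """Return a copy of the config with the epoch count overridden."""
    out = {key: dict(section) for key, section in cfg.items()}
    out["train"]["num_epoch"] = n
    return out

paper_config = with_epochs(config, 200)  # match the paper's 200-epoch setting
print(paper_config["train"]["num_epoch"])  # 200
print(config["train"]["num_epoch"])        # 50 (original config untouched)
```

Per the thread, either value should reproduce comparable results; keep `batch_size` at the recommended setting, since the maintainer notes downstream performance is sensitive to it.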