This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

pretrained weights are probably incorrect #7

Open · BestSonny opened this issue May 22, 2022 · 7 comments

@BestSonny

The pretrained weights seem to be wrong. For example, the ViT-Base checkpoint appears to have an output dimension of 1024.

Could you upload the correct version? Thanks

@brewormly

I also have some issues regarding the pre-trained checkpoints. The checkpoints only include the keys "target_encoder" and "prototypes". If I try to load a checkpoint via the training script, I get errors because the keys "epoch" and "encoder" are missing.
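For reference, a minimal sketch of what inspecting one of the released files looks like (the file name here is hypothetical), which shows why resuming via the training script fails:

```python
import torch

# Hypothetical file name; substitute the actual released checkpoint.
ckpt = torch.load("msn_checkpoint.pth.tar", map_location="cpu")

# The released files only carry these two entries:
print(list(ckpt.keys()))  # ['target_encoder', 'prototypes']

# The training script also expects 'encoder', 'epoch', etc., so resuming
# from these files raises a KeyError. For inference you can load just the
# target encoder, stripping the DataParallel 'module.' prefix:
state_dict = {
    k.replace("module.", "", 1): v
    for k, v in ckpt["target_encoder"].items()
}
```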

@MidoAssran (Contributor)

Hi @BestSonny, there are 1024 prototypes used in the loss, but I just checked the ViT-B/16 and ViT-B/4 pre-trained weights, and they both have the correct output dimension of 768. Please let me know if you would like more clarification or help loading the models!
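A quick way to see both numbers yourself (a sketch; the file name is hypothetical, and the prototype matrix's second dimension is assumed to be the projection-head output size, not the backbone width):

```python
import torch

# Hypothetical file name. The prototypes live in the projection-head output
# space, so their row count (1024) is unrelated to the backbone width.
ckpt = torch.load("msn_vitb16.pth.tar", map_location="cpu")
print(ckpt["prototypes"].shape[0])  # 1024 prototypes used in the loss

# The ViT-B backbone itself is 768-dimensional; e.g. the patch embedding
# (key names assumed to follow the usual timm-style ViT layout):
for k, v in ckpt["target_encoder"].items():
    if "patch_embed" in k and k.endswith(".weight"):
        print(k, v.shape[0])  # 768 output channels for ViT-B
        break
```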

@MidoAssran (Contributor)

Hi @brewormly, yes, the current checkpoints only include the "target_encoder", since that is the network used at the end of pre-training to obtain the results in the paper, but I would be happy to release the full checkpoints as well in case you find them useful! I will ping you once they are online!

@sayakpaul

@MidoAssran would it be possible to release the ImageNet-1k-specific checkpoints (fine-tuned and/or linear-eval'd)?

By "linear-eval'd" I mean keeping the target encoder frozen and just training a linear layer on top of it. So, essentially, the target encoder params (which are already released) and the linear layer params.

@sayakpaul commented Aug 25, 2022

Also, the target_encoder key in the released weights seems to contain two things: the actual encoder plus the projection head (the module.fc* params). Is the projection head needed for downstream tasks?

@MidoAssran
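If the projection head turns out not to be needed downstream, a hedged sketch of dropping those parameters before loading (the file name is hypothetical, and the key prefixes are assumed from the released files):

```python
import torch

# Hypothetical file name. Keep only backbone weights, dropping the
# projection-head entries (the module.fc* params mentioned above).
ckpt = torch.load("msn_checkpoint.pth.tar", map_location="cpu")
backbone_sd = {
    k.replace("module.", "", 1): v
    for k, v in ckpt["target_encoder"].items()
    if not k.startswith("module.fc")
}
# backbone_sd can then be loaded into a plain ViT, e.g. with strict=False.
```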

@CemKaratastan

> I also have some issues regarding the pre-trained checkpoints. The checkpoints only include the keys "target_encoder" and "prototypes". If I try to load a checkpoint via the training script, I get errors because the keys "epoch" and "encoder" are missing.

I have the same issue!

@ludles commented Jul 17, 2023

> Hi @brewormly, yes, the current checkpoints only include the "target_encoder", since that is the network used at the end of pre-training to obtain the results in the paper, but I would be happy to release the full checkpoints as well in case you find them useful! I will ping you once they are online!

Sorry for the late reply after one year. Is there still a plan to release the full checkpoints? I think they would be very helpful for continuing training on other tasks.
