
a small model such as Mobilenet v2 for pre-training #25

Open
mmmz28 opened this issue Apr 13, 2023 · 3 comments

Comments


mmmz28 commented Apr 13, 2023

Thank you for your excellent work. Replacing the transformer with a CNN does make deployment friendlier. Furthermore, I'm wondering whether using a smaller model such as MobileNet v2 for pre-training and then fine-tuning downstream would be effective?

keyu-tian (Owner) commented Apr 19, 2023

Thank you; we agree this could be of general interest and value. We will consider running SparK on MobileNet soon (perhaps v2 and v3), or you can try it yourself (see the tutorial at https://github.com/keyu-tian/SparK/tree/main/pretrain#tutorial-for-pretraining-your-own-cnn-model).
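For anyone curious what this kind of pretraining involves at a high level: the core idea of SparK-style masked pretraining is to drop a random subset of image patches and train the CNN to reconstruct them. Below is a minimal, illustrative sketch of just the patch-masking step in plain NumPy; the function name and parameters are hypothetical, not SparK's actual API, so please follow the linked tutorial for the real implementation.

```python
import numpy as np

def random_patch_mask(h, w, patch=32, mask_ratio=0.6, seed=0):
    """Build a pixel-level keep/drop mask over non-overlapping patches,
    as used in masked image modeling (illustrative only, not SparK's API)."""
    rng = np.random.default_rng(seed)
    gh, gw = h // patch, w // patch          # patch-grid dimensions
    n = gh * gw
    n_drop = int(round(n * mask_ratio))      # number of patches to hide
    flat = np.ones(n, dtype=bool)            # True = patch kept (visible)
    flat[rng.choice(n, size=n_drop, replace=False)] = False
    grid = flat.reshape(gh, gw)
    # Upsample the patch grid to pixel resolution
    return np.repeat(np.repeat(grid, patch, axis=0), patch, axis=1)

# Zero out the masked patches of a (hypothetical) 224x224 RGB image;
# the model is then trained to reconstruct the hidden regions.
mask = random_patch_mask(224, 224, patch=32, mask_ratio=0.6)
masked_img = np.random.rand(224, 224, 3) * mask[..., None]
```

The part SparK actually contributes is treating the visible patches as a sparse input so the convolutions skip the masked regions entirely, which is what makes the approach work for CNN backbones like MobileNet.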


xylcbd commented May 9, 2023

@keyu-tian Can I use swinv2-base as the backbone for pre-training?

keyu-tian (Owner) commented May 12, 2023

@xylcbd Sorry, but SparK is not suitable for this: SparK can pretrain any CNN model, but Swin v2 is a transformer. You could use MAE or SimMIM to pretrain a Swin transformer instead.
