Guidance on Training from Scratch or Fine-Tuning #164

alfausa1 · 2025-01-15T08:05:53Z

Hi,
I would like to ask how many images you would recommend for training a model from scratch, and what weights you would suggest starting with.

My use case is object segmentation on plain backgrounds. The general model currently works quite well for most cases, but there are a few specific scenarios that could be improved. This is why I’m considering training or fine-tuning.

I have a dataset of around 7,000 images at 2K resolution. What would you recommend in this case?

Thank you in advance for your help!

ZhengPeng7 · 2025-01-15T11:13:02Z

For common cases without extremely complicated shapes, 500-1,000 images should be enough to train from scratch.
If your cases are very different from the training sets I used for the general-version weights, I suggest training from scratch, provided you have enough images. Otherwise, fine-tuning is likely the better option.
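
The rule of thumb above can be sketched as a small helper. This is only an illustration: the function name, the `domain_differs` flag, and the 1,000-image default (the upper end of the 500-1,000 range) are assumptions made for the example, not anything taken from the repository.

```python
def suggest_strategy(num_images: int, domain_differs: bool,
                     min_scratch_images: int = 1000) -> str:
    """Return 'scratch' or 'fine-tune' following the rule of thumb above.

    min_scratch_images is a hypothetical threshold chosen for this sketch.
    """
    # Train from scratch only when the data differs from the general
    # training sets AND there are enough images to support it.
    if domain_differs and num_images >= min_scratch_images:
        return "scratch"
    # In every other case, start from the general weights.
    return "fine-tune"


print(suggest_strategy(7000, domain_differs=True))
print(suggest_strategy(300, domain_differs=True))
print(suggest_strategy(7000, domain_differs=False))
```

Whether a dataset "differs" from the general training sets is still a judgment call; the helper only makes the two conditions explicit.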

In your case, I recommend training from scratch. BTW, you can check the model efficiency section of the README: using FP16 + compile==True + PyTorch==2.5.1 may save enough GPU memory that you need less downscaling on your 2K data.
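
To see why halving the precision leaves room for higher resolution, here is a rough back-of-the-envelope sketch. It assumes "2K" means 2048x2048 and counts only a single 3-channel input tensor; real training memory is dominated by activations and optimizer state, and mixed precision usually keeps some tensors in FP32, so treat the numbers as a scaling illustration rather than a measurement.

```python
# Bytes needed to store one element in each precision.
BYTES_PER_ELEMENT = {"fp32": 4, "fp16": 2}


def tensor_mib(batch: int, channels: int, height: int, width: int, dtype: str) -> float:
    """Size in MiB of a dense tensor of the given shape and precision."""
    num_elements = batch * channels * height * width
    return num_elements * BYTES_PER_ELEMENT[dtype] / 2**20


# Compare a downscaled 1024x1024 input with a full 2048x2048 input.
for side in (1024, 2048):
    for dtype in ("fp32", "fp16"):
        size = tensor_mib(1, 3, side, side, dtype)
        print(f"{side}x{side} {dtype}: {size:.1f} MiB")
```

Doubling the side length quadruples the element count, while FP16 halves the bytes per element, so FP16 recovers half of the cost of going from 1024 to 2048 on each tensor it applies to.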

Roshan-digi5 · 2025-01-16T05:58:15Z

Hello,

First of all, thank you for your incredible work and contributions!

I want to train a model specifically for removing backgrounds from car images. I have a dataset of approximately 80,000 images. Could you guide me on the best practices to follow, which model and settings would be most suitable, and whether there are any tutorials available for training or fine-tuning a model?

ZhengPeng7 · 2025-01-16T14:39:05Z

I've added a fine-tuning guideline to the README. For fine-tuning, you can use the default settings except for the number of epochs. If you still have a problem after following it, please let me know.

Roshan-digi5 · 2025-01-17T04:34:33Z

Thank you, I will let you know if I run into any issues.

alfausa1 · 2025-01-17T09:01:24Z

Hi,

Thank you so much for taking the time to reply!

I wanted to ask specifically about the configurations, losses and backbone you would recommend for my use case. Are there any particular hyperparameters or architectures you find especially suitable for this type of task? Any additional guidance would be greatly appreciated.

Thanks again for your support!

ZhengPeng7 · 2025-01-17T09:14:27Z

As I see it, car segmentation should involve fewer contour details and less need for transparency. If so, you can train the model for fewer epochs and give the IoU loss a higher weight to accelerate convergence.
I may come up with more suggestions later, but that's all for now.
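
As a minimal sketch of what "higher weights of IoU loss" could look like, the snippet below combines a binary cross-entropy term with a soft IoU term on flat lists of probabilities. The function names and the `w_bce` / `w_iou` parameters are hypothetical and written in plain Python for clarity; they are not the repository's actual loss code or config keys.

```python
import math


def bce_loss(pred, target, eps=1e-7):
    """Mean binary cross-entropy over per-pixel probabilities."""
    total = 0.0
    for p, t in zip(pred, target):
        p = min(max(p, eps), 1 - eps)  # clamp to avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1 - p))
    return total / len(pred)


def soft_iou_loss(pred, target, eps=1e-7):
    """1 - soft IoU, where intersection and union use probabilities."""
    inter = sum(p * t for p, t in zip(pred, target))
    union = sum(p + t - p * t for p, t in zip(pred, target))
    return 1.0 - (inter + eps) / (union + eps)


def combined_loss(pred, target, w_bce=1.0, w_iou=1.0):
    """Weighted sum of the two terms; raise w_iou to emphasize region overlap."""
    return w_bce * bce_loss(pred, target) + w_iou * soft_iou_loss(pred, target)


pred = [0.9, 0.2, 0.6, 0.1]
target = [1.0, 0.0, 1.0, 0.0]
print(combined_loss(pred, target, w_iou=1.0))
print(combined_loss(pred, target, w_iou=2.0))
```

A larger `w_iou` makes the region-overlap term dominate the gradient, which tends to suit objects with simple, solid shapes; for fine or semi-transparent details the pixel-level terms usually matter more.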

alfausa1 · 2025-01-17T09:19:11Z

Sorry for not specifying earlier: my use case is object segmentation on a plain background (not cars). Many objects do have transparency and small details such as tiny holes.

ZhengPeng7 · 2025-01-17T09:23:07Z

That would be a general case. I'm not sure what would work better there (otherwise, I would already have added the updates to the default settings).
