Skip to content

Combining CLI and COLAB for using pre-trained models #794

Answered by talmo
auesro asked this question in Help!
Discussion options

You must be logged in to vote

Hi @auesro and @isabelperezf,

You are correct that training does not pick up from the pre-trained model. This is intentional as we find that randomly initialized models trained from scratch every time are more stable and can train just as well. This is partially because we avoid "reinforcing" errors that would be learned early on when there is little labeled data available and the model is heavily overfitting to it.

That said, if you want to try it yourself, we have a notebook on resumable training you're welcome to run through. It's a bit more advanced than the basic training notebook, but if you follow it step-by-step it'll do what you're looking for.

Let us know if you this works for y…

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Answer selected by auesro
Comment options

You must be logged in to vote
1 reply
@talmo
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Help!
Labels
None yet
2 participants