GTO: Document SemVer practices for ML models #231

aguschin · 2022-11-22T05:31:59Z

Semantic versioning is the accepted way to version code. How should artifacts be versioned?
I have been asked this by a Data Scientist some time ago. Given that everyone is free to do whatever he wants, perhaps giving a hint is not bad...?

I formulated a reasonable convention for models, not sure if it could be of any use:

Patch

Model as a black-box is as before, it only outputs different numbers.

Typical scenario: model have been trained with more recent data
Typical scenario 2: changed hyper-parameters

Minor

May want to take advantage of additional outputs or additional functionalities

Typical scenario 1: model now has predict_proba() in addition to predict()
Typical scenario 2: model now outputs a json with an additional field confidence_interval, in addition to predicted_values

Major

Need to re-visit the code that calls the model to serve it (breaking change)

Typical scenario 1: model APIs have changed
Typical scenario 2: model expects different input data format
Typical scenario 3: model relies on different libraries, need to re-build the venv (or even the OS-level libraries)

Originally posted by @francesco086 in #199 (comment)

🧵 See the thread for more opinions on this

The text was updated successfully, but these errors were encountered:

aguschin changed the title ~~Docs: add a page about SemVer for ML~~ Docs: explain SemVer for ML Nov 22, 2022

aguschin added the A: docs Area: user documentation (gatsby-theme-iterative) label Nov 22, 2022

aguschin mentioned this issue Nov 23, 2022

GTO docs #199

Merged

jorgeorpinel added the type: discussion label Nov 25, 2022

jorgeorpinel changed the title ~~Docs: explain SemVer for ML~~ GTO: explain SemVer for ML Nov 25, 2022

omesser added the p2-nice-to-have Less of a priority at the moment. We don't usually deal with this immediately. label Mar 6, 2023

omesser changed the title ~~GTO: explain SemVer for ML~~ GTO: explain SemVer for ML models Mar 6, 2023

omesser changed the title ~~GTO: explain SemVer for ML models~~ GTO: Document SemVer practices for ML models Mar 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GTO: Document SemVer practices for ML models #231

GTO: Document SemVer practices for ML models #231

aguschin commented Nov 22, 2022 •

edited

Loading

GTO: Document SemVer practices for ML models #231

GTO: Document SemVer practices for ML models #231

Comments

aguschin commented Nov 22, 2022 • edited Loading

Patch

Minor

Major

aguschin commented Nov 22, 2022 •

edited

Loading