The customary 'release by blog post' strategy:
https://snorkel.ai/new-benchmark-results-demonstrate-value-of-snorkel-ai-approach-to-llm-alignment/
HF page with model card used mainly for advertising:
https://huggingface.co/snorkelai/Snorkel-Mistral-PairRM-DPO
Based on Mistral 7B, so it is every bit as closed as that model, except for the UltraFeedback DPO data (the "RL" component).
I hesitate to add this because it is so obviously designed as an attention grab: get a high position on a leaderboard, then release the model and advertise the leaderboard position as a reason for folks to get in business with you. It is not clear that Snorkel is offering this as a serious 'open' model for anything other than commercial downstream uses.