You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've read the guides/developers and reward/instructor , I have some questions regarding the training process of the reward model, and I hope you can help me with it.
reward/instructor#dataset mentioned that Once open-asisstant dataset are available it will be added here.. Has the task of supporting OPEN-ASSISTANT's own dataset been scheduled? How long will it take to complete it?
Is the process of training the reward model currently done manually? Will it be integrated into the data loop for automatic execution in the future?( The idea here is that the reward model will automatically perform an initial ranking of the data, and then user feedback will be used as further training data for it.)
This discussion was converted from issue #2017 on June 09, 2023 11:37.
Heading
Bold
Italic
Quote
Code
Link
Numbered list
Unordered list
Task list
Attach files
Mention
Reference
Menu
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Dear team,
I've read the guides/developers and reward/instructor , I have some questions regarding the training process of the reward model, and I hope you can help me with it.
Once open-asisstant dataset are available it will be added here.
. Has the task of supporting OPEN-ASSISTANT's own dataset been scheduled? How long will it take to complete it?Thank you for your time and assistance.
Beta Was this translation helpful? Give feedback.
All reactions