Skip to content

DPO #1008

Answered by filippo82
fakerybakery asked this question in Q&A
DPO #1008
Dec 27, 2023 · 3 comments · 1 reply
Discussion options

You must be logged in to vote

Yes, it is possible but you currently need to checkout the rl-trainer branch. You can find an example configuration here.

Replies: 3 comments 1 reply

Comment options

You must be logged in to vote
1 reply
@fakerybakery
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by fakerybakery
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants