Replies: 1 comment
-
I just took a look at the source code. If I understand it correctly, you would need to use the |
-
I am training with DPO, but Axolotl never evaluates the model.
With SFT, evaluation works; with DPO, no evaluation is run.
The most important parts of my config are below.
Can someone please help me?
Or is evaluation simply not available for DPO?
Many thanks
Philip