Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RM数据构造 #55

Open
tcxia opened this issue Mar 26, 2024 · 1 comment
Open

RM数据构造 #55

tcxia opened this issue Mar 26, 2024 · 1 comment

Comments

@tcxia
Copy link

tcxia commented Mar 26, 2024

您好,想问下,论文中说选择10个不同的RM模型对同一个数据打分,这10个RM模型的选择标准是什么?

@refrain-wbh
Copy link
Contributor

十个模型仅仅只有随机种子不同,利用随机性获得一个平均和稳定的reward model打分。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants