Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the tranning process for ORM #30

Closed
K-THU opened this issue Dec 25, 2024 · 1 comment
Closed

About the tranning process for ORM #30

K-THU opened this issue Dec 25, 2024 · 1 comment

Comments

@K-THU
Copy link

K-THU commented Dec 25, 2024

Would you open source the training code and algorithm process for ORM? I saw the training parameters for ORM in the paper. The algorithm training process mentioned in the paper seems to only include the training parameters and process for the actor-critic, but there seems to be no algorithm process for ORM. Although the ORM model has been open-sourced, it seems that different web data might require different ORM?

@QZH-777
Copy link
Collaborator

QZH-777 commented Dec 26, 2024

Sorry, we are only planning to open source the trained ORM, the training data is not going to be open source. The training code is just using the SFT training script in llama-factory. All websits use the same ORM.

@QZH-777 QZH-777 closed this as completed Dec 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants