You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Would you open source the training code and algorithm process for ORM? I saw the training parameters for ORM in the paper. The algorithm training process mentioned in the paper seems to only include the training parameters and process for the actor-critic, but there seems to be no algorithm process for ORM. Although the ORM model has been open-sourced, it seems that different web data might require different ORM?
The text was updated successfully, but these errors were encountered:
Sorry, we are only planning to open source the trained ORM, the training data is not going to be open source. The training code is just using the SFT training script in llama-factory. All websits use the same ORM.
Would you open source the training code and algorithm process for ORM? I saw the training parameters for ORM in the paper. The algorithm training process mentioned in the paper seems to only include the training parameters and process for the actor-critic, but there seems to be no algorithm process for ORM. Although the ORM model has been open-sourced, it seems that different web data might require different ORM?
The text was updated successfully, but these errors were encountered: