You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, thanks for your great job. I am trying to play around with your code but met some issues.
If I understand correctly, I have to configure a file like test_webarena_lite.raw.json for the corresponding instruction from webrl-sft-data. However, I have no idea about the supposed configuration file for each task instruction from webrl-sft-data, for example the site and start-url. Could u pls give some help here? Appreciate!
The text was updated successfully, but these errors were encountered:
Besides, I guess we need the reward function from VAB-lite given the line 3 of algorithm 1 in the paper. Could u help with this as well? Thanks! @QZH-777@Xiao9905
Hi, thanks for your great job. I am trying to play around with your code but met some issues.
If I understand correctly, I have to configure a file like
test_webarena_lite.raw.json
for the corresponding instruction fromwebrl-sft-data
. However, I have no idea about the supposed configuration file for each task instruction fromwebrl-sft-data
, for example the site and start-url. Could u pls give some help here? Appreciate!The text was updated successfully, but these errors were encountered: