Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] Reward net transfer #838

Open
risufaj opened this issue Feb 2, 2024 · 1 comment
Open

[Question] Reward net transfer #838

risufaj opened this issue Feb 2, 2024 · 1 comment

Comments

@risufaj
Copy link

risufaj commented Feb 2, 2024

Hello,

I want to run IRL on a task with some expert demonstrations. The demonstrations are a bit old, and since then, the action space action has increased. For instance, in the first version of the task there were only 5 actions, whereas in the new version there are 3 new actions that can be taken.
Is it possible to train a reward net using the existing expert demonstrations (e.g. using AIRL) and then used the trained reward net to train a new policy considering the added actions? If so, I'm not entirely sure how it would look like when creating a RewardNet class.

I would appreciate some help.

Thanks in advance.

@rizqisubeno
Copy link

I think you can use again reward net again to train a new policy with added actions as long as you use a state-only parameter on the reward net

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants