Meta-Gradient RL A2C #207
Unanswered
RobvanGastel
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi!
I have been working with your package to test your meta-gradient RL example. As I have it implemented now, the algorithm converges on the cartpole environment, however, the meta-parameter gamma only trends downwards. Do I use torchopt incorrectly in the code sample below?
Any help would be much appreciated! Thank you!
Beta Was this translation helpful? Give feedback.
All reactions