We noticed a performance drop when we resumed training with OnPolicyRunner, which applies empirical normalization in our environment.
There is a gap between the black line and the blue one.
Additionally, we found that model performance cannot improve without empirical normalization (the green and orange curves).
Many thanks.
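For context, an empirical (running) observation normalizer keeps a running mean and variance that are themselves part of the training state. The sketch below is a hypothetical, minimal implementation of such a normalizer (it is not rsl_rl's actual `EmpiricalNormalization` code); it illustrates why a checkpoint that restores the policy weights but not these statistics can produce a performance gap on resume, since the policy then sees a shifted input distribution.

```python
import numpy as np

class RunningNormalizer:
    """Tracks a running mean/variance of observations and normalizes
    inputs with the current statistics. Hypothetical sketch."""

    def __init__(self, shape, eps=1e-8):
        self.mean = np.zeros(shape)
        self.var = np.ones(shape)
        self.count = 0
        self.eps = eps

    def update(self, batch):
        # Merge batch statistics into the running ones
        # (parallel variance update, Chan et al.).
        batch = np.asarray(batch, dtype=np.float64)
        b_mean = batch.mean(axis=0)
        b_var = batch.var(axis=0)
        b_count = batch.shape[0]
        total = self.count + b_count
        delta = b_mean - self.mean
        self.mean = self.mean + delta * b_count / total
        m_a = self.var * self.count
        m_b = b_var * b_count
        self.var = (m_a + m_b + delta**2 * self.count * b_count / total) / total
        self.count = total

    def normalize(self, x):
        return (x - self.mean) / np.sqrt(self.var + self.eps)

    # These statistics must be saved and restored alongside the policy:
    # if they are reset on resume, normalized observations shift and the
    # policy receives a different input distribution than it trained on.
    def state_dict(self):
        return {"mean": self.mean, "var": self.var, "count": self.count}

    def load_state_dict(self, state):
        self.mean = state["mean"]
        self.var = state["var"]
        self.count = state["count"]
```

If restoring the normalizer statistics closes the gap between the two runs, the drop was a checkpointing issue rather than an optimization one.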
When you resume training, episodes are typically "terminated" at random times to encourage collection of a diverse set of samples. Otherwise PPO can get stuck in a local minimum.
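One common way to implement the diversification described above is to stagger the initial episode-step counters of the parallel environments at (re)start, so they do not all time out simultaneously. The helper below is a hypothetical sketch of that idea, not rsl_rl's actual code:

```python
import numpy as np

def staggered_episode_starts(num_envs, max_episode_length, rng=None):
    """Sample a random initial step counter for each parallel environment,
    so episodes time out at different points after a resume instead of all
    at once. Hypothetical helper for illustration."""
    rng = np.random.default_rng() if rng is None else rng
    # Each counter lies in [0, max_episode_length); an env whose counter
    # starts near the limit will be reset (and its episode "terminated")
    # early, spreading resets across the rollout.
    return rng.integers(0, max_episode_length, size=num_envs)
```

With staggered resets, the first rollouts after a resume already contain a mix of early-, mid-, and late-episode states, which reduces the correlated samples that can trap PPO in a local minimum.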