Actor Critic Agents are less sample efficient in general (?) since #290 #354
Labels
bug
Something isn't working
discussion
This issue needs further discussion
enhancement
New feature or request
question
Further information is requested
@mmcenta , it seems that some changes in the model since #290 are making A2C and PPO worse on some benchmarks in particular @YannBerthelot 's probing environment tests.
Let's discuss
The text was updated successfully, but these errors were encountered: