- Advantage Actor-Critic (A2C) [1]
- Parallel Advantage Actor-Critic (PAAC) [2]
- Curiosity-driven Exploration by Self-supervised Prediction [3] [5]
- Proximal Policy Optimization Algorithms [4]
- Python 3.6
- gym
- OpenCV Python
- PyTorch
- tensorboardX
Modify the parameters in config.conf as needed.
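
For orientation, here is a minimal sketch of how an INI-style file such as config.conf could be read with Python's standard configparser module. The section and key names (TrainMethod, LearningRate, NumEnv, UseGPU) are hypothetical placeholders chosen for illustration; check the actual config.conf for the real names and values.

```python
import configparser

# Hypothetical contents: the real config.conf may use different
# sections, keys, and values.
SAMPLE_CONF = """
[DEFAULT]
TrainMethod = PPO
LearningRate = 0.0001
NumEnv = 8
UseGPU = True
"""


def load_settings(text):
    """Parse INI-style text and return typed settings as a dict."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    section = parser["DEFAULT"]
    return {
        "train_method": section.get("TrainMethod"),
        "learning_rate": section.getfloat("LearningRate"),
        "num_env": section.getint("NumEnv"),
        "use_gpu": section.getboolean("UseGPU"),
    }


settings = load_settings(SAMPLE_CONF)
print(settings)
```

To read from disk instead of a string, `parser.read("config.conf")` can replace the `read_string` call.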
python train.py
python eval.py
[1] Actor-Critic Algorithms
[2] Efficient Parallel Methods for Deep Reinforcement Learning
[3] Curiosity-driven Exploration by Self-supervised Prediction
[4] Proximal Policy Optimization Algorithms
[5] Large-Scale Study of Curiosity-Driven Learning