This repository contains the implementation and notes of different Reinforcement Learning Algorithms and techniques I have learned throughout the Udacity Deep Reinforcement Learning Nanodegree.
Each algorithm is used to solve different OpenAI gym environements.
- Monte Carlo
- Temporal Difference Algorithms
- Sarsa(0)
- Q-Learning / Sarsamax
- Expected Sarsa
- RL in Continuous Space
- Deep Q-Network
- Double Deep-Q-Network
- Dueling Deep-Q-Network
- Navigation: Train an agent to collect yellow bananas while avoiding blue bananas using
Deep Q-learning Algorithm
. - Continuous Control: Train an robotic arm to reach target locations using
DDPG Algorithm
. - Collaboration and Competition: Train a pair of agents to play tennis using
MADDPG Algorithm
.
Follow the instructions given in the Installation_Guide.md to install the dependencies and run the code present in this repository locally.
I have also provided the notes I created while learning the above mentioned algorithms and techniques. You can find these notes in the RL-Notes folder.
- The RL Framework
- Monte Carlo Methods
- Temporal Difference Methods
- RL in Continuous Space
- Deep Q Networks
- Deep Q-Network (DQN)
- Double DQN
- Prioritized Experience Replay
- Dueling DQN
- Rainbow DQN