Gated Multi-Attention Representation in Reinforcement learning

This implementation contains:

Deep Q-network (DQN)
- vanilla DQN model
RS-DQN
- DQN model with a region-sensitive (RS) module
Local-DQN
- DQN model with a local attention (glimpse network) module
ALSTM
- Attention combined with LSTM based on DQN
GMAQN
- Our work

Dependencies

Run pip3 install -r requirements.txt to install all the required packages.

Training

To train on a local machine or in a local container, run one of the following commands. To train the GMAQN model on Seaquest:

$ python train.py --env Seaquest-v4 --model GMAQN

To train the ALSTM model on Seaquest:

$ python train.py --env Seaquest-v4 --model ALSTM

Grad-CAM visualization videos

Take the Seaquest environment from the Atari 2600 games as an example. Our agent receives visual input as a stream of 210x160px RGB images (top). Grad-CAM marks the regions of each frame that serve as evidence for the current action by overlaying a heat map. The heat maps clearly show the current behavior and offensive policy of the agent.

The heat maps also show that GMAQN learns to replenish oxygen once the agent becomes aware that oxygen is running low. In more detail, in the first picture the submarine is destroying the enemy, while in the second, third, and fourth pictures the agent observes that oxygen is being depleted. The fifth and sixth pictures show the submarine rising to the surface to replenish oxygen. In the seventh picture, the submarine resumes destroying the enemy after replenishing oxygen.
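For readers unfamiliar with Grad-CAM, the heat-map computation can be sketched as follows. This is a minimal NumPy illustration of the standard Grad-CAM formula, not the code in the Visualize folder: the function name `grad_cam` and the `(K, H, W)` input shapes are assumptions made for the example. It also assumes that the feature maps of a convolutional layer and the gradients of the selected action's Q-value with respect to them have already been extracted from the network.

```python
import numpy as np


def grad_cam(activations, gradients):
    """Compute a Grad-CAM heat map (hypothetical helper for illustration).

    activations: feature maps of one conv layer, shape (K, H, W).
    gradients:   gradients of the chosen action's score with respect to
                 those feature maps, same shape (K, H, W).
    Returns an (H, W) map scaled to [0, 1].
    """
    activations = np.asarray(activations, dtype=np.float64)
    gradients = np.asarray(gradients, dtype=np.float64)

    # Channel weights: global-average-pool the gradients over space.
    weights = gradients.mean(axis=(1, 2))  # shape (K,)

    # Weighted sum of the feature maps, followed by ReLU so that only
    # regions with a positive influence on the score remain.
    cam = np.maximum(np.tensordot(weights, activations, axes=1), 0.0)

    # Normalize to [0, 1]; leave an all-zero map unchanged.
    peak = cam.max()
    return cam / peak if peak > 0 else cam


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Stand-in tensors; in practice these come from the trained network.
    acts = rng.random((64, 7, 7))
    grads = rng.standard_normal((64, 7, 7))
    heat = grad_cam(acts, grads)
    print(heat.shape)
```

To overlay the result on a game frame, the low-resolution map would still need to be upsampled to the 210x160 frame size and blended with the image; that step is left out here.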
