Skip to content

Latest commit

 

History

History
 
 

DeepQNetwork

breakout

video demo

Reproduce (performance of) the following reinforcement learning methods:

Performance & Speed

Claimed performance in the paper can be reproduced, on several games I've tested with.

DQN

On one (Maxwell) TitanX, Double-DQN took ~18 hours of training to reach a score of 400 on breakout.

Double-DQN with nature paper setting runs at 60 batches (3840 trained frames, 240 seen frames, 960 game frames) per second on TitanX.

How to use

Install ALE and gym.

Download an atari rom, e.g.:

wget https://github.com/openai/atari-py/raw/master/atari_py/atari_roms/breakout.bin

Start Training:

./DQN.py --rom breakout.bin
# use `--algo` to select other DQN algorithms. See `-h` for more options.

Watch the agent play:

# Download pretrained models or use one you trained:
wget http://models.tensorpack.com/DeepQNetwork/DoubleDQN-Breakout.npz
./DQN.py --rom breakout.bin --task play --load DoubleDQN-Breakout.npz

A3C code and models for Atari games in OpenAI Gym are released in examples/A3C-Gym