This is one of my Udacity Deep Reinforcement Learning Nanodegree projects. Most of the code is based on what I learned from the course.
For this project, we will train an agent to navigate and collect bananas!
A reward of +1 is given for collecting a yellow banana, and a reward of -1 for collecting a blue banana. Our goal is to collect as many yellow bananas as possible while avoiding blue bananas.
The state space consists of 37 dimensions, including the agent's velocity, directions, and so on. The agent has four discrete actions:

- `0` - move forward
- `1` - move backward
- `2` - turn left
- `3` - turn right
The task is episodic. The goal is to train the agent to get an average score of +13 over 100 consecutive episodes.
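The solving criterion above can be sketched as follows (a minimal illustration with hypothetical placeholder scores, not output of the actual agent; `is_solved` is an illustrative helper, not a function from this repository):

```python
from collections import deque

def is_solved(scores_window, target=13.0):
    """The task counts as solved when the average score over the
    last 100 consecutive episodes reaches the target of +13."""
    return len(scores_window) == 100 and sum(scores_window) / 100 >= target

scores_window = deque(maxlen=100)   # keeps only the 100 most recent scores
for episode in range(1, 2001):
    score = 14.0                    # placeholder for one episode's actual return
    scores_window.append(score)
    if is_solved(scores_window):
        break
print(episode)  # with these placeholder scores, solved at episode 100
```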
- Create (and activate) a new environment with Python 3.6.

  - Linux or Mac:

    ```
    conda create --name drlnd python=3.6
    source activate drlnd
    ```

  - Windows:

    ```
    conda create --name drlnd python=3.6
    activate drlnd
    ```
- Clone this repository and navigate to the `python/` folder. Then, install several dependencies.

  ```
  git clone https://github.com/nithiroj/DQN-Navigation.git
  cd DQN-Navigation/python
  pip install .
  ```
- Download the environment that matches your operating system, place it in the `DQN-Navigation/` folder, and unzip it.

  - Linux: click here
  - Mac OSX: click here (`Banana.app` already in this repository)
  - Windows (32-bit): click here
  - Windows (64-bit): click here
In this repository, there are four trained algorithms: basic or vanilla DQN (`basic.pth`), double DQN (`double.pth`), dueling DQN (`dueling.pth`), and combined double and dueling DQN (`double_dueling.pth`). The trained models and results (graph and scores) have been provided in `model/` and `report/` respectively.
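As a rough illustration of how the double-DQN extension changes the learning target relative to basic DQN (a pure-Python sketch of the target computation, not this project's actual training code; the function names and toy Q-values are hypothetical):

```python
def dqn_target(q_target_next, reward, gamma=0.99, done=False):
    """Basic DQN target: bootstrap from the max Q-value of the target network."""
    return reward + gamma * max(q_target_next) * (0.0 if done else 1.0)

def double_dqn_target(q_online_next, q_target_next, reward, gamma=0.99, done=False):
    """Double DQN target: the online network selects the next action,
    while the target network evaluates it, reducing overestimation bias."""
    best = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return reward + gamma * q_target_next[best] * (0.0 if done else 1.0)

# Toy Q-values: the two targets differ when the networks disagree
q_online_next = [1.0, 3.0, 2.0]   # hypothetical online-network Q-values
q_target_next = [2.5, 0.5, 1.5]   # hypothetical target-network Q-values
print(dqn_target(q_target_next, reward=1.0))                        # ~3.475
print(double_dqn_target(q_online_next, q_target_next, reward=1.0))  # ~1.495
```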
You can retrain these models or train them with your own hyperparameters. Please note that the provided models and results will be overwritten accordingly, so make copies first if you want to keep ours.
Follow the instructions in `Navigation.ipynb` to get started with training your own agent!
Alternatively, run from the command line in your terminal. Define `--env_file` if your environment file is not `Banana.app`.
```
python navigation.py                     # to train with basic DQN model
python navigation.py --double            # to train with double DQN model
python navigation.py --dueling           # to train with dueling DQN model
python navigation.py --double --dueling  # both double and dueling DQN model
```
```
optional arguments:
  -h, --help           show this help message and exit
  --play_eps PLAY_EPS  Train if 0 else play episodes, default 0
  --env_file ENV_FILE  Unity environment binary file, default Banana.app
  --dueling            Enable dueling DQN
  --double             Enable double DQN
```
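A minimal `argparse` setup that would produce the help text above might look like this (a sketch under the assumption that the repository's `navigation.py` uses the standard-library `argparse` module; the actual script may differ):

```python
import argparse

def build_parser():
    """Build a command-line interface matching the options listed above."""
    parser = argparse.ArgumentParser(description="Train or play a DQN agent.")
    parser.add_argument("--play_eps", type=int, default=0,
                        help="Train if 0 else play episodes, default 0")
    parser.add_argument("--env_file", type=str, default="Banana.app",
                        help="Unity environment binary file, default Banana.app")
    parser.add_argument("--dueling", action="store_true", help="Enable dueling DQN")
    parser.add_argument("--double", action="store_true", help="Enable double DQN")
    return parser

args = build_parser().parse_args(["--double", "--play_eps", "2"])
print(args.double, args.dueling, args.play_eps)  # True False 2
```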
To watch how the agent performs, define the number of playing episodes with `--play_eps` and the agent you prefer. For example, to watch a double-DQN agent play for two episodes:

```
python navigation.py --double --play_eps 2
```
You can find more details on the implementation (algorithms, including DQN and its extensions; model architectures; and chosen hyperparameters) and the achieved rewards in `Report.ipynb`.