Risk-aware RL using rainbow IQN

Project Overview : Generate a high-frequency crypto trading program via Reinforcement Learning

Problem Statement: Gain high earnings from leverage trading.

Leveraged trading incurs higher risk than spot trading, so a risk-aware algorithm is needed. I will use the IQN algorithm to better cope with those risks.

Metrics: Return rate

Simulate trading on test data taken from a period after the training data, and measure the return rate.
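As a minimal sketch of this metric, the return rate can be computed from the account equity at the start and end of the test period, with the test rows taken strictly after the training rows. The function names `return_rate` and `split_by_time` are hypothetical and are not taken from the repository.

```python
def return_rate(initial_equity: float, final_equity: float) -> float:
    """Fractional return over the test period, e.g. 0.25 means +25%."""
    if initial_equity <= 0:
        raise ValueError("initial_equity must be positive")
    return final_equity / initial_equity - 1.0


def split_by_time(rows, split_timestamp):
    """Split (timestamp, ...) rows so the test set starts after the training set."""
    train = [row for row in rows if row[0] < split_timestamp]
    test = [row for row in rows if row[0] >= split_timestamp]
    return train, test
```

Splitting by time rather than at random keeps future prices out of the training data.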

Usage

Train

python train_leverage.py --data_path=binance_futures_1d_train.db

Test

python test.py --data_path=binance_futures_1d_test.db --load_file=saves/2021-12-31-01:14:16/IQN_leverage_1200.pth

To change the risk preference, change RISK_AVERSE in default_hyperparameters.py.

Requirements

The libraries we used are listed in requirements.txt.

Data Exploration

We use daily Bitcoin OHLCV (open-high-low-close-volume) data ranging from 2017-01-01 to 2021-12-30.

For more information, please refer to the data exploration notebook.

We learned that volume data and price data have different scales, while the price columns (open-high-low-close) have similar scales to each other.

So, we normalize the data in two ways: the price columns are divided by the first open price of each window, and the volume column is divided by the first volume of each window. To be specific,

Lines from model.py (L139~L140):

state[:,:,:volume_axis] /= state[:,0,open_axis].reshape(bs,1,1)
state[:,:,volume_axis] /= state[:,0,volume_axis].reshape(bs,1,1)
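The normalization above can be sketched as a standalone NumPy function. This is an illustration under assumptions, not the code from model.py: the column order (open, high, low, close, volume), the constants `OPEN_AXIS` and `VOLUME_AXIS`, and the function name `normalize_ohlcv` are all hypothetical. The sketch selects the volume column with a slice so that the divisor and the target keep the same number of dimensions and broadcasting is explicit.

```python
import numpy as np

# Assumed column layout for illustration: open, high, low, close, volume.
OPEN_AXIS, VOLUME_AXIS = 0, 4


def normalize_ohlcv(state: np.ndarray) -> np.ndarray:
    """Scale each window by its own first open price and first volume.

    state has shape (batch, time, 5); a normalized copy is returned.
    """
    state = state.astype(np.float64, copy=True)
    # Copy the divisors so they are not modified by the in-place division.
    first_open = state[:, :1, OPEN_AXIS:OPEN_AXIS + 1].copy()        # (batch, 1, 1)
    first_volume = state[:, :1, VOLUME_AXIS:VOLUME_AXIS + 1].copy()  # (batch, 1, 1)
    state[:, :, :VOLUME_AXIS] /= first_open      # all four price columns
    state[:, :, VOLUME_AXIS:] /= first_volume    # volume column only
    return state
```

After this step every window starts at an open price of 1 and a volume of 1, so windows taken from different price regimes become comparable.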

Methodology

Implementation

Model
: Attention Model + RL (IQN)

Input : N time steps of BTC OHLCV data (N * 5)
Output : Decision (-100% short to 100% long)


Environment
: 3X Leverage Env


Hyperparameters
: learning_rate
: tau (controls how sensitively the model responds to risk)
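To make the 3X leverage setting concrete, here is a minimal sketch of how account equity could evolve over one bar for a position between -100% short and 100% long. It is an assumption-laden illustration, not the logic of leverage_trading_env.py: the function name `step_equity` is hypothetical, and fees, funding, and maintenance margin are ignored.

```python
LEVERAGE = 3.0  # matches the 3X environment listed above


def step_equity(equity: float, position: float, price_return: float,
                leverage: float = LEVERAGE) -> float:
    """Update account equity after one bar.

    position is in [-1, 1] (-1 = 100% short, 1 = 100% long) and
    price_return is the fractional price change over the bar.
    Equity is floored at zero to model liquidation.
    """
    if not -1.0 <= position <= 1.0:
        raise ValueError("position must be within [-1, 1]")
    new_equity = equity * (1.0 + leverage * position * price_return)
    return max(new_equity, 0.0)
```

Under these simplifications a 10% price rise with a full long position yields +30%, while an adverse move of one third or more wipes out the account, which is why liquidation is the dominant risk.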

Attention, now widely known through the Transformer model, is known to perform well on time-series data.

IQN is a distributional reinforcement learning algorithm that is fundamentally value-based. Unlike classical DQN, which predicts a point estimate of each action's value, IQN predicts the full distribution of that value. Because IQN learns the distribution through its quantiles, the agent's response to risk can be adjusted by controlling the hyperparameter $\tau$.
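The role of $\tau$ can be illustrated with a toy example. The sketch below is not the repository's implementation: the names `quantile_fractions` and `select_action` and the parameter `eta` are hypothetical, the value distributions are hand-made normal distributions rather than network outputs, and the quantile fractions are evenly spaced for reproducibility, whereas IQN samples them at random. A risk-averse agent averages only the lower quantiles (fractions below `eta`), which corresponds to a CVaR-style criterion.

```python
from statistics import NormalDist


def quantile_fractions(n: int, risk_averse: bool, eta: float = 0.25) -> list[float]:
    """Return n evenly spaced fractions in (0, eta) if risk averse, else in (0, 1)."""
    upper = eta if risk_averse else 1.0
    return [upper * (i + 0.5) / n for i in range(n)]


def select_action(value_dists: list[NormalDist], taus: list[float]) -> int:
    """Pick the action whose quantile values, averaged over taus, are highest."""
    scores = [sum(d.inv_cdf(t) for t in taus) / len(taus) for d in value_dists]
    return max(range(len(scores)), key=scores.__getitem__)


# Toy value distributions standing in for a trained network's output:
# action 0 is modest but stable, action 1 has a higher mean but a wide spread.
dists = [NormalDist(mu=1.0, sigma=0.5), NormalDist(mu=1.5, sigma=5.0)]

risk_neutral_choice = select_action(dists, quantile_fractions(64, risk_averse=False))
risk_averse_choice = select_action(dists, quantile_fractions(64, risk_averse=True))
```

In this toy setup the risk-neutral agent picks the high-variance action 1, while the risk-averse agent picks the stable action 0, because only the lower tail of each distribution counts toward its score.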

Results

Yield on test data

Baseline strategy : Buy & hold

Test yield : 26.6% on average [0.19, 0.26, -1.09, 1.38, 0.80, 0.23, 0.15, 0.54, 1.23, -1.05]

python test_baseline.py --data_path=binance_futures_1d_test.db

Experimental strategy : IQN

Test yield : 205.3% on average [-0.68, 0.19, 3.74, 6.83, 1.90, 2.31, 3.26, 1.82, 0.68, 0.45]

python test.py --data_path=binance_futures_1d_test.db --load_file=saves/IQN_1d/IQN_leverage_1200.pth
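For a quick comparison beyond the average, the listed yields can be summarized with the standard library. This sketch assumes that each list holds ten per-run fractional yields; the helper name `summarize` is hypothetical. Because the listed values are rounded to two decimals, means recomputed from them may differ slightly from the reported averages.

```python
from statistics import mean

# Per-run yields as listed above (fractions, so 0.19 means +19%).
baseline_yields = [0.19, 0.26, -1.09, 1.38, 0.80, 0.23, 0.15, 0.54, 1.23, -1.05]
iqn_yields = [-0.68, 0.19, 3.74, 6.83, 1.90, 2.31, 3.26, 1.82, 0.68, 0.45]


def summarize(yields: list[float]) -> dict:
    """Return the mean yield, the worst run, and the number of losing runs."""
    return {
        "mean": mean(yields),
        "worst": min(yields),
        "losing_runs": sum(1 for y in yields if y < 0),
    }


baseline_summary = summarize(baseline_yields)
iqn_summary = summarize(iqn_yields)
```

Looking at the worst run and the number of losing runs alongside the mean gives a rough view of downside risk, which matters for a risk-aware strategy.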

Conclusion

Reflection

Our strategy outperformed the baseline strategy. However, it still suffered from liquidation, which is a critical risk of using leverage. It needs more tuning before it can be put into real use.

Further work

[ ] Trade multiple assets at the same time rather than Bitcoin only

A blog post is available here.

Visualization of Trading result on 3X leverage Trading Env.
