This project implements a game-playing agent based on the AlphaZero algorithm, inspired by DeepMind's paper "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm". The agent learns through reinforcement learning by playing games against itself: it generates training data with a modified Monte Carlo tree search (MCTS) and then trains on that data.
- Google DeepMind:
  - Mastering the game of Go without human knowledge
  - Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
  - Human-level control through deep reinforcement learning
- Self-play: the agent plays games against itself to generate training data.
- Neural network: a single network estimates both the move policy and the position value.
- Reinforcement learning: the agent is trained with reinforcement learning on its own games.
- Game environments: support for multiple games (TicTacToe, ConnectFour, etc.).
- Data creation: game data is created and saved for the model to train on.
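The self-play data generation described above can be sketched as follows. This is a toy illustration, not the project's actual API: the class and function names are hypothetical, and the MCTS visit counts are replaced with random placeholders.

```python
import random

def self_play_episode(num_moves=5, num_actions=9):
    """Toy sketch of AlphaZero-style data generation.

    Each position is stored with its policy target; once the game
    ends, every position is labelled with the final outcome as its
    value target. Names and shapes here are illustrative only.
    """
    history = []  # (state, policy) pairs collected during the game
    state = tuple([0] * num_actions)
    for _ in range(num_moves):
        # In AlphaZero the policy target comes from normalized MCTS
        # visit counts; here we fake it with a random distribution.
        visits = [random.random() for _ in range(num_actions)]
        total = sum(visits)
        policy = [v / total for v in visits]
        history.append((state, policy))
        # Play the most-visited move and update the toy state.
        action = max(range(num_actions), key=lambda a: policy[a])
        s = list(state)
        s[action] = 1
        state = tuple(s)
    outcome = random.choice([-1, 0, 1])  # placeholder game result
    return [(s, p, outcome) for s, p in history]

data = self_play_episode()
```

The key idea is that the value target is only known at the end of the game, so positions are buffered and labelled retroactively.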
- Make sure you are running Python 3.8 to 3.11.
- Clone the repository:
git clone https://github.com/amanmoon/general_alpha_zero.git
- Navigate to the project directory:
cd general_alpha_zero
- Create a virtual environment:
python3.10 -m venv <venv name>
- Activate the virtual environment:
source <venv name>/bin/activate
- Install dependencies:
pip install -r requirements.txt
- When you are done, deactivate the virtual environment:
deactivate
- Choose the game you wish to train a model for and import the corresponding classes in Train.py.
- Choose appropriate hyperparameters in args.
- Run the training script:
python3 Train.py
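An args dictionary for training might look like the sketch below. The key names and default values are assumptions for illustration; check Train.py for the exact keys the project expects.

```python
# Hypothetical hyperparameter dictionary for Train.py; the actual
# keys expected by the project may differ.
args = {
    "num_iterations": 100,       # self-play -> train cycles
    "num_self_play_games": 50,   # games generated per iteration
    "num_mcts_simulations": 25,  # tree searches per move
    "c_puct": 1.0,               # exploration constant in PUCT
    "learning_rate": 1e-3,
    "batch_size": 64,
    "epochs": 10,                # training passes per iteration
}
```

More MCTS simulations per move produce stronger policy targets but slow down data generation, so this is the main speed/quality trade-off to tune.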
- Choose the player you wish to play as and adjust the search parameters in args inside Play.py.
- Import the correct trained model.
- Run the play script:
python3 Play.py
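One search parameter commonly exposed in AlphaZero-style play scripts is a temperature that controls how moves are chosen from MCTS visit counts. The helper below is a generic sketch of that idea, not necessarily what Play.py implements.

```python
import random

def select_move(visit_counts, temperature=1.0):
    """Pick a move index from MCTS visit counts.

    temperature == 0 plays greedily (the most-visited move);
    higher temperatures play more exploratory moves.
    Illustrative only; the project's Play.py may differ.
    """
    if temperature == 0:
        return max(range(len(visit_counts)), key=lambda a: visit_counts[a])
    # Sharpen or flatten the visit distribution, then sample from it.
    weights = [c ** (1.0 / temperature) for c in visit_counts]
    return random.choices(range(len(visit_counts)), weights=weights)[0]
```

For playing against a trained model you typically want temperature near 0, so the agent always plays its strongest move.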
- Choose the models you wish to pit against each other inside the Arena file.
- Run the arena script:
python3 Arena.py
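The arena evaluation amounts to a head-to-head loop that tallies wins, losses, and draws. The sketch below uses a stub game and function-valued players to show the shape of that loop; it is not the project's Arena implementation.

```python
import random

def play_one_game(player_a, player_b):
    """Stub game used only to exercise the arena loop below."""
    score_a = sum(player_a(step) for step in range(3))
    score_b = sum(player_b(step) for step in range(3))
    return 1 if score_a > score_b else -1 if score_b > score_a else 0

def arena(player_a, player_b, num_games=20):
    """Pit two players head to head and tally the results."""
    wins_a = wins_b = draws = 0
    for _ in range(num_games):
        result = play_one_game(player_a, player_b)
        if result == 1:
            wins_a += 1
        elif result == -1:
            wins_b += 1
        else:
            draws += 1
    return wins_a, wins_b, draws

# Two identical random "players"; a real arena would load two models.
random_player = lambda step: random.random()
results = arena(random_player, random_player)
```

In AlphaZero-style pipelines this kind of loop is often used to decide whether a newly trained model should replace the current best one, e.g. by requiring a win rate above some threshold.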
For questions or suggestions, feel free to reach out at [email protected].