A2C-ConvLSTM-Starcraft2

This is an old repository and code is not working properly. Sequences of (s,a,r) should be batched properly with timestep dimension.

A2C with ConvLSTM agent playing Starcraft 2 (DeepMind's FullyConv LSTM)

Synchronous Advantage Actor Critic (synchronous variation of the A3C) with Convolutional LSTM playing Starcraft 2 using DeepMind's API pysc2.

The code is based on pekaalto's FullyConv Net, although there are some modifications of the original version, and there is the ConvLSTM added after the state concatenation. Please note that there is no PPO active here and the code is for experimentation purposes.

Dependencies

Python 3
pysc2
Tensorflow

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
actorcritic		actorcritic
common		common
LICENSE		LICENSE
README.md		README.md
network_all.py		network_all.py
run_agent.py		run_agent.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A2C-ConvLSTM-Starcraft2

This is an old repository and code is not working properly. Sequences of (s,a,r) should be batched properly with timestep dimension.

Dependencies

About

Releases

Packages

Languages

License

ConstantinosM/A2C-ConvLSTM-Starcraft2-with-pysc2

Folders and files

Latest commit

History

Repository files navigation

A2C-ConvLSTM-Starcraft2

This is an old repository and code is not working properly. Sequences of (s,a,r) should be batched properly with timestep dimension.

Dependencies

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages