Based on PARL, the MADDPG algorithm of deep reinforcement learning has been reproduced.
Paper: MADDPG in Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
A simple multi-agent particle world based on gym. Please see here to install and know more about the environment.
Mean episode reward in training process (totally 25000 episodes).
simple |
simple_adversary |
simple_push |
simple_crypto |
simple_speaker_listener |
simple_spread |
simple_tag |
simple_world_comm |
- python3.7+
- paddlepaddle>=2.0.0
- parl>=2.1.1
- PettingZoo==1.17.0
- gym==0.23.1
# To train an agent for simple_speaker_listener scenario
python train.py
# To train for other scenario, model is automatically saved every 1000 episodes
python train.py --env [ENV_NAME]
# To show animation effects after training
python train.py --env [ENV_NAME] --show --restore
# To train and evaluate scenarios with continuous action spaces
python train.py --env [ENV_NAME] --continuous_actions
python train.py --env [ENV_NAME] --continuous_actions --show --restore