PyTorch-Counterfactual-Multi-Agent-Policy-Gradients-COMA

This is a PyTorch implementation of the paper

Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." Thirty-Second AAAI Conference on Artificial Intelligence. 2018.

in the multi-agent environment "findgoals" (https://github.com/Bigpig4396/Multi-Agent-Reinforcement-Learning-Environment). A description of the environment is in 'FindGoals.pdf'.
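For readers new to the paper, the counterfactual advantage at the heart of COMA can be sketched in a few lines. This is a minimal illustration written for this README, not code taken from 'COMA2.py': the function name `counterfactual_advantage` and its arguments are hypothetical, and plain Python lists stand in for tensors so the snippet has no dependencies. It assumes the critic has already produced one Q-value per action of a single agent, with the other agents' actions held fixed.

```python
def counterfactual_advantage(q_values, policy_probs, action):
    """Counterfactual advantage for one agent (illustrative sketch).

    q_values[i]     : critic estimate Q(s, (u^-a, i)) when this agent takes
                      action i and all other agents' actions stay fixed.
    policy_probs[i] : this agent's policy probability pi^a(i | tau^a).
    action          : index of the action the agent actually took.
    """
    if len(q_values) != len(policy_probs):
        raise ValueError("q_values and policy_probs must have the same length")

    # Counterfactual baseline: expected Q under the agent's own policy,
    # marginalising out only this agent's action.
    baseline = sum(p * q for p, q in zip(policy_probs, q_values))

    # Advantage of the chosen action relative to that baseline.
    return q_values[action] - baseline


# Hypothetical numbers for a single agent with three actions.
q_values = [1.0, 2.0, 4.0]
policy_probs = [0.5, 0.25, 0.25]
advantage = counterfactual_advantage(q_values, policy_probs, action=2)
print(advantage)  # 2.0
```

Because the baseline depends only on the other agents' actions and the agent's own policy, the advantage averages to zero under that policy, which is why subtracting it does not bias the policy gradient.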

You need to install opencv-python and PyTorch to run the code. Run 'COMA2.py' to start training; it produces a training curve like the one in 'curve.png'.
