multiagent-comm

This is the repository of Mihir Patel, Nikhil Sardana, and Vinjai Vale for our CS 234 (Reinforcement Learning) final project, "Multi-Agent Cooperation Against Adversarial Agents Through Communication." The final report is available here.

This project is an exploration of communication in multi-agent reinforcement learning. We mostly considered the performance and behavior of the MADDPG algorithm in more complex environments, along with the parameterization of communication. Hence, this codebase is forked from openai/maddpg and openai/multiagent-particle-envs.

To set up this repository, please see the instructions in ENVIRONMENT.md. This is the README from the Multi-Agent Particle Environment repository.

Our additions include two complex cooperative/competitive environments involving communication. These are:

multiagent/scenarios/adversary_simple_listener.py. An adversarial agent is added to simple_speaker_listener. The speaker/listener pair must learn encrypt communication to reach the goal while preventing an adversary from doing the same.
multiagent/scenarios/commplex_tag.py. Predators are given communication abilities in simple_tag but have vision restricted and the environment becomes partially observable.

Above: Our adversarial speaker-listener environment. The small circles are the landmarks. In each episode, a random landmark is chosen to be the target. Only the speaker (grey) knows the target's identity. The speaker cannot move, instead, it can only communicate over a public channel to a listener and adversary (purple). The listener and adversary both move during the episode to minimize their distance to the target. The speaker must use a private key (shared with only the listener) and public communication to tell the listener the identity of the target landmark without disclosing it to the adversary. In the figure above, the listener is colored a light shade of the target. As shown above, when the adversary receives noise instead of communication, it learns to find the centroid of the three landmarks.

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
bin		bin
experiments		experiments
img		img
maddpg		maddpg
multiagent		multiagent
.gitignore		.gitignore
ENVIRONMENT.md		ENVIRONMENT.md
LICENSE.txt		LICENSE.txt
README.md		README.md
experiments.txt		experiments.txt
make_env.py		make_env.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

multiagent-comm

About

Releases

Packages

Contributors 3

Languages

License

fractal1729/multiagent-comm

Folders and files

Latest commit

History

Repository files navigation

multiagent-comm

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages