Team-based Multi-agent Reinforcement Learning

Research Report

Abstract

In multi-agent reinforcement learning (MARL), differentiating between agent intelligence and organization intelligence may hold the key to major breakthroughs.

This project separates the encoding of agent intelligence from that of organization intelligence. Agents are programmed with a simple, naive policy algorithm, but they are organized into teams and provided with US-versus-THEM context. Organization intelligence is encoded separately in each team's culture, which determines how the team reward is distributed to its agents on top of the environmental rewards they gather during training.
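As a rough illustration of this reward structure (not code from the repository; the function name and the representation of culture as per-agent share weights are hypothetical), the sketch below shows how a team reward could be added to each agent's environmental reward according to the team's culture:

```python
# Minimal sketch of culture-based reward shaping, assuming culture is a
# hypothetical dict of per-agent share weights. The actual repository may
# represent culture and distribute team reward differently.

def shape_rewards(env_rewards, team_reward, culture):
    """Combine each agent's environmental reward with its culture-determined
    share of the team reward."""
    total_weight = sum(culture.values()) or 1.0
    shaped = {}
    for agent_id, env_r in env_rewards.items():
        share = culture.get(agent_id, 0.0) / total_weight
        shaped[agent_id] = env_r + share * team_reward
    return shaped

# Example: an egalitarian culture splits the team reward evenly, while a
# biased culture concentrates it on one agent, encouraging specialization.
env_rewards = {"agent_0": 1.0, "agent_1": 0.0}
egalitarian = {"agent_0": 1.0, "agent_1": 1.0}
print(shape_rewards(env_rewards, team_reward=2.0, culture=egalitarian))
# {'agent_0': 2.0, 'agent_1': 1.0}
```

Changing only these weights changes the behavior the team is trained toward, without touching the agents' policy algorithm.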

With agent and organization intelligence separated, the methodology remains mathematically and computationally simple. It scales easily with the number of agents and teams, and it enables teams of agents to achieve a wide range of desired results and behaviors with only slight changes to the team culture and no change to the agents' policy algorithm.

The new approach enables teams of agents to easily exceed the performance of agents trained under "state-of-the-art" MARL algorithms. In addition, the use of team reward in the culture can lead to agent specialization, which allows a team of specialized agents to build a dominating strategy for a game that was previously intransitive to multiple individual agents.

Installation

pip install -r requirements.txt