GitHub - BSski/HIVE: :honeybee: Neural network-driven multi-agent Prisoner's Dilemmas.

WARNING: VERY OLD CODE
Most of the code was written in 2020. I have learned a lot since then and I am aware of the poor quality of the code.

HIVE

📜 Project description

Platform created to facilitate conducting spatial multi-agent iterated prisoner's dilemma experiments between groups controlled by RL algorithm incorporating artificial neural networks. Uses Double Dueling Deep Q-Network, a deep reinforcement learning algorithm.

Based on keras-rl2.
Simultaneous agent inspired by interleaved.py by Velochy.

🔨 Technologies used

Python 3.7.11
Pygame 1.9.6
Numpy 1.18.5
OpenAI gym 0.17.2
Matplotlib 3.3.0
Keras_rl2 1.0.4
Tensorflow 2.3.0
Theano 1.0.5

⬆️ Room for improvement

This is an old project of mine and it would certainly benefit from:

refactoring the code into better functions,
refactoring the code into different files,
getting rid of many anti-patterns,
having tests written for it.

📞 Contact

contact.bsski@gmail.com

📈 Changelog:

📅 01.02.2021

no GUI version for faster experiments

📅 03.01.2021

board's nature is hexagonal now instead of rectangular
added optional controllable debug agent

📅 23.12.2020

the program is running in cycles now, X simulations in a row
added appending data to .csv files after each simulation

📅 10.12.2020

using DQN instead of DDPG (DDPG couldn't handle TFT)
UI changes

📅 03.12.2020

using DDPG instead of DQN now
agents have parameters that impact their actions

📅 21.11.2020

agents now exist on 2d 43x43 plane and interact when on the same tile
added visualisation done in pygame

📅 04.11.2020

more than two players can play now (interleaved training)
added 7 NPCs with popular PD strategies
players now include current episode step and type of the enemy in their observations

📅 28.10.2020

created stable, developable version of two neural nets playing PD against each other using:
- custom OpenAI gym
- custom keras-rl2 agent
- custom keras-rl2 callbacks

📅 21.09.2020

created OpenAI custom gym environment
connected it to keras-rl2, doesn't work yet, need to create a proper network for it

📅 16.09.2020

created the repository
added prisoner's dilemma basic mechanism

👷 Author

@BSski

🔓 License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 162 Commits
Sample experiments data		Sample experiments data
csv output		csv output
gym-tdh		gym-tdh
npcs		npcs
simagent		simagent
sprites		sprites
HIVE GUI.png		HIVE GUI.png
LICENSE		LICENSE
PL HIVE Presentation.pdf		PL HIVE Presentation.pdf
README.md		README.md
callbacks.py		callbacks.py
dqn.py		dqn.py
grid.py		grid.py
main_nn.py		main_nn.py
positions.py		positions.py
requirements.txt		requirements.txt

License

BSski/HIVE

Folders and files

Latest commit

History

Repository files navigation

HIVE

Table of contents

📜 Project description

🔨 Technologies used

⬆️ Room for improvement

📞 Contact

📈 Changelog:

👷 Author

🔓 License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages