Multi_Agent_Soft_Actor_Critic

A Pytorch Implementation of Multi Agent Soft Actor Critic

Project Details

The environment consists of multiple agents where the task of the agent hit the ball and keep it in the air without allowing it to fall on the ground.
The current state of the environment is represented by 24 dimensional feature vector which conist the position of the ball and speed of the ball
A reward of +0.1 is provided for time the agent's hits the ball and -0.1 if the agent miss it or shoots the ball away from the court.
The task is episoidic, and in order to solve the environment, the agent must get an average score of +0.5 over 100 consecutive episodes

Technical Dependencies

Python 3.6 :
PyTorch (0.4,CUDA 9.0) : pip3 install torch torchvision
ML-agents (0.4) : Refer to ml-agents for installation
Numpy (1.14.5) : pip3 install numpy
Matplotlib (3.0.2) : pip3 install matplotlib
Jupyter notebook : pip3 install jupyter
Download the environment from here and place it in the same folder as that of Tennis.ipynb file

Network details

Installation Instructions :

step 1 : Install all the dependencies
step 2 : git clone https://github.com/adithya-subramanian/Multi_Agent_Soft_Actor_Critic.git
step 3 : jupyter notebook
step 4 : Run all cells in the Tennis.ipynb file

Acknowledgment

Certain parts of SAC.py,model.py and Tennis.ipynb has been partially taken from the Udacity's deep reinforcement learning Nanodegree.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md
Report.pdf		Report.pdf
SAC.py		SAC.py
Tennis.ipynb		Tennis.ipynb
checkpoint_actor_agent_1.pth		checkpoint_actor_agent_1.pth
checkpoint_actor_agent_2.pth		checkpoint_actor_agent_2.pth
checkpoint_critic_q_1_local_agent_1.pth		checkpoint_critic_q_1_local_agent_1.pth
checkpoint_critic_q_1_local_agent_2.pth		checkpoint_critic_q_1_local_agent_2.pth
checkpoint_critic_q_2_local_agent_1.pth		checkpoint_critic_q_2_local_agent_1.pth
checkpoint_critic_q_2_local_agent_2.pth		checkpoint_critic_q_2_local_agent_2.pth
checkpoint_critic_v_local_agent_1.pth		checkpoint_critic_v_local_agent_1.pth
checkpoint_critic_v_local_agent_2.pth		checkpoint_critic_v_local_agent_2.pth
checkpoint_critic_v_target_agent_1.pth		checkpoint_critic_v_target_agent_1.pth
checkpoint_critic_v_target_agent_2.pth		checkpoint_critic_v_target_agent_2.pth
model.py		model.py

adithya-subramanian/Multi_Agent_Soft_Actor_Critic

Folders and files

Latest commit

History

Repository files navigation

Multi_Agent_Soft_Actor_Critic

Project Details

Technical Dependencies

Network details

Installation Instructions :

Acknowledgment

About

Topics

Resources

Stars

Watchers

Forks

Languages