Federated Reinforcement Learning

We aim to let multiple reinforcement learning agents each learn an optimal control policy on their own IoT device, where the devices are of the same type but have slightly different dynamics. With such devices, there is no guarantee that an agent that interacts with only one device and learns its optimal control policy will also control another device well. Each device may therefore need its own independent reinforcement learning run, which is costly and time-consuming.

To solve this problem, we propose a new federated reinforcement learning architecture in which each agent, working on its own IoT device, shares its learning experience (i.e., the gradient of the loss function) with the other agents and transfers mature policy model parameters to them. The other agents accelerate their learning by using these mature parameters. We incorporate the Actor-Critic PPO algorithm into each agent in the proposed collaborative architecture and propose an efficient procedure for gradient sharing and model transfer.
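The two mechanisms can be illustrated with a minimal sketch. It assumes that shared gradients are combined by an element-wise mean and applied with a plain gradient-descent step; the actual combination rule, optimizer, and data structures used in this repository may differ. All function names and the `policy.w` parameter name are hypothetical.

```python
from typing import Dict, List

Gradients = Dict[str, List[float]]
Params = Dict[str, List[float]]


def average_gradients(worker_grads: List[Gradients]) -> Gradients:
    """Element-wise mean of the gradients reported by all workers (assumed rule)."""
    if not worker_grads:
        raise ValueError("need at least one worker gradient")
    averaged: Gradients = {}
    for name in worker_grads[0]:
        # Group the i-th gradient entry of every worker together.
        columns = zip(*(g[name] for g in worker_grads))
        averaged[name] = [sum(col) / len(worker_grads) for col in columns]
    return averaged


def sgd_step(params: Params, grads: Gradients, lr: float) -> Params:
    """Return new parameters after one plain gradient-descent step."""
    return {
        name: [p - lr * g for p, g in zip(values, grads[name])]
        for name, values in params.items()
    }


def transfer_parameters(mature: Params) -> Params:
    """Copy a mature model's parameters so another agent can start from them."""
    return {name: list(values) for name, values in mature.items()}


# Two workers report gradients for the same toy policy layer.
grads_a = {"policy.w": [0.2, -0.4]}
grads_b = {"policy.w": [0.4, 0.0]}
shared = average_gradients([grads_a, grads_b])

# One agent applies the shared gradient, then hands its parameters to another.
params = {"policy.w": [1.0, 1.0]}
updated = sgd_step(params, shared, lr=0.5)
new_agent_params = transfer_parameters(updated)
```

Copying the parameter lists in `transfer_parameters` keeps the receiving agent's model independent of the sender's, so later updates on one device do not silently change the other.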

We use Quanser's Qube-servo 2 as the real device and CartPole from OpenAI Gym as the simulation environment.

Execute the Proposed Federated Reinforcement Learning in Simulation Environment

Set up the hyper-parameters
- modify main_constants.py in the "rl_main" folder
  - e.g., the number of workers, whether to use gradient sharing, whether to use transfer learning, and PPO's hyper-parameters
Execution
- python main.py
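As a rough illustration of the kinds of settings listed above, a constants module might look like the sketch below. Every name and value here is hypothetical; check the real main_constants.py for the identifiers it actually defines. The PPO values are common illustrative defaults, not this repository's settings.

```python
# Hypothetical layout for a constants module such as rl_main/main_constants.py.
# The real file may use different names and values.
NUM_WORKERS = 3                 # number of workers
USE_GRADIENT_SHARING = True     # whether workers share gradients
USE_TRANSFER_LEARNING = True    # whether mature parameters are transferred

# PPO hyper-parameters (illustrative defaults only)
PPO_GAMMA = 0.99                # discount factor
PPO_CLIP_EPSILON = 0.2          # clipping range of the surrogate objective
PPO_LEARNING_RATE = 3e-4        # optimizer step size
PPO_EPOCHS = 4                  # optimization passes per batch


def validate_constants() -> None:
    """Fail early on settings that cannot work."""
    if NUM_WORKERS < 1:
        raise ValueError("NUM_WORKERS must be at least 1")
    if not 0.0 < PPO_GAMMA <= 1.0:
        raise ValueError("PPO_GAMMA must be in (0, 1]")
    if not 0.0 < PPO_CLIP_EPSILON < 1.0:
        raise ValueError("PPO_CLIP_EPSILON must be in (0, 1)")
    if PPO_LEARNING_RATE <= 0.0:
        raise ValueError("PPO_LEARNING_RATE must be positive")
    if PPO_EPOCHS < 1:
        raise ValueError("PPO_EPOCHS must be at least 1")


validate_constants()
```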

Environment Configuration

1. Create Environment

conda create -n rl python=3.6
conda activate rl
pip install --upgrade pip
pip install -r requirements.txt
Install PyTorch
- see https://pytorch.org/
Install baselines
- see https://github.com/openai/baselines

2. OpenAI Gym Install

git clone https://github.com/openai/gym.git
cd gym
pip install -e '.[all]'
- MuJoCo-related errors during this step can be ignored

3. Package Install & requirements.txt Configuration

pip freeze > requirements.txt

4. Mosquitto Install

Install Mosquitto
- Mac: brew install mosquitto
- Linux: https://blog.neonkid.xyz/127
Execute the mosquitto service
- For Mac
  - /usr/local/sbin/mosquitto -c /usr/local/etc/mosquitto/mosquitto.conf
- For Linux
  - mosquitto
Test subscribing
- mosquitto_sub -h [address] -p [port] -t [topic]
- mosquitto_sub -h 127.0.0.1 -p 1883 -t "topic"
Test publishing
- mosquitto_pub -h [address] -p [port] -t [topic] -m [content]
- mosquitto_pub -h 127.0.0.1 -p 1883 -t "topic" -m "test message"
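The `-m [content]` argument is a plain string, so any structured data sent through the broker has to be serialized first. The sketch below assumes a JSON payload carrying a worker id, an episode number, and gradients; this message format and both function names are hypothetical and are not taken from the repository.

```python
import json
from typing import Any, Dict, List

REQUIRED_KEYS = {"worker_id", "episode", "gradients"}


def encode_gradient_message(
    worker_id: int, episode: int, grads: Dict[str, List[float]]
) -> str:
    """Serialize one worker update into a string usable as an MQTT payload."""
    message = {"worker_id": worker_id, "episode": episode, "gradients": grads}
    return json.dumps(message, sort_keys=True)


def decode_gradient_message(payload: str) -> Dict[str, Any]:
    """Parse a payload and check that the expected fields are present."""
    message = json.loads(payload)
    if not isinstance(message, dict):
        raise ValueError("payload must be a JSON object")
    missing = REQUIRED_KEYS - message.keys()
    if missing:
        raise ValueError(f"payload is missing fields: {sorted(missing)}")
    return message


# Round trip: what a worker would publish and what a subscriber would read.
payload = encode_gradient_message(0, 12, {"policy.w": [0.1, -0.2]})
restored = decode_gradient_message(payload)
```

A string produced this way could be passed as the message body to `mosquitto_pub -m` for manual testing, and the output of `mosquitto_sub` on the same topic could be fed to `decode_gradient_message`.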

5. Execution

Execute chief
- python main_only_chief.py
Execute worker
- python main_only_one_worker.py
Execute main for the test
- OpenAI Gym CartPole
- python main.py
