
aig-upf/Partition-HRL


How to run it?

Clone this repo and go to protocols/ to select a protocol for your experiment. Let's say you have chosen protocol 7. Then execute:

```bash
cd RL
virtualenv venv
source venv/bin/activate
pip install -r requirements.txt
python . 7   # replace "7" with the name of your protocol
```

How does it work?

In this repo you can use or implement your own Reinforcement Learning algorithms using a hierarchical approach (see, for instance, this post on The Gradient).

Hierarchical learning is a method for solving RL problems efficiently, especially when exploration of your environment is hard. The main idea is to break down a complex task into smaller ones: "HRL [...] operates on different levels of temporal abstraction". The manager plans abstract macro actions and activates workers (also called options) to accomplish each of them; the workers interact directly with the environment. The manager can also call a special option for exploring the environment.
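As a rough illustration, the interaction loop could look like the sketch below. The names (manager, select_option, act, is_terminated, update) are purely hypothetical and do not correspond to classes in this repository; the step signature shown is the classic gym 4-tuple.

```python
# Illustrative manager/option control loop (hypothetical names, classic gym API).
def run_episode(env, manager, exploring_option):
    state = env.reset()
    done = False
    while not done:
        # The manager plans an abstract macro action by picking an option,
        # or falls back to the special exploring option.
        option = manager.select_option(state) or exploring_option
        # The chosen option interacts directly with the environment
        # until it terminates (or the episode ends).
        while not done and not option.is_terminated(state):
            action = option.act(state)
            state, reward, done, _ = env.step(action)
            option.update(state, reward)
        manager.update(state)
```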

Within this repo, you can create your own HRL algorithm by inheriting from the AbstractManager class (see the abstract/manager/manager.py file). Then you just need to code your own manager's policy, the options' policy and the exploring option; a minimal skeleton is sketched after this paragraph. Once this is done, create your own protocol, give it a name, set the relevant parameters and execute python . name-of-your-protocol at the root of the project. The abstract classes (in the abstract folder) list the methods you have to implement for the manager and the options.
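Below is a hypothetical skeleton of such a subclass. AbstractManager and its file location come from this repository, but the method names shown are assumptions made for illustration; check abstract/manager/manager.py for the actual abstract methods you must implement.

```python
# Hypothetical skeleton of a custom manager. The import path mirrors
# abstract/manager/manager.py; the method names below are assumptions,
# not necessarily the repository's actual abstract methods.
from abstract.manager.manager import AbstractManager


class MyManager(AbstractManager):
    def new_option(self):
        # Return a worker (option) implementing the options' policy.
        raise NotImplementedError

    def new_explore_option(self):
        # Return the special option used to explore the environment.
        raise NotImplementedError

    def update_policy(self, state, option, reward):
        # Update the manager's high-level (macro-action) policy.
        raise NotImplementedError
```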

Some examples are available in the baseline folders. In a2c, for instance, we use the A2C algorithm for the options and a planning strategy for the manager at the top level. We welcome new managers or options that expand our set of baseline strategies.

About the environment and the manager's abstract representation

Your environment has to inherit from gym's Env class. You can set its name in your protocol's parameters. Our manager and options use the information returned by the environment through its step function. This function returns (among other things) the new state of the environment when a given action is performed from another state. You will need to transform these states before feeding them to your agent's policy. For this purpose, we provide an abstract wrapper class, ObsPixelWrapper (see also some examples in the wrapper folder). You can use this class to, for instance, turn the raw pixels of a state into a grayscale, downsampled state, as in the sketch below.
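Here is one possible wrapper sketch. ObsPixelWrapper comes from this repository, but the import path and the overridden observation method are assumptions for illustration; the actual hook to override may differ, so check the wrapper folder.

```python
# Hypothetical pixel wrapper: converts raw RGB frames into small grayscale states.
# The import path and the observation() hook are assumptions for illustration.
import cv2
import numpy as np

from wrapper.obs_pixel_wrapper import ObsPixelWrapper  # assumed module path


class GrayDownsampleWrapper(ObsPixelWrapper):
    def observation(self, obs):
        gray = cv2.cvtColor(obs, cv2.COLOR_RGB2GRAY)                      # drop color channels
        small = cv2.resize(gray, (84, 84), interpolation=cv2.INTER_AREA)  # downsample
        return small.astype(np.uint8)
```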

Made with OpenAI Gym, Python and OpenCV.
