GitHub - mbhenaff/neural-e3

Explicit Explore-Exploit Algorithms in Continuous State Spaces

Code for the Neural-E3 algorithm described in the paper Explicit Explore-Exploit Algorithms in Continuous State Spaces. All code was written with Python 3.6 and PyTorch v1.1.0.

To run Neural-E3 on the combination lock tasks, do:

cd combolock/
python Experiment.py --alg neural-e3 --horizon 5 --lr 0.01 --n_playouts 200 --n_ensemble 5 --seed 1 --env_param_1 0.1

To run Neural-E3 on continuous control tasks, do:

python train_e3.py -env {mountaincar, acrobot} -n_exploration_epochs 10 -n_trajectories 10 -eps 0.1 -n_model_updates 200 -n_exploration_epochs 10

To run it on maze tasks, do:

cd src/
python train_e3.py -env {maze5, maze10, maze20} -n_exploration_epochs 10  -lambda_r 0.01

The code for running the baseline algorithms (DQN, PPO and PPO+RND) is in the 'baselines' subfolder. The DQN uses the OpenAI Baselines repo and the PPO+RND code is built on the DeepRL Pytorch repo. The combination lock environment is built on the codebase of Du et al..

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
baselines		baselines
combolock		combolock
README		README
README.md		README.md
agents.py		agents.py
arguments.py		arguments.py
environment.py		environment.py
experience.py		experience.py
maze.py		maze.py
models.py		models.py
plotting.py		plotting.py
search.py		search.py
train_e3.py		train_e3.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

baselines

baselines

combolock

combolock

README

README

README.md

README.md

agents.py

agents.py

arguments.py

arguments.py

environment.py

environment.py

experience.py

experience.py

maze.py

maze.py

models.py

models.py

plotting.py

plotting.py

search.py

search.py

train_e3.py

train_e3.py

utils.py

utils.py

Repository files navigation

Explicit Explore-Exploit Algorithms in Continuous State Spaces

About

Releases

Packages

Contributors 2

Languages

mbhenaff/neural-e3

Folders and files

Latest commit

History

Repository files navigation

Explicit Explore-Exploit Algorithms in Continuous State Spaces

About

Topics

Resources

Stars

Watchers

Forks

Languages