ProxHSPGA

Introduction

This package implements ProxHSPGA and its non-composite variant, HSPGA, for solving several optimization problems in reinforcement learning.

Dependency

Install rllab by following the rllab installation instructions.
Install the latest development versions of Theano and Lasagne from GitHub, because the stable releases do not work with this code.

pip install --upgrade https://github.com/Theano/Theano/archive/master.zip
pip install --upgrade https://github.com/Lasagne/Lasagne/archive/master.zip
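After installing, it may help to confirm that the dependencies are visible to the Python interpreter you plan to use. The following is a minimal sketch, not part of this package: the helper name find_missing is hypothetical, and the module names rllab, theano, and lasagne are assumed to match the packages listed above.

```python
import importlib.util

# Module names assumed from the Dependency list above.
REQUIRED_MODULES = ("rllab", "theano", "lasagne")


def find_missing(modules=REQUIRED_MODULES):
    """Return the names in `modules` that cannot be located for import."""
    missing = []
    for name in modules:
        try:
            # find_spec returns None when a top-level module is not installed.
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            found = False
        if not found:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = find_missing()
    if missing:
        print("Missing dependencies: " + ", ".join(missing))
    else:
        print("All listed dependencies were found.")
```

This only checks that the modules can be located. It does not verify that the installed versions are the development versions required above.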

Note

For MuJoCo environments, you need to acquire a MuJoCo license in order to install mujoco-py.

Code Usage

We hope that this program will be useful to others, and we would like to hear about your experience with it. If you find it helpful and use it in your own software, you are highly encouraged to cite the following publication:

N. H. Pham, L. M. Nguyen, D. T. Phan, P. H. Nguyen, M. van Dijk, and Q. Tran-Dinh, A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning, The 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020), Palermo, Italy, 2020.

Feel free to send feedback and questions about the package to our maintainer Nhan H. Pham at nhanph@live.unc.edu.
