Instance Weighted Incremental Evolution Strategies (IW-IES)

This repo contains the code accompanying the paper: Zhi Wang, Chunlin Chen, and Daoyi Dong, "Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments", IEEE Transactions on Neural Networks and Learning Systems, 2022. It includes code for running the incremental learning tasks in the 2D navigation, Swimmer, Hopper, and HalfCheetah domains. The underlying reinforcement learning algorithms are implemented with natural evolution strategies.
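For readers unfamiliar with evolution strategies, the following is a minimal sketch of one plain ES update on a toy objective. It is not the code in es.py and does not include the instance weighting from the paper; the function names (nes_step, toy_fitness) and all hyperparameter values are assumptions chosen for illustration.

```python
import numpy as np


def nes_step(theta, fitness_fn, rng, pop_size=50, sigma=0.1, lr=0.02):
    """One plain evolution-strategies update on a flat parameter vector.

    Illustrative only: names and default values are assumptions,
    not taken from this repository.
    """
    # Sample Gaussian perturbations, one row per population member.
    eps = rng.standard_normal((pop_size, theta.size))
    # Evaluate the fitness (return) of every perturbed parameter vector.
    returns = np.array([fitness_fn(theta + sigma * e) for e in eps])
    # Standardize returns so the update is insensitive to the reward scale.
    adv = (returns - returns.mean()) / (returns.std() + 1e-8)
    # Monte Carlo estimate of the search gradient.
    grad = eps.T @ adv / (pop_size * sigma)
    # Gradient ascent on expected fitness.
    return theta + lr * grad


def toy_fitness(x):
    # Hypothetical objective with its maximum at (3, 3).
    return -np.sum((x - 3.0) ** 2)


rng = np.random.default_rng(0)
theta = np.zeros(2)
for _ in range(300):
    theta = nes_step(theta, toy_fitness, rng)
```

In a reinforcement learning setting, fitness_fn would be replaced by a rollout that returns the episode return of a policy with parameters theta.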

Dependencies

This code requires the following:

Python 3.5+
PyTorch 0.4+
gym
MuJoCo license

Data

For the 2D navigation domain, data is generated by envs/navigation.py.
For the Swimmer/Hopper/HalfCheetah MuJoCo domains, the modified MuJoCo environments are in envs/mujoco/*.

Usage

For example, for Case I of the navigation domain, run the bash script navi_v1_iwies.sh to get the results of IW-IES and its ablation methods; see the usage instructions in the script and in main.py. Run the bash script navi_v1_baselines.sh to get the results of the baselines, including Robust, Hist, SO-CMA, and ES-MAML; see the usage instructions in the script and in baselines.py.
Once the results have been written to output/*/*.npy files, plot them using data_process.py. For example, the results for the navigation domains are as follows:

[Figures: Case I | Case II | Complex Case]
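Before plotting, it can help to check what the .npy files under output/ contain. The sketch below only assumes the output/*/*.npy layout mentioned above; it is not part of data_process.py, the helper names (load_results, summarize) are hypothetical, and no particular array shape is assumed.

```python
import glob
import os

import numpy as np


def load_results(root="output"):
    """Load every <root>/<subdir>/<name>.npy file into a dict.

    Keys are paths relative to `root`. Hypothetical helper, not part
    of this repository.
    """
    results = {}
    pattern = os.path.join(root, "*", "*.npy")
    for path in sorted(glob.glob(pattern)):
        results[os.path.relpath(path, root)] = np.load(path)
    return results


def summarize(results):
    """Print the shape and dtype of each loaded array."""
    for name, arr in results.items():
        print(f"{name}: shape={arr.shape}, dtype={arr.dtype}")
```

Calling summarize(load_results()) from the repository root prints one line per result file, which makes it easier to see which arrays data_process.py will read.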

Note that these results are from a single run of the code. You can randomly change the environment to a new one and record the performance of all tested methods when adapting to the new environment. In our paper, we repeat the process ten times and report the mean and standard error to demonstrate the performance for learning in stochastic dynamic environments. For example, the results for Case I of the navigation domain and for the Swimmer domain are as follows:

[Figures: navigation_v1 | swimmer]
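The mean and standard error over repeated runs can be computed as in the sketch below. It assumes the learning curves of the repeated runs are stacked into one array of shape (n_runs, n_steps) and uses the sample standard deviation (ddof=1); both choices, the function name mean_and_stderr, and the synthetic data are assumptions, not details taken from the paper or from data_process.py.

```python
import numpy as np


def mean_and_stderr(runs):
    """Per-step mean and standard error across repeated runs.

    `runs` has shape (n_runs, n_steps). Hypothetical helper,
    not part of this repository.
    """
    runs = np.asarray(runs, dtype=float)
    mean = runs.mean(axis=0)
    # Standard error of the mean: sample std (ddof=1) over sqrt(n_runs).
    stderr = runs.std(axis=0, ddof=1) / np.sqrt(runs.shape[0])
    return mean, stderr


# Synthetic placeholder data standing in for ten runs of 100 steps each;
# these are NOT results from the paper.
rng = np.random.default_rng(0)
fake_runs = rng.normal(loc=1.0, scale=0.5, size=(10, 100))
mean, stderr = mean_and_stderr(fake_runs)
```

The resulting mean and stderr arrays can be drawn as a line with a shaded band, for example with matplotlib's fill_between.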

The results for the other demo scripts are shown in exp/*.

Contact

To ask questions or report issues, please open an issue on the issue tracker, or email zhiwang@nju.edu.cn.
