MAGE 🔮

The authors' implementation of the MAGE algorithm from "How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization".

Cite as

Pierluca D'Oro, Wojciech Jaśkowski. "How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization". In: NeurIPS, 2020.

@inproceedings{doro2020howto,
    title={How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization},
    author={D'Oro, Pierluca and Ja{\'s}kowski, Wojciech},
    booktitle={NeurIPS},
    year={2020},
  }

Install

You should already have them, but just in case, install the libs:

sudo apt install libosmesa6-dev libgl1-mesa-glx libglfw3 patchelf

Create conda environment with the required dependencies:
```
conda env create -f conda_env.yml
```

Download and setup MuJoCo 1.50:

mkdir ~/.mujoco/
cd .mujoco/
wget -c https://www.roboti.us/download/mujoco150_linux.zip
unzip mjpro150_linux.zip
rm mjpro150_linux.zip

Obtain MuJoCo license key and place it .mujoco/ directory created above with filename mjkey.txt.

Append the following to ~/.bashrc:

# MuJoCo
if [ -f /usr/lib/x86_64-linux-gnu/libGLEW.so ]; then
    export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$HOME/.mujoco/mujoco150/bin:/usr/lib/nvidia-390:/usr/lib/nvidia-375
    export LD_PRELOAD=/usr/lib/x86_64-linux-gnu/libGLEW.so
fi

Test the MuJoCo installation:

>>> import gym
>>> gym.make('HalfCheetah-v2')

(Optional) Create a neptune.ai account for logging. Setup your Neptune:

export NEPTUNE_API_TOKEN=<your neptune.ai token>

Running

MAGE-TD3 (ours):

python main.py with env_name=GYMMB_HalfCheetah-v2 agent_alg=td3 tdg_error_weight=5. td_error_weight=1. neptune_project=<optionally_your_neptune_project_name>

(Note: tdg_error_weight=5 corresponds to lambda=0.2 in the paper)

Dyna-TD3 (model-based baseline):

python main.py with env_name=GYMMB_HalfCheetah-v2 agent_alg=td3 tdg_error_weight=0. td_error_weight=1. neptune_project=<optionally_your_neptune_project_name>

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
envs		envs
.gitignore		.gitignore
README.md		README.md
buffer.py		buffer.py
conda_env.yml		conda_env.yml
ddpg.py		ddpg.py
env_loop.py		env_loop.py
imagination.py		imagination.py
logger.py		logger.py
mage_logo.png		mage_logo.png
main.py		main.py
metriclogger.py		metriclogger.py
models.py		models.py
normalizer.py		normalizer.py
radam.py		radam.py
regression_tests.py		regression_tests.py
reward_model.py		reward_model.py
sacred_utils.py		sacred_utils.py
td3.py		td3.py
utils.py		utils.py
video_recorder.py		video_recorder.py
wrappers.py		wrappers.py

nnaisense/MAGE

Folders and files

Latest commit

History

Repository files navigation

MAGE 🔮

Cite as

Install

Running

About

Resources

Stars

Watchers

Forks

Languages