tgangwani/QDAgents

This repository contains code for our paper Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity, published at the Conference on Robot Learning (CoRL), 2020.

The code builds heavily on the RL machinery from ikostrikov/pytorch-a2c-ppo-acktr-gail, which implements RL algorithms in PyTorch, and it also uses some functionality from OpenAI baselines. The different policies of the ensemble run in separate MPI ranks and communicate over MPI. We additionally provide a self-imitation option for the policy gradient, as proposed in Learning Self-Imitating Diverse Policies.
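For reference, the sketch below is a minimal mpi4py illustration of this one-policy-per-rank layout; it is not the repository's actual communication code, and the variable names are placeholders.

# Minimal sketch of the one-policy-per-rank layout described above.
# Illustrative only; the repository's actual communication code differs.
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank = comm.Get_rank()            # index of this policy in the ensemble
ensemble_size = comm.Get_size()   # one policy per MPI rank

# Each rank would compute some quantity for its own policy (placeholder here) ...
local_stat = float(rank)

# ... and gather the corresponding values from every other ensemble member.
all_stats = comm.allgather(local_stat)
print("rank %d of %d sees %s" % (rank, ensemble_size, all_stats))

Launched with, e.g., mpirun -np 4 python sketch.py, each rank prints the gathered list [0.0, 1.0, 2.0, 3.0], ordered by rank.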

The code was tested with the following packages:

  • python 3.6.6
  • pytorch 0.4.1
  • gym 0.10.8
  • mpi4py 3.0.0

Running the code

To run the MuJoCo experiments, use the command below, where $MPI_RANKS is the number of policies in the ensemble (one policy per MPI rank). Edit default_config.yaml to change the hyperparameters.

mpirun -np $MPI_RANKS python main.py --env-name "SparseCheetah-v2" --config-file "default_config.yaml" --seed=$RANDOM

The "SparseCheetah-v2" environment is created by modifying the "HalfCheetah-v2" from OpenAI-Gym. Please see the paper for details.

Credits

  1. ikostrikov/pytorch-a2c-ppo-acktr-gail
  2. OpenAI baselines
