Value Disagreement Sampling (VDS)

This codebase is adapted from Openai baselines.

Run experiments

In project directory, run

python -m baselines.ve_run --alg=her --env=FetchPush-v1 --num_timesteps=500000 \
--size_ensemble=3 --log_path=./data/test_fetch_push

Baselines

OpenAI Baselines is a set of high-quality implementations of reinforcement learning algorithms.

These algorithms will make it easier for the research community to replicate, refine, and identify new ideas, and will create good baselines to build research on top of. Our DQN implementation and its variants are roughly on par with the scores in published papers. We expect they will be used as a base around which new ideas can be added, and as a tool for comparing a new approach against existing ones.

Prerequisites

Baselines requires python3 (>=3.5) with the development headers. You'll also need system packages CMake, OpenMPI and zlib. Those can be installed as follows

Ubuntu

sudo apt-get update && sudo apt-get install cmake libopenmpi-dev python3-dev zlib1g-dev

Mac OS X

Installation of system packages on Mac requires Homebrew. With Homebrew installed, run the following:

brew install cmake openmpi

Virtual environment

From the general python package sanity perspective, it is a good idea to use virtual environments (virtualenvs) to make sure packages from different projects do not interfere with each other. You can install virtualenv (which is itself a pip package) via

pip install virtualenv

Virtualenvs are essentially folders that have copies of python executable and all python packages. To create a virtualenv called venv with python3, one runs

virtualenv /path/to/venv --python=python3

To activate a virtualenv:

. /path/to/venv/bin/activate

More thorough tutorial on virtualenvs and options can be found here

Tensorflow versions

The master branch supports Tensorflow from version 1.4 to 1.14. For Tensorflow 2.0 support, please use tf2 branch.

Installation

Clone the repo and cd into it:

git clone https://github.com/openai/baselines.git
cd baselines

If you don't have TensorFlow installed already, install your favourite flavor of TensorFlow. In most cases, you may use
```
pip install tensorflow-gpu==1.14 # if you have a CUDA-compatible gpu and proper drivers
```
or
```
pip install tensorflow==1.14
```
to install Tensorflow 1.14, which is the latest version of Tensorflow supported by the master branch. Refer to TensorFlow installation guide for more details.
Install baselines package
```
pip install -e .
```

MuJoCo

Some of the baselines examples use MuJoCo (multi-joint dynamics in contact) physics simulator, which is proprietary and requires binaries and a license (temporary 30-day license can be obtained from www.mujoco.org). Instructions on setting up MuJoCo can be found here

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commits
baselines		baselines
.gitignore		.gitignore
.travis.yml		.travis.yml
Dockerfile		Dockerfile
README.md		README.md
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

baselines

baselines

.gitignore

.gitignore

.travis.yml

.travis.yml

Dockerfile

Dockerfile

README.md

README.md

setup.cfg

setup.cfg

setup.py

setup.py

Repository files navigation

Value Disagreement Sampling (VDS)

Run experiments

Baselines

Prerequisites

Ubuntu

Mac OS X

Virtual environment

Tensorflow versions

Installation

MuJoCo

About

Releases

Packages

Languages

zzyunzhi/vds

Folders and files

Latest commit

History

Repository files navigation

Value Disagreement Sampling (VDS)

Run experiments

Baselines

Prerequisites

Ubuntu

Mac OS X

Virtual environment

Tensorflow versions

Installation

MuJoCo

About

Topics

Resources

Stars

Watchers

Forks

Languages