Skip to content

h-aboutalebi/SparceReward

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SparceReward

This package provides comprehensive tools for examining rl algorithms on sparce and non-sparce environments.

At current version, we only offer Mujoco environment for tests. Current supported algorithms are as follows:

1- https://arxiv.org/pdf/1801.01290.pdf

2- https://arxiv.org/pdf/1706.01905.pdf

3- https://arxiv.org/pdf/1509.02971.pdf

4- ...

In the directory engine/reward_modifier, we provide different sparcity version for environments.

The typical code to run the program is as follows:

python3 main.py

If you want to change the environment you should type:

python3 main.py --env_name Ant-v2 The environments by defualt are non-sparce. You can make argument sparce by using the following command:

python3 main.py --env_name Ant-v2 --sparse_reward --threshold_sparcity 0.05 where --sparse_reward makes the environment's reward sparce and --threshold_sparcity determines the extent of sparcity.

If you want to change the algorithm to run:

python3 main.py --env_name --algo SAC

Defualt algorithm is DDPG. Current supported algorithms are: --algo SAC, --algo DDPG, --algo DDPG_PARAM

If you want to change the parameters of specific algorithm (for example SAC) you should write:

python3 main.py --env_name --algo SAC --tau_sac 1

which changes the tau parameter of SAC algorithm. Detailed information regarding the different setting of algorithms can be found in main.py argparser. We have provided detailed documentation for each parameter.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages