
Model-free RL exploration based on state perturbation


Curiouskid0423/rho_exploration

Neighboring States-based RL Exploration

This repository accompanies this arXiv paper. All training scripts are located in the scripts/ directory. More extensive experiments with other agents (e.g. SAC, PPO) are planned.

Hyperparameters to consider (as of Dec 2022)

| Hyperparameter | Values |
| --- | --- |
| learning_rate | 1e-3, 5e-4 |
| rho (perturbation margin) | 3%, 5%, 7%, 10%, 20% |
| rho_sample | 10, 20, 30 |
| lambda (look-ahead steps) | 1, 10 |
| sample_heuristics | max, mode of top percentile |
| model-free algo | DQN (Actor Critic, SAC, PPO) |
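
The search space listed above can be enumerated as a plain grid before launching runs. The snippet below is a minimal sketch, not the repository's actual configuration code: the dictionary keys (`lambda_steps`, `mode_top_percentile`), the helper `iter_configs`, and the choice to store rho as a fraction instead of a percentage are all assumptions made for illustration. Only the values themselves come from the table.

```python
from itertools import product

# Values copied from the hyperparameter table; key names are illustrative
# assumptions and may not match the names used by the scripts in scripts/.
SEARCH_SPACE = {
    "learning_rate": [1e-3, 5e-4],
    "rho": [0.03, 0.05, 0.07, 0.10, 0.20],  # perturbation margin, as a fraction
    "rho_sample": [10, 20, 30],
    "lambda_steps": [1, 10],  # look-ahead steps ("lambda" is a Python keyword)
    "sample_heuristics": ["max", "mode_top_percentile"],
}


def iter_configs(space):
    """Yield one dict per point in the Cartesian product of the search space."""
    keys = list(space)
    for values in product(*(space[key] for key in keys)):
        yield dict(zip(keys, values))


configs = list(iter_configs(SEARCH_SPACE))
print(len(configs))  # 2 * 5 * 3 * 2 * 2 = 120 combinations
```

A full grid over these values gives 120 runs per agent, so in practice it may be worth fixing one or two hyperparameters, or sampling a subset of `configs`, before training.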
