Skip to content

VasaKiDD/TD3-deep-rl-research

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TD3-deep-rl-research

This repo is originally cloned from https://github.com/sfujim/TD3 which implement TD3 Deep Reinforcement algorithm along with a re-implementation of DDPG.

The idea was to try improve the buffers with entropy maximisation of experience selection combined with a prioritized experience replay:

I used the principle of Virtual Reward, which combined entropy maximisation of observation (or states) with temporal difference errors in the buffer. The initial intuition is that it will reduce varience in the training of the agent, because the training step will try to maximize jointly diversity in observation and error in value function prediction.

For DDPG/TD3 with entropy/error maximisation training:

python train/main_faimemory.py

For original TD3/DDP training:

python train/main.py

About

Playing along with buffer in TD3 Deep Reinforcement learning

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published