--Self Generate Expert Experience/SGEE

This code combine DDPG Algorithm and Behavior Clone methods,which integrate off and on-policy training process. After one episode on-policy train, algorithm generate expert samples with current parameters and feed the off-policy train. For it can produce expert experient by itself, so we call it SGEE. The implementation of DDPG refer to sweetice's code.>>https://github.com/sweetice/Deep-reinforcement-learning-with-pytorch

Reference CONTINUOUS CONTROL WITH DEEP REINFORCEMENT LEARNING>>https://arxiv.org/abs/1509.02971 Self Lmitation Learning>>https://arxiv.org/abs/1806.05635

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

--Self Generate Expert Experience/SGEE

Files

README.md

Latest commit

History

README.md

File metadata and controls

--Self Generate Expert Experience/SGEE