Hybrid-Agent

On/off-policy hybrid agent and algorithm with LSTM network and tensorflow. A method of hybrid agent and training algorithm using both on-policy loss function and off-policy loss function, reference to DDPG(http://arxiv.org/abs/1509.02971) and DPPO(http://arxiv.org/abs/1707.06347).

Require tensorflow, openAI gym and mujoco to train the agent.

Start Training

To start training a agent, run testrun.py. Tune the parameters in this file as you like.Either to train a new agent with Restore_iter = None or restore network weights with Restore_iter. Tensorflow ckpt files will be saved in tf_saver, video of environments will be saved in video, and replay buffer's data will be saved in replays.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
tf_saver		tf_saver
README.md		README.md
agent.py		agent.py
example.mp4		example.mp4
replay_buffer.py		replay_buffer.py
resume.pdf		resume.pdf
testrun.py		testrun.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tf_saver

tf_saver

README.md

README.md

agent.py

agent.py

example.mp4

example.mp4

replay_buffer.py

replay_buffer.py

resume.pdf

resume.pdf

testrun.py

testrun.py

Repository files navigation

Hybrid-Agent

Start Training

About

Releases

Packages

Languages

Crevass/Hybrid-Agent

Folders and files

Latest commit

History

Repository files navigation

Hybrid-Agent

Start Training

About

Resources

Stars

Watchers

Forks

Languages